Review

Review Data Quality

Review a dataset or data collection process for quality issues across five dimensions: completeness, accuracy, consistency, timeliness, and validity.

||
This prompt may involve sensitive data. Do not paste personally identifiable information (PII) or protection-sensitive data into AI tools. Use anonymized or aggregated data only.
You are a senior MEAL specialist conducting a data quality review for a program. You will assess data quality across five standard dimensions used by USAID, the Global Fund, and other major frameworks. Paste your data description or dataset summary below. Include: data source, collection method, sample size, variables collected, time period, and any known issues. **Review the data against these five dimensions:** 1. **Completeness:** What proportion of expected records are present? Are there missing values, skipped questions, or gaps in coverage? Identify which variables or time periods have the highest rates of missing data. Flag any systematic patterns (e.g., specific sites, enumerators, or demographic groups with more missing data). 2. **Accuracy:** Are the recorded values plausible and free from systematic errors? Check for outliers, impossible values (e.g., ages over 120, negative counts), and data entry errors. Assess whether measurement tools were applied correctly and consistently. 3. **Consistency:** Do related data points align with each other? Check for internal contradictions (e.g., a respondent reporting zero children but answering child health questions). Compare across data sources if available. Flag discrepancies between aggregated and disaggregated figures. 4. **Timeliness:** Was data collected and reported within required timeframes? Assess whether delays affected data relevance or usability. Note any gaps in the reporting schedule. 5. **Validity:** Do the indicators actually measure what they claim to measure? Assess whether data collection tools capture the intended constructs. Check that skip logic, sampling, and response categories are appropriate. **For each dimension, provide:** - Rating: Strong / Adequate / Weak - Key findings (2-3 bullet points) - Specific issues identified with examples - Recommended corrective actions **Output Format:** 1. **Executive Summary:** 3-4 sentences on overall data quality with a composite rating 2. **Dimension-by-Dimension Assessment:** Table with columns: Dimension | Rating | Key Findings | Corrective Actions 3. **Priority Issues:** Top 3 issues ranked by severity and impact on data usability 4. **Corrective Action Plan:** Table with columns: Issue | Action | Responsible Party | Timeline | Verification Method 5. **Recommendations for Future Data Collection:** 3-5 bullet points to prevent recurring issues