Scoring Criteria
SMART indicators at every output and outcome level. Each indicator names a specific metric, measurable unit, target population, and time period. Every indicator has a target with a baseline reference. Disaggregation variables specified per indicator where relevant.
Indicators at every output and outcome level. No more than 20 percent miss a single SMART element (typically time dimension or specificity). Every indicator has a target. Baselines specified for at least 80 percent. Disaggregation specified for major indicators.
Indicators at every output and outcome level. Half or more are fully operationalizable; the remainder are vague on metric, target population, or time. Baselines missing for half or more. Disaggregation specified inconsistently.
Half or more indicators are proxy measures or unmeasurable as stated, OR one or more results levels has no indicator, OR more than half lack targets.
Indicators absent or unmeasurable as stated. No targets.
Every indicator has a specific named tool, a stated collection frequency, and a named responsible position. The system is feasible given documented staffing and geography.
Every indicator has a specific tool and frequency. Named responsible position missing for no more than 20 percent of indicators.
Every indicator has a tool and frequency, but at least one of (tool specificity, frequency, named position) is missing for half or more of indicators.
Tools listed but generic for half or more of indicators (e.g., "project records" with no specified document), OR frequency missing for half or more, OR no positions named at all.
No collection system described.
DQA section explicitly addresses all three quality dimensions (validity, reliability, completeness) before data is used. Includes a named spot-check or verification process with a named responsible position and stated frequency.
DQA section addresses two of three quality dimensions with named processes. Spot-check process included, though one element (responsible position, frequency, or sample size) may be unspecified.
DQA section addresses one of three quality dimensions with a named process. The other two are referenced but described in general terms. No specified spot-check process.
DQA section present but none of the three quality dimensions has a named process. Generic statements only ("data will be checked for accuracy").
No DQA section. Data quality is assumed.
Named positions assigned for all five functions (data collection, entry, analysis, reporting, decision-making). Supervision and escalation paths described.
Named positions assigned for at least four of five functions. Supervision or escalation path described, even if not both.
Named positions assigned for at least three of five functions, typically collection and reporting plus one other. Remaining functions unassigned or assigned to units rather than positions. No supervision or escalation path.
Named positions assigned for at most two of five functions, OR all assignments are at unit level ("the M&E team") rather than position level.
No roles assigned. The plan does not specify who does anything.
At least one review cycle defined with stated frequency, named participating positions, and the specific decisions it feeds into. At least one named adaptive management mechanism with stated trigger conditions for program adjustment. Learning documentation process named with responsible position.
Review cycle defined with stated frequency and named participants. Decision links named for at least one cycle. Adaptive management mechanism mentioned but trigger conditions unspecified. Learning documentation referenced.
At least one review cycle defined with a stated frequency. Participating positions or decision links are not specified for any cycle. Adaptive management referenced without trigger conditions or adjustment process.
Statement that "data will inform decisions" or equivalent, with no specification of cycle frequency, participants, or decision scope. No adaptive management mechanism.
No learning provisions. The plan addresses data collection only.
Score Interpretation
| Total (out of 25) | Band | Next Step |
|---|---|---|
| 22-25 | Strong | Minor refinements only |
| 17-21 | Adequate | Address flagged dimensions before submission |
| 11-16 | Needs Revision | Return to MEL team with AI output as revision brief |
| 5-10 | Substantial Revision | Redesign the MEL system before proceeding |