Scoring Criteria
Causal chain is explicit and directionally correct at every level (activity → output → outcome → goal). Each link has a stated rationale. A reviewer unfamiliar with the programme can trace the full logic without additional explanation.
Causal chain is mostly coherent with 1-2 gaps where a connection is assumed rather than explained. Overall logic is plausible.
Causal chain is partially coherent. Key levels are present and broadly sequenced correctly, but multiple links are assumed rather than explained and a reviewer would need supplementary information to follow the logic.
Several gaps in causal reasoning. Some outputs do not clearly lead to stated outcomes. Requires significant explanation to make sense.
No clear causal logic. Activities and outcomes appear unrelated. Reads as a list of deliverables, not a causal argument.
All indicators are specific (who, what, where), measurable, time-bound, and directly measure the stated result. Disaggregation variables specified where relevant.
Most indicators meet SMART criteria. 1-2 lack a time dimension or are slightly broad but measurable in practice.
Some indicators meet SMART criteria but roughly half are missing one or more elements (time dimension, specificity, or direct alignment to the result statement). Data collection would be possible but would require clarification on several indicators.
Several indicators are broad proxy measures. Missing time dimensions or disaggregation. Would require clarification before data collection.
Indicators are vague, unmeasurable, or do not correspond to the result statements. Cannot be operationalized without major revision.
Assumptions are stated at outcome and goal levels. Each is specific, testable, and genuinely external to programme control. Not trivially true.
Assumptions present at most levels. 1-2 vague but major dependencies identified.
Assumptions present at some levels. Key external dependencies are acknowledged but stated broadly, and some are trivially true or could be interpreted as programme activities rather than external conditions.
Incomplete: present at some levels but absent at others. Many trivially true or actually programme activities.
No assumptions stated. Major external dependencies have not been identified.
Hierarchy follows standard structure (activities produce outputs; outputs enable outcomes; outcomes contribute to goal). Each level clearly defined and consistently differentiated.
Mostly correct. 1-2 items at the wrong level. Overall structure is recognizable.
Overall hierarchy is recognizable but several items appear at the wrong level. The distinction between outputs and outcomes is inconsistently applied, and a reviewer could identify which level is intended only with additional context.
Multiple items at the wrong level. Outputs and outcomes consistently confused.
No recognizable hierarchy. Activities and outcomes collapsed into a single list.
Sources of verification specified per indicator. Collection frequency, responsible party, and method documented. At least one indicator per results level.
Sources listed for most indicators. Frequency or responsible party missing for 1-2. Most levels have indicator coverage.
Sources of verification listed for most indicators but are often generic. At least one of collection frequency, responsible party, or method is consistently missing. Most results levels have some indicator coverage.
Sources generic ("project records") without specifying how data will actually be collected. Several levels lack indicators.
No sources of verification. Major results levels have no indicators. Monitoring approach is not addressed.
Score Interpretation
| Total (out of 25) | Band | Next Step |
|---|---|---|
| 22-25 | Strong | Minor refinements only |
| 17-21 | Adequate | Address flagged dimensions before submission |
| 11-16 | Needs Revision | Return to design team with AI output as revision brief |
| 5-10 | Substantial Revision | Facilitate a design workshop before further drafting |