Scoring Criteria
Causal chain explicit at every link from activity to goal. Each link has a stated rationale. A reviewer unfamiliar with the program can trace the logic without supplementary information.
Causal chain explicit at every level, though up to two individual links may be assumed rather than stated. Overall logic is plausible.
Causal chain present but half or more links are assumed rather than stated. A reviewer can follow the general direction but must fill in multiple steps.
Half or more outputs do not clearly lead to their stated outcomes, OR causal direction is wrong at one or more levels.
No clear causal logic. Activities and outcomes appear unrelated.
All indicators name a specific metric, measurable unit, target population, and time period. Each directly measures the result statement. Disaggregation specified where relevant.
Nearly all indicators meet SMART criteria: no more than 20 percent miss a single element (typically time dimension or specificity).
Half or more indicators are fully SMART; the remainder miss one or more elements (time dimension, specificity, or direct alignment to result). Data collection feasible but requires clarification.
Half or more indicators are proxy measures, OR indicators do not align with their result statements, OR time dimensions or disaggregation are systematically absent.
Indicators vague, unmeasurable, or do not correspond to result statements.
Assumptions stated at every outcome and goal level. Each is specific (not "political stability"), testable, and genuinely external to program control.
Assumptions present at every outcome and goal level. No more than two are vague, and major dependencies are identified.
Assumptions present at half or more outcome and goal levels. Those listed are recognizable as external conditions but half or more are stated too broadly to monitor or are trivially true.
Assumptions present at fewer than half of outcome and goal levels, OR half or more are trivially true or are program activities.
No assumptions stated, or all stated items are program activities.
Standard four-level structure (activity, output, outcome, goal). Each level consistently differentiated. No items at the wrong level.
Standard structure recognizable. No more than two items at the wrong level.
Standard structure recognizable but at least three items at the wrong level. Distinction between outputs and outcomes inconsistently applied.
Half or more items at the wrong level, OR outputs and outcomes systematically conflated.
No recognizable hierarchy. All results listed at one level.
Every indicator has a specific named source of verification, collection frequency, and method. Responsible party named per indicator. Every results level has at least one indicator.
Every indicator has a source. No more than 20 percent miss frequency or responsible party. All results levels covered.
Every indicator has a source, but sources are generic for half or more (e.g., "project records"). At least one of frequency, responsible party, or method is missing for half or more indicators.
Half or more indicators have generic or no sources, OR several results levels lack indicators entirely.
No sources of verification documented. Major results levels have no indicators.
Score Interpretation
| Total (out of 25) | Band | Next Step |
|---|---|---|
| 22-25 | Strong | Minor refinements only |
| 17-21 | Adequate | Address flagged dimensions before submission |
| 11-16 | Needs Revision | Return to design team with the AI output as a revision brief |
| 5-10 | Substantial Revision | Facilitate a design workshop before further drafting |
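
The band lookup in the table above can be sketched in code. This is a minimal illustration, not part of the rubric itself: it assumes five dimensions scored 1 to 5 each (inferred from the total of 25 and the lowest band starting at 5), and the function name `interpret` and the dimension keys in the usage note are hypothetical labels chosen for the example.

```python
from typing import Mapping, Tuple

# Band floors and next steps copied from the Score Interpretation table.
# Ordered from highest floor to lowest so the first match wins.
BANDS = [
    (22, "Strong", "Minor refinements only"),
    (17, "Adequate", "Address flagged dimensions before submission"),
    (11, "Needs Revision", "Return to design team with AI output as revision brief"),
    (5, "Substantial Revision", "Facilitate a design workshop before further drafting"),
]


def interpret(scores: Mapping[str, int]) -> Tuple[int, str, str]:
    """Sum per-dimension scores and return (total, band, next step).

    Assumption: exactly five dimensions, each scored as an integer 1-5.
    """
    if len(scores) != 5:
        raise ValueError("expected exactly five dimension scores")
    for name, score in scores.items():
        if not 1 <= score <= 5:
            raise ValueError(f"score for {name!r} must be between 1 and 5")

    total = sum(scores.values())
    for floor, band, next_step in BANDS:
        if total >= floor:
            return total, band, next_step
    # Unreachable when the checks above pass, since the minimum total is 5.
    raise AssertionError("total fell below the lowest band")
```

For example, hypothetical scores of 4, 3, 4, 5, and 3 (keyed as `causal_logic`, `indicators`, `assumptions`, `structure`, `verification`) sum to 19, which falls in the Adequate band.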