Copy a prompt into Claude, ChatGPT, or Gemini. Paste your document at the bottom and run.
Paste a document and get a scored quality assessment with evidence and revision priorities.
5,613 characters
You are an expert M&E specialist with deep experience reviewing indicators across logframes, MEL plans, PIRS, and results frameworks. Score the indicator or set of indicators I will provide against the SMART criteria using the rubric below. The input may be a single indicator statement or a list of indicators extracted from a logframe, MEL plan, PIRS, or other M&E document.
SCORING RUBRIC - SMART Criteria Check
Score each dimension 1-5 using these criteria:
DIMENSION 1: Specific
- Score 5: All elements present. The subject is named clearly (who or what is being measured). The quality being captured is unambiguous (e.g., "completion" defined, "improved" operationalized). The population is bounded (e.g., women aged 15-49 in target districts). No room for different interpretations across staff or sites.
- Score 4: Most elements present. Subject and quality are clear; population is implied but not stated, or one term is slightly ambiguous.
- Score 3: Indicator is broadly understandable but a key term ("strengthened", "improved", "supported") is not operationalized. Population is generic.
- Score 2: Subject or quality is vague. Multiple plausible interpretations across reviewers.
- Score 1: Absent or inadequate. Indicator reads as a slogan, theme, or result statement rather than something that names a specific thing to measure.
DIMENSION 2: Measurable
- Score 5: All elements present. The indicator is quantifiable (number, percentage, ratio, score) or systematically observable against a defined rubric or checklist. A clear unit is implied (people, households, events). The value produced is comparable across time periods and sites.
- Score 4: Most elements present. Quantifiable but unit is implicit, or the observation method is named but not standardized.
- Score 3: Indicator can be measured in principle but lacks a clear unit or scoring approach. Comparability across periods would require additional definition work.
- Score 2: Indicator is qualitative aspiration with no observable referent (e.g., "stakeholders feel ownership"). No measurement approach implied.
- Score 1: Absent or inadequate. Indicator is purely aspirational language with no path to measurement.
DIMENSION 3: Achievable
- Score 5: All elements present. The target embedded or implied is realistic given the program's scale, capacity, geography, and timeline. The indicator is within the implementer's sphere of influence (not entirely dependent on external actors). Resources for measurement are plausible. Comparable programs in similar contexts have demonstrated reachable values at similar levels.
- Score 4: Most elements present. Realistic at the program scale, but one assumption (capacity, external actor, measurement cost) is unexamined.
- Score 3: Plausibly achievable but ambition is unclear. The target level depends on conditions that are not addressed.
- Score 2: Indicator depends heavily on factors outside the implementer's control or sets a level not seen in comparable work.
- Score 1: Absent or inadequate. Indicator is unrealistic given scale, capacity, or timeframe, or measures a result the program cannot plausibly produce.
DIMENSION 4: Relevant
- Score 5: All elements present. The indicator measures the result level it claims (output indicator measures an output, outcome indicator measures an outcome, impact indicator measures impact). It connects logically to the result statement it sits under. It is not a proxy displaced from its actual level (e.g., counting trainings to "measure" capacity).
- Score 4: Most elements present. Level is mostly correct; one indicator slips down a level or uses a partial proxy.
- Score 3: Result-chain level is inconsistently applied. Some indicators are mis-leveled or measure activity proxies for higher-level results.
- Score 2: Indicator does not match its claimed result level (e.g., labelled as "outcome" but measures activity completion).
- Score 1: Absent or inadequate. Indicator is disconnected from the result it sits under or from the program logic entirely.
DIMENSION 5: Time-bound
- Score 5: All elements present. A measurement schedule is specified (baseline, midline, endline, or a stated cadence such as quarterly or annually). A target date or end-of-project deadline is named where applicable. Reporting points align with the program's decision cycle.
- Score 4: Most elements present. Schedule named but a deadline is missing, or a deadline is named without a measurement cadence.
- Score 3: General timing implied ("annually", "during the project") but no specific schedule or deadline.
- Score 2: Timing is vague or referenced only in the broader plan, not at the indicator level.
- Score 1: Absent or inadequate. No measurement schedule, cadence, or deadline associated with the indicator.
OUTPUT FORMAT:
Return your assessment as a table followed by a summary:
| Dimension | Score (1-5) | Evidence | Priority Revision |
|-----------|-------------|----------|-------------------|
| Specific | | | |
| Measurable | | | |
| Achievable | | | |
| Relevant | | | |
| Time-bound | | | |
**Total: X/25**
**Band:** Strong (22-25) / Adequate (17-21) / Needs Revision (11-16) / Substantial Revision (5-10)
**Single Most Important Revision:** [One specific sentence]
For any dimension scored 1 or 2, add a brief explanation and a concrete revised indicator statement.
If multiple indicators are provided, score each one separately and then report a portfolio summary identifying which indicators are strongest, which need revision, and which should be rebuilt or dropped.
INDICATOR(S) TO SCORE:
[Paste your indicator(s) here]
Scoring Criteria
Specific
5Excellent
Subject, quality, and population all clearly named. Evaluative terms operationalized. No ambiguity across reviewers.
4Good
Subject and quality clear; population implied or one term mildly ambiguous.
3Adequate
Key term ("improved", "strengthened") not operationalized. Population generic.
2Needs Improvement
Subject or quality vague. Multiple plausible interpretations.
1Inadequate
Reads as a slogan, theme, or result statement, not an indicator.
Measurable
5Excellent
Quantifiable or systematically observable against a defined rubric. Clear unit. Values comparable across periods and sites.
4Good
Quantifiable but unit implicit, or observation method named but not standardized.
3Adequate
Measurable in principle but lacks clear unit or scoring approach.
2Needs Improvement
Qualitative aspiration with no observable referent.
1Inadequate
Purely aspirational language with no path to measurement.
Achievable
5Excellent
Realistic given scale, capacity, geography, and timeline. Within implementer's sphere of influence. Measurement resources plausible.
4Good
Realistic at program scale; one assumption (capacity, external actor, cost) unexamined.
3Adequate
Plausibly achievable but ambition unclear. Conditions unaddressed.
2Needs Improvement
Depends heavily on factors outside implementer's control, or level not seen in comparable work.
1Inadequate
Unrealistic given scale, capacity, or timeframe.
Relevant
5Excellent
Measures the result level it claims. Connects logically to the result statement. Not displaced by a proxy.
4Good
Level mostly correct; one indicator slips down a level or uses a partial proxy.
3Adequate
Level inconsistently applied. Some mis-leveling or activity proxies for higher results.
2Needs Improvement
Does not match its claimed result level.
1Inadequate
Disconnected from the result statement or from the program logic.
Time-bound
5Excellent
Measurement schedule specified (baseline/midline/endline or cadence). Deadline named where applicable. Aligned with decision cycle.
4Good
Schedule named but deadline missing, or deadline named without cadence.
3Adequate
General timing implied ("annually") but no specific schedule or deadline.
2Needs Improvement
Timing vague or only in surrounding text, not at the indicator level.
1Inadequate
No schedule, cadence, or deadline associated with the indicator.
Score Interpretation
Total (out of 25)
Band
Next Step
22-25
Strong
Indicator is SMART and ready for use. Pair with definition, source, and disaggregation work to complete the indicator package.
17-21
Adequate
Address flagged criteria before baseline. Most likely fix: operationalize an evaluative term or attach a measurement schedule.
11-16
Needs Revision
Substantial revision required. Use the Revise prompt to tighten specificity, measurability, and time-bounding before fielding any data collection.
5-10
Substantial Revision
Indicator is too thin to track meaningful change. Rebuild from the result statement using the Generate prompt, then re-score.