SMART Criteria Check

AI Prompt Templates

Copy a prompt into Claude, ChatGPT, or Gemini. Paste your document at the bottom and run.

Paste a document and get a scored quality assessment with evidence and revision priorities.

5,613 characters

You are an expert M&E specialist with deep experience reviewing indicators across logframes, MEL plans, PIRS, and results frameworks. Score the indicator or set of indicators I will provide against the SMART criteria using the rubric below. The input may be a single indicator statement or a list of indicators extracted from a logframe, MEL plan, PIRS, or other M&E document.

SCORING RUBRIC - SMART Criteria Check
Score each dimension 1-5 using these criteria:

DIMENSION 1: Specific
- Score 5: All elements present. The subject is named clearly (who or what is being measured). The quality being captured is unambiguous (e.g., "completion" defined, "improved" operationalized). The population is bounded (e.g., women aged 15-49 in target districts). No room for different interpretations across staff or sites.
- Score 4: Most elements present. Subject and quality are clear; population is implied but not stated, or one term is slightly ambiguous.
- Score 3: Indicator is broadly understandable but a key term ("strengthened", "improved", "supported") is not operationalized. Population is generic.
- Score 2: Subject or quality is vague. Multiple plausible interpretations across reviewers.
- Score 1: Absent or inadequate. Indicator reads as a slogan, theme, or result statement rather than something that names a specific thing to measure.

DIMENSION 2: Measurable
- Score 5: All elements present. The indicator is quantifiable (number, percentage, ratio, score) or systematically observable against a defined rubric or checklist. A clear unit is implied (people, households, events). The value produced is comparable across time periods and sites.
- Score 4: Most elements present. Quantifiable but unit is implicit, or the observation method is named but not standardized.
- Score 3: Indicator can be measured in principle but lacks a clear unit or scoring approach. Comparability across periods would require additional definition work.
- Score 2: Indicator is qualitative aspiration with no observable referent (e.g., "stakeholders feel ownership"). No measurement approach implied.
- Score 1: Absent or inadequate. Indicator is purely aspirational language with no path to measurement.

DIMENSION 3: Achievable
- Score 5: All elements present. The target embedded or implied is realistic given the program's scale, capacity, geography, and timeline. The indicator is within the implementer's sphere of influence (not entirely dependent on external actors). Resources for measurement are plausible. Comparable programs in similar contexts have demonstrated reachable values at similar levels.
- Score 4: Most elements present. Realistic at the program scale, but one assumption (capacity, external actor, measurement cost) is unexamined.
- Score 3: Plausibly achievable but ambition is unclear. The target level depends on conditions that are not addressed.
- Score 2: Indicator depends heavily on factors outside the implementer's control or sets a level not seen in comparable work.
- Score 1: Absent or inadequate. Indicator is unrealistic given scale, capacity, or timeframe, or measures a result the program cannot plausibly produce.

DIMENSION 4: Relevant
- Score 5: All elements present. The indicator measures the result level it claims (output indicator measures an output, outcome indicator measures an outcome, impact indicator measures impact). It connects logically to the result statement it sits under. It is not a proxy displaced from its actual level (e.g., counting trainings to "measure" capacity).
- Score 4: Most elements present. Level is mostly correct; one indicator slips down a level or uses a partial proxy.
- Score 3: Result-chain level is inconsistently applied. Some indicators are mis-leveled or measure activity proxies for higher-level results.
- Score 2: Indicator does not match its claimed result level (e.g., labelled as "outcome" but measures activity completion).
- Score 1: Absent or inadequate. Indicator is disconnected from the result it sits under or from the program logic entirely.

DIMENSION 5: Time-bound
- Score 5: All elements present. A measurement schedule is specified (baseline, midline, endline, or a stated cadence such as quarterly or annually). A target date or end-of-project deadline is named where applicable. Reporting points align with the program's decision cycle.
- Score 4: Most elements present. Schedule named but a deadline is missing, or a deadline is named without a measurement cadence.
- Score 3: General timing implied ("annually", "during the project") but no specific schedule or deadline.
- Score 2: Timing is vague or referenced only in the broader plan, not at the indicator level.
- Score 1: Absent or inadequate. No measurement schedule, cadence, or deadline associated with the indicator.

OUTPUT FORMAT:
Return your assessment as a table followed by a summary:

| Dimension | Score (1-5) | Evidence | Priority Revision |
|-----------|-------------|----------|-------------------|
| Specific | | | |
| Measurable | | | |
| Achievable | | | |
| Relevant | | | |
| Time-bound | | | |

**Total: X/25**
**Band:** Strong (22-25) / Adequate (17-21) / Needs Revision (11-16) / Substantial Revision (5-10)
**Single Most Important Revision:** [One specific sentence]

For any dimension scored 1 or 2, add a brief explanation and a concrete revised indicator statement.

If multiple indicators are provided, score each one separately and then report a portfolio summary identifying which indicators are strongest, which need revision, and which should be rebuilt or dropped.

INDICATOR(S) TO SCORE:
[Paste your indicator(s) here]

Scoring Criteria

Specific

5Excellent

Subject, quality, and population all clearly named. Evaluative terms operationalized. No ambiguity across reviewers.

4Good

Subject and quality clear; population implied or one term mildly ambiguous.

3Adequate

Key term ("improved", "strengthened") not operationalized. Population generic.

2Needs Improvement

Subject or quality vague. Multiple plausible interpretations.

1Inadequate

Reads as a slogan, theme, or result statement, not an indicator.

Measurable

5Excellent

Quantifiable or systematically observable against a defined rubric. Clear unit. Values comparable across periods and sites.

4Good

Quantifiable but unit implicit, or observation method named but not standardized.

3Adequate

Measurable in principle but lacks clear unit or scoring approach.

2Needs Improvement

Qualitative aspiration with no observable referent.

1Inadequate

Purely aspirational language with no path to measurement.

Achievable

5Excellent

Realistic given scale, capacity, geography, and timeline. Within implementer's sphere of influence. Measurement resources plausible.

4Good

Realistic at program scale; one assumption (capacity, external actor, cost) unexamined.

3Adequate

Plausibly achievable but ambition unclear. Conditions unaddressed.

2Needs Improvement

Depends heavily on factors outside implementer's control, or level not seen in comparable work.

1Inadequate

Unrealistic given scale, capacity, or timeframe.

Relevant

5Excellent

Measures the result level it claims. Connects logically to the result statement. Not displaced by a proxy.

4Good

Level mostly correct; one indicator slips down a level or uses a partial proxy.

3Adequate

Level inconsistently applied. Some mis-leveling or activity proxies for higher results.

2Needs Improvement

Does not match its claimed result level.

1Inadequate

Disconnected from the result statement or from the program logic.

Time-bound

5Excellent

Measurement schedule specified (baseline/midline/endline or cadence). Deadline named where applicable. Aligned with decision cycle.

4Good

Schedule named but deadline missing, or deadline named without cadence.

3Adequate

General timing implied ("annually") but no specific schedule or deadline.

2Needs Improvement

Timing vague or only in surrounding text, not at the indicator level.

1Inadequate

No schedule, cadence, or deadline associated with the indicator.

Score Interpretation

Total (out of 25)	Band	Next Step
22-25	Strong	Indicator is SMART and ready for use. Pair with definition, source, and disaggregation work to complete the indicator package.
17-21	Adequate	Address flagged criteria before baseline. Most likely fix: operationalize an evaluative term or attach a measurement schedule.
11-16	Needs Revision	Substantial revision required. Use the Revise prompt to tighten specificity, measurability, and time-bounding before fielding any data collection.
5-10	Substantial Revision	Indicator is too thin to track meaningful change. Rebuild from the result statement using the Generate prompt, then re-score.

Scoring Dimensions

1
Specific
Whether the indicator is clear about what is being measured, with no ambiguity about the subject, the quality being captured, or the population concerned.
2
Measurable
Whether the indicator is quantifiable or systematically observable, expressed in a way that produces a comparable value rather than an aspirational statement.
3
Achievable
Whether the indicator is realistic given the program context, capacity, geography, and budget, rather than dependent on conditions outside the implementer's reach.
4
Relevant
Whether the indicator matches the result level it claims to measure, so that an output indicator measures an output, an outcome indicator measures an outcome, and so on.
5
Time-bound
Whether a measurement deadline or reporting schedule is specified, so the indicator can be tracked against a defined point or cadence rather than indefinitely.

Each dimension scored 1–5. Maximum score: 25.

Prompts Using This Rubric

Review Indicators Against SMART

Score indicators specifically against SMART (Specific / Measurable / Achievable / Relevant / Time-bound) with per-criterion feedback.

Back to Rubric Library