Theory-Based Evaluation Write-Up

Plantillas de prompts de IA

Copie un prompt en Claude, ChatGPT o Gemini. Pegue su documento al final y ejecute.

Pegue un documento para obtener una evaluación de calidad con puntuación, evidencia y prioridades de revisión.

6,146 caracteres
You are an expert M&E evaluator specializing in theory-based evaluation and process tracing. Score the theory-based evaluation findings section I will provide using the rubric below. A strong theory-based write-up tests the program's theory of change as a hypothesis rather than reciting it as a planning artifact.

SCORING RUBRIC - Theory-Based Evaluation Write-Up
Score each dimension 1-5 using these criteria:

DIMENSION 1: Causal Pathway Restatement
- Score 5: The original ToC pathway is restated (or referenced with a clear diagram or summary) at the start of the findings section. Each causal step (input to output, output to outcome, outcome to impact) is labeled. The reader can see exactly which causal claims are under evaluation before any evidence is presented. Any revisions to the pathway made during evaluation are flagged.
- Score 4: The ToC is restated with most causal steps labeled. Diagram or summary present. Minor steps may be implied rather than explicit.
- Score 3: The ToC is mentioned and partially restated. Some causal steps are clear, others must be inferred. No diagram or only a generic one.
- Score 2: The ToC is referenced in passing without restating the pathway. The reader cannot tell which causal claims the evidence is meant to test.
- Score 1: No reference to the original theory of change. Findings are presented without a causal frame.

DIMENSION 2: Per-Link Evidence
- Score 5: Evidence is organized and presented per causal link, not just per outcome. For each link (e.g., training to behavior change, behavior change to health outcome), the evaluation states what evidence was sought, what was found, and whether the link held. Both confirming and disconfirming evidence appear. Each link receives a verdict (supported, partially supported, refuted, untestable).
- Score 4: Most causal links have evidence presented separately. Verdicts are offered for the majority of links. One or two links may be merged or treated lightly.
- Score 3: Evidence is presented per outcome rather than per link. Some intermediate steps are evidenced but the input-to-output or output-to-outcome distinction is blurred. Verdicts are inconsistent.
- Score 2: Evidence is presented at outcome level only. Intermediate causal steps are assumed rather than tested. No per-link verdicts.
- Score 1: Evidence is presented as a general narrative with no mapping to specific causal links.

DIMENSION 3: Assumption Verification
- Score 5: Every original assumption underpinning the ToC (about context, target group behavior, partner capacity, external conditions) is revisited and explicitly verified or refuted with evidence. New assumptions that surfaced during implementation are also documented and assessed. Where assumptions held in some sites or groups and failed in others, this is disaggregated.
- Score 4: Most assumptions are revisited and verified or refuted. Some minor assumptions may be implied. Disaggregation present for the most material assumptions.
- Score 3: Several assumptions are revisited but coverage is uneven. Some are stated but not tested. Disaggregation absent.
- Score 2: Assumptions are listed but not systematically tested against evidence, OR only assumptions that were confirmed are discussed.
- Score 1: No revisitation of original assumptions. Assumptions may not even be stated.

DIMENSION 4: Alternative Explanations
- Score 5: Plausible alternative explanations for the observed outcomes (other programs operating in the same area, policy changes, economic shifts, maturation effects, selection effects, measurement artifacts) are systematically considered and either ruled out with evidence or acknowledged as competing explanations. The evaluation explicitly states the limits of attribution and where contribution language is more appropriate.
- Score 4: Major alternative explanations are considered and addressed. One or two minor alternatives may be acknowledged briefly without full treatment.
- Score 3: Some alternative explanations are mentioned but assessment is light. Attribution language is used without fully ruling out competing causes.
- Score 2: Alternative explanations are noted in passing or only in a limitations section, OR strong attribution claims are made without considering alternatives.
- Score 1: No alternative explanations considered. Outcomes are attributed directly to the program without justification.

DIMENSION 5: ToC Revision Recommendations
- Score 5: Where evidence warrants, specific recommendations are offered for revising the ToC: links to add, links to drop, links to rewire, assumptions to restate or remove, new outcome pathways to incorporate. Each recommendation is grounded in a finding from the evaluation. A revised ToC diagram or summary is provided. The implications for program design and future MEL are stated.
- Score 4: Several ToC revision recommendations are offered with evidence backing. Revised ToC implied or partially documented.
- Score 3: Some ToC implications are noted but recommendations are general (e.g., "the theory needs updating") without specific changes.
- Score 2: ToC revision discussed in passing without recommendations, OR original ToC reaffirmed without justification despite evidence to the contrary.
- Score 1: No reference to revising the ToC. The theory is treated as fixed regardless of findings.

OUTPUT FORMAT:
Return your assessment as a table followed by a summary:

| Dimension | Score (1-5) | Evidence from Section | Priority Revision |
|-----------|-------------|----------------------|-------------------|
| Causal Pathway Restatement | | | |
| Per-Link Evidence | | | |
| Assumption Verification | | | |
| Alternative Explanations | | | |
| ToC Revision Recommendations | | | |

**Total: X/25**
**Band:** Strong (22-25) / Adequate (17-21) / Needs Revision (11-16) / Substantial Revision (5-10)
**Single Most Important Revision:** [One specific sentence]

For any dimension scored 1 or 2, add a brief explanation and a concrete revision example showing how the section could be rewritten.

THEORY-BASED EVALUATION SECTION TO SCORE:
[Paste your theory-based evaluation findings section here]

Scoring Criteria

Causal Pathway Restatement
5Excellent

Original ToC restated with all causal steps labeled. Diagram or summary present. Reader can see exactly which causal claims are under evaluation. Any pathway revisions flagged.

4Good

ToC restated with most steps labeled. Diagram or summary present. Minor steps may be implied.

3Adequate

ToC mentioned and partially restated. Some steps clear, others inferred. No diagram or only a generic one.

2Needs Improvement

ToC referenced in passing without restating the pathway. Reader cannot tell which causal claims the evidence tests.

1Inadequate

No reference to the original theory of change. Findings presented without a causal frame.

Per-Link Evidence
5Excellent

Evidence organized per causal link. Each link receives a verdict (supported, partially supported, refuted, untestable). Confirming and disconfirming evidence both present.

4Good

Most links have evidence presented separately. Verdicts offered for majority. One or two links merged or treated lightly.

3Adequate

Evidence presented per outcome rather than per link. Some intermediate steps evidenced but distinctions blurred. Verdicts inconsistent.

2Needs Improvement

Evidence presented at outcome level only. Intermediate steps assumed. No per-link verdicts.

1Inadequate

Evidence presented as a general narrative with no mapping to causal links.

Assumption Verification
5Excellent

Every original assumption revisited and verified or refuted with evidence. New assumptions documented and assessed. Disaggregated where assumptions held in some contexts and failed in others.

4Good

Most assumptions revisited and tested. Minor assumptions may be implied. Disaggregation for material assumptions.

3Adequate

Several assumptions revisited but coverage uneven. Some stated but not tested. No disaggregation.

2Needs Improvement

Assumptions listed but not systematically tested, OR only confirmed assumptions discussed.

1Inadequate

No revisitation of original assumptions. Assumptions may not even be stated.

Alternative Explanations
5Excellent

Plausible alternatives systematically considered and ruled in or out with evidence. Limits of attribution stated. Contribution language used where attribution is not warranted.

4Good

Major alternatives considered and addressed. Minor alternatives acknowledged briefly.

3Adequate

Some alternatives mentioned but assessment is light. Attribution language used without ruling out alternatives.

2Needs Improvement

Alternatives noted in passing or only in limitations, OR strong attribution claims made without considering them.

1Inadequate

No alternative explanations considered. Outcomes attributed directly to program without justification.

ToC Revision Recommendations
5Excellent

Specific revision recommendations offered (links to add, drop, rewire; assumptions to restate). Each grounded in a finding. Revised ToC provided. Implications for program design and MEL stated.

4Good

Several revision recommendations offered with evidence backing. Revised ToC implied or partially documented.

3Adequate

Some ToC implications noted but recommendations general (e.g., "needs updating") without specific changes.

2Needs Improvement

ToC revision discussed in passing without recommendations, OR original ToC reaffirmed without justification despite contradictory evidence.

1Inadequate

No reference to revising the ToC. Theory treated as fixed regardless of findings.

Score Interpretation

Total (out of 25)BandNext Step
22-25StrongSection is ready. The ToC has been genuinely tested as a hypothesis.
17-21AdequateAddress flagged dimensions. Most common gap: per-link verdicts or alternative explanations.
11-16Needs RevisionThe theory is referenced but not tested. Restructure evidence by causal link and add assumption verification.
5-10Substantial RevisionThe ToC is decorative rather than evaluated. Rebuild the section from the pathway diagram outward.