Findings Evidence Strength

AI Prompt Templates

Copy a prompt into Claude, ChatGPT, or Gemini. Paste your document at the bottom and run.

Paste a document and get a scored quality assessment with evidence and revision priorities.

5,603 characters

You are an expert M&E evaluator with experience assessing the evidentiary basis of findings in evaluation, monitoring, and research reports. Score the findings section I will provide using the rubric below. Treat each major finding as a unit and look at the evidence behind it.

SCORING RUBRIC - Findings Evidence Strength
Score each dimension 1-5 using these criteria:

DIMENSION 1: Source Citation
- Score 5: All elements present. Every finding is traceable to specific evidence: a data point with value and source, a quote with role and date, an observation with location and date, or a document reference with page or section. A reader could check any finding against its evidence.
- Score 4: At least 80 percent of findings cite specific evidence. A minority rely on aggregated phrases ("respondents reported that...") without naming the source instrument.
- Score 3: Roughly half of findings cite specific evidence; the rest are general claims with the data source named at the section level only.
- Score 2: Less than half of findings cite specific evidence. Most findings are unattributed claims; the reader cannot trace them.
- Score 1: Findings appear as evaluator assertions with no traceable evidence base.

DIMENSION 2: Triangulation Visibility
- Score 5: All elements present. Major findings show multi-source corroboration: the sources are named (KIIs, FGDs, survey, document review, observation, secondary data), points of agreement are explicit, and points of divergence are addressed. Single-source findings are explicitly labeled.
- Score 4: At least 80 percent of major findings show named triangulation. The remainder rely on a single source with the limitation acknowledged.
- Score 3: Half or more findings show triangulation but sources are referred to generically ("multiple sources confirm") without naming which. Divergence rarely addressed.
- Score 2: Triangulation language is used loosely. Reader cannot tell whether corroboration is real or assumed. Less than half of findings show named triangulation.
- Score 1: No triangulation visible. Findings appear to rely on a single source or on evaluator synthesis without source attribution.

DIMENSION 3: Quantitative-Qualitative Integration
- Score 5: All elements present. Where both quantitative and qualitative data exist for a finding, they are used together within the finding: numbers establish magnitude or pattern; qualitative data explains why or how. Integration is at the finding level, not the chapter level.
- Score 4: At least 80 percent of findings with both data types show integration within the finding. Some integration relegated to a separate paragraph but linked.
- Score 3: Mixed practice. Some findings integrate; others present numbers and quotes side by side without explanation of how they connect.
- Score 2: Quantitative and qualitative findings are reported in separate sections or sub-sections. The reader is left to reconcile them.
- Score 1: Quantitative and qualitative data sit in fully separate chapters. No integration at the finding level.

DIMENSION 4: Inference Strength
- Score 5: All elements present. Language of each finding is calibrated to the strength of the evidence. Strong language ("the program produced," "evidence shows") is used only where evidence is strong (triangulated, multiple data points, consistent direction). Hedged language ("suggests," "appears to," "in the perceptions of") is used where evidence is suggestive or thin. Causal language is reserved for findings with a defensible causal claim.
- Score 4: At least 80 percent of findings have calibrated language. A minority overclaim or underclaim relative to their evidence.
- Score 3: Mixed practice. Strong language sometimes appears where evidence is thin, and hedged language sometimes appears where evidence is strong. Reader must judge inference strength from the data, not the language.
- Score 2: Most findings use uniformly strong language regardless of evidence strength. Causal claims attached to single-source or perception-based findings.
- Score 1: Findings overclaim throughout. Strong assertions made on thin or absent evidence.

DIMENSION 5: Contrary Evidence
- Score 5: All elements present. Where contradictory or disconfirming evidence exists, it is acknowledged and addressed. The finding either accommodates the contrary evidence (revised in light of it) or explains why it is set aside (with reasoning). Counter-cases are named.
- Score 4: Contrary evidence acknowledged for at least 80 percent of findings where it would apply. Some left implicit.
- Score 3: Contrary evidence acknowledged for some findings but inconsistently. Where acknowledged, treatment is brief.
- Score 2: Contrary evidence rarely acknowledged. Findings present a clean narrative that the reader cannot verify against alternative views.
- Score 1: No contrary evidence anywhere. Findings read as a single coherent story without dissent.

OUTPUT FORMAT:

| Dimension | Score (1-5) | Evidence | Priority Revision |
|-----------|-------------|----------|-------------------|
| Source Citation | | | |
| Triangulation Visibility | | | |
| Quantitative-Qualitative Integration | | | |
| Inference Strength | | | |
| Contrary Evidence | | | |

**Total: X/25**
**Band:** Strong (22-25) / Adequate (17-21) / Needs Revision (11-16) / Substantial Revision (5-10)
**Single Most Important Revision:** [One specific sentence]

For any dimension scored 1 or 2, list two or three specific findings that exemplify the problem and propose a rewrite for one of them.

FINDINGS SECTION TO SCORE:
[Paste your findings section here]

Scoring Criteria

Source Citation

5Excellent

All elements present. Every finding traceable to specific evidence (data point, quote, observation, document reference).

4Good

At least 80 percent of findings cite specific evidence. Minority rely on aggregated phrasing.

3Adequate

Half of findings cite specific evidence; rest are general claims with section-level source.

2Needs Improvement

Less than half cite specific evidence. Most findings unattributed.

1Inadequate

Findings are evaluator assertions with no traceable evidence base.

Triangulation Visibility

5Excellent

Major findings show named triangulation. Agreements and divergences explicit. Single-source findings labeled.

4Good

At least 80 percent of major findings show named triangulation. Remainder single-source with acknowledgment.

3Adequate

Half show triangulation but sources referred to generically. Divergence rarely addressed.

2Needs Improvement

Triangulation language loose. Reader cannot tell whether corroboration is real or assumed.

1Inadequate

No triangulation visible. Findings rely on single source or unattributed synthesis.

Quantitative-Qualitative Integration

5Excellent

Where both exist, qual and quant used together within each finding. Integration at finding level.

4Good

At least 80 percent of mixed-data findings show within-finding integration.

3Adequate

Mixed practice. Some integrate; others present side by side without explaining connection.

2Needs Improvement

Quant and qual in separate sections. Reader reconciles.

1Inadequate

Quant and qual in fully separate chapters. No integration.

Inference Strength

5Excellent

All elements present. Language calibrated to evidence strength. Strong, hedged, and causal language used appropriately.

4Good

At least 80 percent of findings calibrated. Minority overclaim or underclaim.

3Adequate

Mixed practice. Strong language sometimes on thin evidence; hedged on strong evidence.

2Needs Improvement

Most findings use uniformly strong language. Causal claims on single-source data.

1Inadequate

Findings overclaim throughout. Strong assertions on thin or absent evidence.

Contrary Evidence

5Excellent

Contradictory evidence acknowledged and addressed. Findings either accommodate or explain why set aside. Counter-cases named.

4Good

Contrary evidence acknowledged for at least 80 percent where it applies. Some implicit.

3Adequate

Contrary evidence acknowledged inconsistently. Brief treatment where present.

2Needs Improvement

Contrary evidence rarely acknowledged. Clean narrative.

1Inadequate

No contrary evidence anywhere. Findings read as a single coherent story.

Score Interpretation

Total (out of 25)	Band	Next Step
22-25	Strong	Findings rest on a defensible evidentiary base. Approve.
17-21	Adequate	Address flagged dimensions before approving. Most likely fixes: name triangulation sources and calibrate language to evidence.
11-16	Needs Revision	Return to the evaluation team with the scorecard as revision brief. Common gap is source citation or contrary evidence.
5-10	Substantial Revision	Evidentiary basis is too thin to support the findings. Rebuild using the Revise prompt or a supplementary data collection round.

Scoring Dimensions

1
Source Citation
Whether each finding is traceable to specific evidence (data point, quote, observation, document reference) rather than presented as an unattributed claim.
2
Triangulation Visibility
Whether multi-source corroboration is shown for major findings, with the sources named and the points of agreement and divergence explicit rather than left to the reader to assume.
3
Quantitative-Qualitative Integration
Whether qualitative and quantitative data are used together within each finding where both exist, rather than reported in separate silos that the reader has to reconcile.
4
Inference Strength
Whether the language of each finding is calibrated to the strength of the evidence behind it, using stronger phrasing only when the evidence is strong and hedging when evidence is suggestive or thin.
5
Contrary Evidence
Whether disconfirming or contradictory evidence is acknowledged where it exists, with an explanation of how the finding accommodates or sets it aside, rather than being omitted to produce a cleaner narrative.

Each dimension scored 1–5. Maximum score: 25.

Prompts Using This Rubric

Review Findings for Evidence Strength

Review the findings section of a report for evidence quality.

Review a Findings Section

Review the findings section of an evaluation, monitoring, or research report for source citation, triangulation visibility, qual-quant integration, inference strength, and contrary evidence.

Back to Rubric Library