Copiez un prompt dans Claude, ChatGPT ou Gemini. Collez votre document en bas et exécutez.
Collez un document pour obtenir une évaluation de qualité notée, avec preuves et priorités de révision.
5,906 caractères
You are an expert M&E advisor specializing in learning and adaptive management. Score the learning agenda I will provide using the rubric below.
SCORING RUBRIC - MEL Learning Agenda Quality
Score each dimension 1-5 using these criteria:
DIMENSION 1: Strategic Relevance
- Score 5: Every question addresses a genuine uncertainty in the program theory (an untested assumption, a contested mechanism, or a context-specific unknown). Questions reflect input from program staff, partners, and intended users. None of the questions are already answered by existing evidence or are peripheral to program decisions.
- Score 4: No fewer than 80 percent of questions address genuine uncertainties tied to program theory. One or two questions may be lower-priority but remain plausibly useful. Stakeholder input is visible in question framing.
- Score 3: Half or more questions address genuine uncertainties. The remainder are interesting but not tied to a specific program assumption or decision. Stakeholder input partial or implied.
- Score 2: Fewer than half of questions address genuine uncertainties. Several questions are already answered by existing evidence, OR several are peripheral (descriptive curiosities rather than learning questions).
- Score 1: Questions are generic ("what works?") or unrelated to the program theory. No connection to documented assumptions or program decisions.
DIMENSION 2: Answerability
- Score 5: Every question is phrased so it can be answered with data the program will collect or can reasonably gather within budget and timeline. The data type, source, and method that will answer the question are implied by the question's wording (e.g., "how does X vary across rural and peri-urban sites" implies disaggregated quantitative data).
- Score 4: No fewer than 80 percent of questions are answerable with available or readily collectable data. The remainder may require additional data collection that is feasible but not yet planned.
- Score 3: Half or more questions are answerable with available data. The remainder are too broad or abstract to be answered without significant additional methods design.
- Score 2: Fewer than half are answerable as phrased. Several questions are unfalsifiable ("is the program transformative?") or require methods well beyond the program's resources.
- Score 1: Questions are unanswerable as phrased. Wording is so abstract that no data source or method could resolve them.
DIMENSION 3: Decision Linkage
- Score 5: Every question names a specific decision, adaptation, or audience that depends on the answer. The decision-maker or user is identified for each question. The link between answer and action is explicit (e.g., "if X, then we will adjust Y").
- Score 4: No fewer than 80 percent of questions name a decision or audience. The remainder are clearly relevant but the specific decision is not stated.
- Score 3: Half or more questions name a decision or audience. The remainder reference learning in general terms without naming who will act on the answer.
- Score 2: Fewer than half of questions name a decision or audience. Several questions are presented as learning for its own sake, with no link to program adaptation or reporting.
- Score 1: No decision linkage. Questions are listed without any reference to who will use the answers or what they will do differently.
DIMENSION 4: Scope and Prioritization
- Score 5: The set is right-sized for the program (typically 3 to 8 questions for a multi-year program). Priority questions are explicitly distinguished from secondary ones. The prioritization rationale is stated (e.g., highest-uncertainty, most decision-relevant).
- Score 4: Set size is appropriate (3 to 10 questions). Priority distinction made, though rationale for prioritization is brief or implicit.
- Score 3: Set size is plausible but no priority distinction made, OR set size is on the high end (8 to 12) with weak prioritization. All questions presented as equally important.
- Score 2: Set is too large (more than 12 questions) and unprioritized, OR set is a single question that cannot represent the program's learning needs.
- Score 1: No structured set. Either no questions are presented as a learning agenda, or the list is so long it functions as a research wish list with no priorities.
DIMENSION 5: Operationalization
- Score 5: For every question: named responsible position, stated timing (when the question will be addressed), synthesis method (how data will be analyzed and discussed), and audience for the answer. A learning event or product is scheduled per question (e.g., quarterly reflection, annual learning brief).
- Score 4: For no fewer than 80 percent of questions: responsibility, timing, and synthesis method named. Audience may be implied rather than stated.
- Score 3: Operational details given for half or more questions. The remainder are listed without responsibility, timing, or method. No learning event or product scheduled.
- Score 2: Operational details given for fewer than half of questions. Most appear as a list with no execution plan.
- Score 1: No operational detail at all. Questions are stated but nothing is said about who answers them, when, or how.
OUTPUT FORMAT:
Return your assessment as a table followed by a summary:
| Dimension | Score (1-5) | Evidence | Priority Revision |
|-----------|-------------|----------|-------------------|
| Strategic Relevance | | | |
| Answerability | | | |
| Decision Linkage | | | |
| Scope and Prioritization | | | |
| Operationalization | | | |
**Total: X/25**
**Band:** Strong (22-25) / Adequate (17-21) / Needs Revision (11-16) / Substantial Revision (5-10)
**Single Most Important Revision:** [One specific sentence]
For any dimension scored 1 or 2, add a brief explanation and a concrete revision example.
LEARNING AGENDA TO SCORE:
[Paste your Learning agenda or learning questions section of a MEL plan here]
Scoring Criteria
Strategic Relevance
5Excellent
Every question addresses a genuine uncertainty in the program theory. Stakeholder input visible. None already answered by existing evidence or peripheral.
4Good
No fewer than 80 percent address genuine uncertainties. One or two lower-priority but plausibly useful. Stakeholder input visible.
3Adequate
Half or more address genuine uncertainties. The remainder interesting but untethered to specific assumptions. Stakeholder input partial.
2Needs Improvement
Fewer than half address genuine uncertainties. Several already answered or peripheral.
1Inadequate
Questions generic ("what works?") or unrelated to program theory. No connection to assumptions or decisions.
Answerability
5Excellent
Every question phrased so it can be answered with available or readily collectable data. Data type, source, and method implied by wording.
4Good
No fewer than 80 percent answerable with available or feasible data. The remainder need feasible but unplanned collection.
3Adequate
Half or more answerable. The remainder too broad or abstract without significant methods design.
2Needs Improvement
Fewer than half answerable. Several unfalsifiable or require methods beyond program resources.
1Inadequate
Unanswerable as phrased. No data source or method could resolve them.
Decision Linkage
5Excellent
Every question names a specific decision, adaptation, or audience. Decision-maker identified. Link between answer and action explicit.
4Good
No fewer than 80 percent name a decision or audience. The remainder relevant but decision unstated.
3Adequate
Half or more name a decision or audience. The remainder reference learning in general terms.
2Needs Improvement
Fewer than half name a decision or audience. Several presented as learning for its own sake.
1Inadequate
No decision linkage. Questions listed without reference to use or action.
Scope and Prioritization
5Excellent
Right-sized (3-8 questions). Priority distinction explicit with stated rationale.
4Good
Set size appropriate (3-10). Priority distinction made, rationale brief or implicit.
3Adequate
Set size plausible but no priority distinction, OR set on high end (8-12) with weak prioritization.
2Needs Improvement
Set too large (12+) and unprioritized, OR single question covering everything.
1Inadequate
No structured set. Either no agenda or a research wish list.
Operationalization
5Excellent
For every question: responsible position, timing, synthesis method, audience, scheduled learning event or product.
4Good
No fewer than 80 percent: responsibility, timing, synthesis method named. Audience may be implied.
3Adequate
Half or more: operational details given. The remainder listed without execution plan.
2Needs Improvement
Fewer than half have operational detail. Most appear as a list with no execution plan.
1Inadequate
No operational detail. Questions stated but nothing about who, when, or how.
Score Interpretation
Total Score
Band
Next Step
22-25
Strong
Minor refinements only
17-21
Adequate
Address flagged dimensions before submission
11-16
Needs Revision
Return to MEL team with AI output as revision brief
5-10
Substantial Revision
Redesign the learning agenda with stakeholder input before proceeding