A data collection budget covers the actual cost of collecting M&E data: enumerators, transport, instruments, devices, training, supervision, and quality control. For programs with primary data collection, it typically runs 30-50% of the total M&E budget, and more for survey-heavy programs.
What the Budget Covers
A credible data collection budget itemizes the following lines, not a single lump sum:
- Enumerator fees: day rate times days deployed times number of enumerators. The most common hidden driver of cost.
- Per diem, transport, and accommodation for field teams moving between sites.
- Device or paper instrument costs: tablets or phones if digital, printing if paper. Digital usually wins on round two onward.
- Pretesting and piloting: a real line item, not an afterthought. Plan for one to two pilot days per instrument.
- Supervisor time in field: a field supervisor for every five to eight enumerators, with their own per diem and transport.
- Enumerator training days: typically three to five days per round, paid.
- Instrument translation into local languages, including back-translation for quality.
- Platform subscription: Kobo, SurveyCTO, CommCare, or equivalent. Annual or per-submission pricing.
- Data cleaning and quality checks: a line, not absorbed into staff time.
Budgeting Per Round
Estimate on a per-round basis: sample size times enumeration cost per interview, plus overhead. Multi-round studies scale by number of rounds (baseline plus midline plus endline equals three rounds) minus shared setup costs. Instrument development, translation, and platform build are done once. Training is partially reused, though refresher training is needed each round. Fieldwork is paid fresh every round.
A useful shortcut: cost out one complete round end to end, then multiply by rounds and subtract shared setup. If you cannot cost a single round cleanly, the budget is not ready.
Typical Scale
For a mid-size program around $5M over three years, data collection costs typically land in the $150,000 to $400,000 range depending on survey intensity. Programs with quarterly panel surveys run higher. Programs with annual small assessments or secondary data reliance run lower. A program doing a single household survey at baseline and endline will sit at the low end. A program running quarterly facility surveys across 40 sites will sit at the high end.
Proposal Context
The data collection budget is where proposal reviewers check whether the MEL plan is actually executable. Common red flags: (a) proposing baseline, midline, and endline surveys with n=2,000 but budgeting $30,000 total, which is not enough for even one round; (b) no enumerator training line item; (c) no supervision cost for field teams; (d) no instrument translation budget in multilingual contexts. The budget should itemize per round and be defensible against the indicator set. If the MEL plan requires a 40-village household survey quarterly, the budget must cover four rounds of 40-village deployment. Weak data collection budgets are one of the top reasons MEL plans fail at execution: the plan looks good, the budget does not match, and fieldwork gets cut mid-program.
Common Mistakes
Underbudgeting by costing a single round when multi-round is needed. Teams price one baseline and forget that midline and endline cost almost the same, minus one-time setup. A three-round design is not 1.2x a single round. It is closer to 2.3x to 2.6x.
Collapsing the data collection budget into the M&E staffing budget. Enumeration and supervision are activity costs, not staff costs. Keep them separate so reviewers and program managers can see what fieldwork actually costs against what the M&E team costs.
Related Topics
- MEL Plans: The plan the data collection budget must resource
- M&E Staffing Plan: The sibling budget line covering MEL team cost
- Data Quality Assurance: The verification activities that sit inside the collection budget
- Baseline: The first and often most expensive collection round