Definition
A census measures every single unit in your target population: every beneficiary, household, or facility. A sample selects a subset of units using a defined method, then uses statistical techniques to infer characteristics about the whole population. The choice between them is fundamental to survey design and directly affects your data collection burden, cost, and the precision of your findings.
A census eliminates sampling error entirely but introduces other challenges: it is expensive, time-consuming, and often impractical for large populations. A sample is far more efficient but carries sampling uncertainty that must be quantified and managed through proper sampling methods.
Key Differences
| Census | Sample | |
|---|---|---|
| Coverage | Every unit in the population | Representative subset |
| Sampling error | None | Present (but quantifiable) |
| Cost | High | Lower |
| Time required | Longer | Shorter |
| Best for | Small, accessible populations | Large populations |
| Precision | Exact | Estimated with confidence interval |
| Generalisation | No inference needed | Requires probability sampling |
Why It Matters
This decision determines your entire baseline design and constrains what you can claim about program impact. If you measure everyone, you know your population's characteristics exactly - but you may never know whether changes would have occurred without your intervention. If you sample, you can construct a comparison group and estimate causal effects, but you must accept statistical uncertainty.
The choice also affects cost-effectiveness. A census of 10,000 households may cost 10x more than a well-designed sample of 1,000, yet provide only marginally better precision for most program-level indicators. Understanding this tradeoff is essential for designing M&E systems that deliver useful evidence without wasting resources.
In Practice
Use a census when:
- Your population is small and accessible (e.g., 200 students in one school district, 50 facilities in a region)
- You need exact counts for administrative purposes (e.g., beneficiary registration, resource allocation)
- Sampling frames are unreliable or non-existent, making probability sampling impossible
- The stakes are extremely high and sampling uncertainty is unacceptable (e.g., post-disaster needs assessment for limited resources)
Use a sample when:
- Your population is large (typically 1,000+ units)
- You need to generalise findings beyond your measured units
- Budget or time constraints make a census infeasible
- You're conducting an impact evaluation requiring a comparison group
- You can construct a reliable sampling frame (complete list of population members)
Common approaches:
- Complete enumeration for small programs (e.g., all 150 graduates of a scholarship program)
- Probability sampling (simple random, stratified, cluster) for representative surveys
- Mixed approach: census for administrative data, sample for outcome measurement
- LQAS (Lot Quality Assurance Sampling) for rapid classification of small areas
The key is matching your design to your program's scale, your evaluation questions, and your resources. A poorly executed census (with high non-response) often yields worse data than a well-executed sample.
Worked Example: Two Program Contexts
Context A: Scholarship program, 500 graduates across 3 cities
A census is feasible here. The population is bounded and known (the scholarship office has a complete list), geographically concentrated, and small enough that the cost of reaching everyone is manageable. A 500-unit survey typically costs $5,000 to $15,000 depending on mode (phone, in-person, online). Even with a 70% response rate, you recover 350 responses and can report exact figures for the whole cohort. Sampling here would introduce uncertainty for no meaningful saving.
Context B: Livelihoods program, 48,000 beneficiaries across 12 districts
A sample is required. A census would cost roughly $240,000 at $5 per household for enumerator time, which is usually more than the entire evaluation budget. A stratified random sample of 800 households, properly designed, produces estimates with a margin of error around plus or minus 3.5 percentage points at 95 percent confidence. The sample costs about $6,000 to $10,000 to field. For most program-level indicators (participation rates, satisfaction, outcome achievement), the sample gives decisions of the same quality as the census at a fraction of the cost.
What changes the answer?
- If the population is highly heterogeneous (different sub-groups with different outcomes), stratified sampling is essential and sample size grows.
- If you need disaggregated estimates for every district, you may need either a larger sample or a small census per district.
- If a complete list of the population is missing or unreliable, probability sampling is not possible and a de facto census of those you can reach becomes the fallback.
Common Mistakes to Avoid
1. Ignoring non-response in a census. A census with a 50 percent response rate is not a census of your population. It is an unrepresentative sample of the half who chose to respond. Non-response bias in "census" data is often worse than sampling error in a well-designed probability sample.
2. Confusing a complete enumeration with a purposive sample. Visiting every site you can reach is not a census if your sampling frame was incomplete to begin with. The distinction matters for what you can claim.
3. Defaulting to a census for small populations without checking cost. A small population is often a signal to census, but not always. If the 200 units are spread across 8 countries and require travel, a sample of 60 may be far more efficient and still answer the question.
4. Sampling when exact counts are required. Beneficiary registration, resource distribution, and financial reporting need exact counts, not estimates. Sampling is inappropriate for these administrative functions even when the population is large.
5. Reporting sample estimates as if they were census counts. A sample gives you an estimate with uncertainty. Reports should always include confidence intervals or margins of error, not point estimates presented as facts.
Frequently Asked Questions
What is census sampling?
"Census sampling" is a slightly ambiguous term. Strictly, a census is not a sample: it measures every unit. The phrase is sometimes used loosely to mean "complete enumeration within a defined sub-group," such as censusing all facilities in one district while sampling households within each. When you see the term, check what the author actually did.
When is a census infeasible?
A census becomes infeasible when (a) the population is too large for your budget, (b) the population is geographically dispersed and access costs dominate, (c) you cannot construct a complete list of the population, or (d) the timeframe for the evaluation is too short to reach everyone. In these cases, a probability sample is the practical choice.
Can you mix census and sample approaches?
Yes, and mixed designs are common. A typical M&E system might census all implementing facilities (for administrative data and quality assurance) while sampling beneficiaries within facilities (for outcome measurement). This captures the precision advantages of census coverage at the structural level while keeping beneficiary measurement costs manageable.
Is a census always more accurate than a sample?
No. A well-designed probability sample with a high response rate often produces more reliable estimates than a census with substantial non-response. Accuracy depends on data quality, not coverage alone. A 100 percent sampling frame with a 40 percent response rate is worse than a 10 percent sample with an 85 percent response rate.
What is the difference between a census and a sample survey in statistics?
In statistics, a census is a complete count of a population; a sample survey collects data from a subset and uses statistical inference to draw conclusions about the whole. The sample introduces sampling error (quantifiable through confidence intervals), while a census eliminates sampling error but may introduce non-sampling errors like non-response, measurement error, and coverage gaps.
Related Topics
- Sampling Methods: detailed approaches for selecting representative subsets
- Baseline Design: establishing measurement timing and comparison groups
- Survey Design: constructing instruments and protocols
- Data Quality Assurance: ensuring measurement reliability
- Cost-Effectiveness Analysis: evaluating resource tradeoffs