Can I combine cluster and stratified sampling?

Yes, and most large M&E surveys do. Stratified cluster sampling first divides the population into strata (e.g., regions or urban/rural), then samples clusters (e.g., villages) within each stratum, then samples individuals within clusters. This gives you the cost efficiency of clustering with the representativeness of stratification.

Which approach needs a larger sample size?

Cluster sampling always requires a larger sample than simple random sampling because of the design effect. Stratified sampling can actually reduce the required sample size if the strata are internally homogeneous. In practice, stratified cluster sampling needs 1.5-3x the sample of simple random sampling, depending on the intra-cluster correlation.

Cluster Sampling vs Stratified Sampling

At a Glance

Factor	Cluster Sampling	Stratified Sampling
Primary purpose	Reduce cost and logistics	Ensure subgroup representation
How it works	Sample groups (clusters), survey everyone or subsample within	Divide population into strata, sample independently from each
Cost	Lower (fewer locations to visit)	Higher (must reach every stratum)
Precision	Lower (design effect inflates variance)	Higher (reduces variance if strata are homogeneous)
Sample size needed	Larger (1.5-3x simple random)	Same or smaller than simple random
Requires	List of clusters (villages, schools, facilities)	Known population characteristics for stratification
Best for	Geographically dispersed populations	Surveys needing subgroup comparisons

These two approaches solve different problems. Cluster sampling makes data collection affordable when your population is spread across a large area. Stratified sampling ensures you can say something meaningful about specific subgroups. Most large M&E surveys use both.

Stratified Cluster Sampling: The Common Hybrid

Most large-scale M&E surveys use stratified cluster sampling. This is not an either/or choice. The two approaches combine naturally.

How it works in practice:

Stratify the population by the characteristic you need to compare (region, urban/rural, program/comparison).
Within each stratum, list all clusters (villages, enumeration areas, schools).
Randomly select clusters within each stratum, using probability proportional to size (PPS) so larger clusters have a proportionally higher chance of selection.
Within each selected cluster, randomly select a fixed number of respondents (typically 10-25 households).

Worked example. A baseline survey for an education program covers 4 provinces (strata). Each province has 80-120 schools (clusters). The design selects 15 schools per province (60 total) and surveys 20 students per school (1,200 total). Stratification guarantees provincial-level estimates. Clustering makes it feasible to reach 1,200 students without visiting all 400+ schools.

Sample size calculation for the hybrid. Start with the sample size you would need for simple random sampling. Multiply by the design effect to account for clustering. Then allocate across strata. For proportional allocation, each stratum gets a share proportional to its population. For equal allocation (when you need stratum-level estimates with equal precision), each stratum gets the same number. See How to Choose Sample Size for the full calculation.

Design Effect: The Number That Matters

The design effect (DEFF) is the ratio of the variance under your actual sampling design to the variance under simple random sampling. It is the single most important number in cluster sampling. Ignore it and your confidence intervals are wrong, your significance tests are invalid, and your conclusions are unreliable.

Typical design effects in development evaluations:

Indicator type	Typical ICC	DEFF (20 per cluster)
Vaccination coverage	0.02-0.05	1.4-2.0
Stunting prevalence	0.03-0.08	1.6-2.5
School attendance	0.05-0.15	2.0-3.9
Income/expenditure	0.05-0.10	2.0-2.9
Knowledge/attitudes	0.02-0.06	1.4-2.1

Rule of thumb: If you do not have a local ICC estimate, use 0.05 for health indicators, 0.10 for education and economic indicators, and 0.15 for attitudinal indicators. Always err on the high side.

Common Mistakes

Mistake 1: Ignoring the design effect entirely. This is the most common sampling error in M&E. A sample of 384 households (the classic "infinite population" calculation) spread across 20 clusters does not give you the precision you think it does. If the DEFF is 2.0, your effective sample size is only 192. Your confidence intervals are wider and your statistical tests are weaker than reported. Always calculate the design effect and adjust your sample size accordingly.

Mistake 2: Too few clusters with too many respondents per cluster. Precision in cluster sampling depends more on the number of clusters than on the number of respondents per cluster. Surveying 50 people in 10 villages gives you less precision than surveying 20 people in 25 villages. Once you pass 20-25 respondents per cluster, adding more people in the same cluster adds very little information. Spend your budget on more clusters, not more interviews per cluster.

Mistake 3: Stratifying on too many variables. Every stratum you add subdivides your sample further. Stratify by region (4 strata) and urban/rural (2 strata) and you have 8 strata. Add gender and you have 16. If your total sample is 800, each stratum gets 50 respondents, which is often too few for meaningful analysis. Stratify only on variables where you genuinely need stratum-level estimates or where the difference between strata is large enough to affect your overall estimate.

Mistake 4: Using cluster sampling when you can afford simple random sampling. If your population is concentrated in a small area (one city, one district), the cost savings of clustering are minimal but the precision loss is real. Use simple random or stratified random sampling when logistics allow it.

Mistake 5: Forgetting to account for non-response. Inflate your sample by 10-20% to account for households that are absent, refuse, or are unreachable. In cluster sampling, losing an entire cluster (security incident, road washout) is devastating because you lose all respondents at once. Plan for 1-2 replacement clusters.

Mistake 6: Treating disaggregation as stratification. Disaggregation means breaking down results by subgroup after data collection. Stratification means designing the sample to ensure adequate subgroup representation before data collection. If you need reliable subgroup estimates, stratify. If you just want to report overall results broken down by group, disaggregation of a well-designed sample may be sufficient, but check that subgroup sample sizes are adequate.

Decision Guide

1. Is your population geographically spread out?

Yes, and field logistics are a major cost driver: Use cluster sampling (or stratified cluster sampling).
No, population is concentrated: Use simple random or stratified sampling. Skip clustering.

2. Do you need to compare subgroups?

Yes, with reliable estimates for each: Stratify by those subgroups. Allocate enough sample to each stratum for the analysis you plan.
No, you just need an overall estimate: Stratification is optional but may still improve precision.

3. How many subgroups do you need to compare?

2-4: Stratified sampling works well. Each stratum gets enough sample for meaningful estimates.
5+: Consider which comparisons are most important. You cannot stratify on everything with a finite sample.

4. What is your budget?

Enough for all locations: Simple random or stratified. Maximum precision.
Limited (must reduce travel): Cluster sampling. Accept the precision tradeoff and increase sample size to compensate.

Use the Sampling Calculator to compute sample sizes for any combination of these approaches.

At a Glance

Factor	Cluster Sampling	Stratified Sampling
Primary purpose	Reduce cost and logistics	Ensure subgroup representation
How it works	Sample groups (clusters), survey everyone or subsample within	Divide population into strata, sample independently from each
Cost	Lower (fewer locations to visit)	Higher (must reach every stratum)
Precision	Lower (design effect inflates variance)	Higher (reduces variance if strata are homogeneous)
Sample size needed	Larger (1.5-3x simple random)	Same or smaller than simple random
Requires	List of clusters (villages, schools, facilities)	Known population characteristics for stratification
Best for	Geographically dispersed populations	Surveys needing subgroup comparisons

Stratified Cluster Sampling: The Common Hybrid

Most large-scale M&E surveys use stratified cluster sampling. This is not an either/or choice. The two approaches combine naturally.

How it works in practice:

Stratify the population by the characteristic you need to compare (region, urban/rural, program/comparison).
Within each stratum, list all clusters (villages, enumeration areas, schools).
Randomly select clusters within each stratum, using probability proportional to size (PPS) so larger clusters have a proportionally higher chance of selection.
Within each selected cluster, randomly select a fixed number of respondents (typically 10-25 households).

Design Effect: The Number That Matters

Typical design effects in development evaluations:

Indicator type	Typical ICC	DEFF (20 per cluster)
Vaccination coverage	0.02-0.05	1.4-2.0
Stunting prevalence	0.03-0.08	1.6-2.5
School attendance	0.05-0.15	2.0-3.9
Income/expenditure	0.05-0.10	2.0-2.9
Knowledge/attitudes	0.02-0.06	1.4-2.1

Common Mistakes

Decision Guide

1. Is your population geographically spread out?

Yes, and field logistics are a major cost driver: Use cluster sampling (or stratified cluster sampling).
No, population is concentrated: Use simple random or stratified sampling. Skip clustering.

2. Do you need to compare subgroups?

Yes, with reliable estimates for each: Stratify by those subgroups. Allocate enough sample to each stratum for the analysis you plan.
No, you just need an overall estimate: Stratification is optional but may still improve precision.

3. How many subgroups do you need to compare?

2-4: Stratified sampling works well. Each stratum gets enough sample for meaningful estimates.
5+: Consider which comparisons are most important. You cannot stratify on everything with a finite sample.

4. What is your budget?

Enough for all locations: Simple random or stratified. Maximum precision.
Limited (must reduce travel): Cluster sampling. Accept the precision tradeoff and increase sample size to compensate.

Use the Sampling Calculator to compute sample sizes for any combination of these approaches.

Cluster Sampling vs Stratified Sampling

At a Glance

When Cluster Sampling Saves Money

When Stratified Sampling Ensures Representation

Stratified Cluster Sampling: The Common Hybrid

Design Effect: The Number That Matters

Common Mistakes

Decision Guide

Frequently Asked Questions

Cluster Sampling vs Stratified Sampling

At a Glance

When Cluster Sampling Saves Money

When Stratified Sampling Ensures Representation

Stratified Cluster Sampling: The Common Hybrid

Design Effect: The Number That Matters

Common Mistakes

Decision Guide

Frequently Asked Questions