Primary vs Secondary Data

Library

Contribution Analysis Developmental Evaluation Impact Evaluation Logframe / Logical Framework Most Significant Change Outcome Harvesting Outcome Mapping Participatory Evaluation Process Tracing Quasi-Experimental Design Realist Evaluation Results Framework Results-Based Management Theory of Change Utilization-Focused Evaluation

Definition

Primary data is information you collect firsthand for a specific M&E purpose, through surveys, interviews, focus groups, or direct observation. Secondary data is existing data collected by someone else for a different purpose that you repurpose for your analysis.

The distinction matters because each type carries different trade-offs. Primary data gives you precise control over what you measure, how you measure it, and the quality standards applied, but it requires significant time and resources. Secondary data can provide immediate insights at minimal cost, but may not align perfectly with your indicators or context.

Why It Matters

Choosing between primary and secondary data is one of the most important cost-benefit decisions in M&E. Misjudging this choice leads to either wasteful data collection when existing data would suffice, or flawed analysis based on data that doesn't answer your evaluation questions.

Secondary data is particularly valuable for baseline studies (historical data can establish pre-intervention conditions), triangulation (independent sources strengthen validity), and rapid assessments where time or budget constraints rule out full primary collection. However, relying exclusively on secondary data risks measuring the wrong things, donor indicators, administrative records, or published statistics rarely match your programme's specific theory of change.

In Practice

Primary data collection includes:

Programme-specific survey design and implementation
Key informant interviews with programme beneficiaries and staff
Focus group discussions on programme experiences
Direct observation of activities and outputs
Most significant change stories collected through your monitoring system

Secondary data sources include:

Government census and household surveys (DHS, LSMS, EMIS)
Administrative records (health facility reports, school attendance logs)
Donor databases and sector-wide statistics
Academic research and evaluation reports from similar programmes
Satellite imagery and remote sensing data
Social media and digital platform analytics

Best practice: Use a mixed approach. Secondary data establishes context and benchmarks; primary data captures programme-specific outcomes. For example, a health programme might use national DHS data for baseline health indicators while collecting primary data on programme-specific service quality and patient satisfaction.

Definition

Why It Matters

In Practice

Related Topics

Further Reading

Primary vs Secondary Data

Definition

Why It Matters

In Practice

Related Topics

Further Reading