TermMethods

Primary vs Secondary Data

Primary data is collected firsthand for a specific purpose; secondary data is existing data repurposed for new analysis. Each has distinct trade-offs in cost, timeliness, and relevance.

3 min read
Also known as:Primary dataSecondary dataFirsthand dataExisting data

Definition

Primary data is information you collect firsthand for a specific M&E purpose — through surveys, interviews, focus groups, or direct observation. Secondary data is existing data collected by someone else for a different purpose that you repurpose for your analysis.

The distinction matters because each type carries different trade-offs. Primary data gives you precise control over what you measure, how you measure it, and the quality standards applied — but it requires significant time and resources. Secondary data can provide immediate insights at minimal cost, but may not align perfectly with your indicators or context.

Why It Matters

Choosing between primary and secondary data is one of the most important cost-benefit decisions in M&E. Misjudging this choice leads to either wasteful data collection when existing data would suffice, or flawed analysis based on data that doesn't answer your evaluation questions.

Secondary data is particularly valuable for baseline studies (historical data can establish pre-intervention conditions), triangulation (independent sources strengthen validity), and rapid assessments where time or budget constraints rule out full primary collection. However, relying exclusively on secondary data risks measuring the wrong things — donor indicators, administrative records, or published statistics rarely match your programme's specific theory of change.

In Practice

Primary data collection includes:

  • Programme-specific survey design and implementation
  • Key informant interviews with programme beneficiaries and staff
  • Focus group discussions on programme experiences
  • Direct observation of activities and outputs
  • Most significant change stories collected through your monitoring system

Secondary data sources include:

  • Government census and household surveys (DHS, LSMS, EMIS)
  • Administrative records (health facility reports, school attendance logs)
  • Donor databases and sector-wide statistics
  • Academic research and evaluation reports from similar programmes
  • Satellite imagery and remote sensing data
  • Social media and digital platform analytics

Best practice: Use a mixed approach. Secondary data establishes context and benchmarks; primary data captures programme-specific outcomes. For example, a health programme might use national DHS data for baseline health indicators while collecting primary data on programme-specific service quality and patient satisfaction.

Related Topics

Further Reading


Last updated: 2026-02-27