This dataset supports the replication of results presented in "Estimating the Direct and Indirect Effects of Improved Seed Adoption on Yields: Evidence from DNA-Fingerprinting, Crop Cuts, and Self-Reporting in Ethiopia" (2025) by Nina Jovanovic and Jacob Ricker-Gilbert, published in the Journal of Development Economics. The data originates from the 2018/19 Ethiopia Socio-economic Survey conducted by the Central Statistical Agency of Ethiopia.
The dataset combines DNA-fingerprinting of maize seeds, GPS-based plot size measurements, and crop cut data, facilitating robust comparisons with farmers’ self-reported data on seed varieties, land area, and harvested quantities.
An accompanying do-file contains the Stata code required to replicate the analysis presented in the paper. The dataset has been cleaned and transformed for analysis. The processing steps include:
Missing Data Management: Mean imputation for continuous variables.
Outlier Treatment: Winsorized crop yield data.
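The two processing steps above can be illustrated with a minimal sketch. This is not the authors' Stata code, and the description does not state which percentile cutoffs or which variables were used. The function names, the 1st/99th percentile defaults, and the sample values below are assumptions chosen only for illustration.

```python
from statistics import mean


def mean_impute(values):
    """Replace missing entries (None) with the mean of the observed values."""
    observed = [v for v in values if v is not None]
    if not observed:
        raise ValueError("cannot impute: no observed values")
    fill = mean(observed)
    return [fill if v is None else v for v in values]


def winsorize(values, lower=0.01, upper=0.99):
    """Clamp values to empirical lower/upper quantiles (nearest-rank).

    The default cutoffs are an assumption; the dataset description
    does not say which percentiles were applied to crop yields.
    """
    ordered = sorted(values)
    n = len(ordered)
    lo = ordered[round(lower * (n - 1))]
    hi = ordered[round(upper * (n - 1))]
    return [min(max(v, lo), hi) for v in values]


# Hypothetical yields in kg/ha, with one missing value and one extreme value.
yields = [1200.0, None, 1500.0, 1800.0, 25000.0]
imputed = mean_impute(yields)
cleaned = winsorize(imputed, lower=0.25, upper=0.75)
```

Users who need the unprocessed values should note that mean imputation reduces the variance of the affected variables and that winsorizing caps extreme yields rather than dropping them, so the number of observations is unchanged by either step.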
This comprehensive dataset and code package enable researchers to validate and build upon the study's findings.