Data

As of 2 July 2026, we have processed 2382 original data sets containing a total of 2,458,328 records. The map below shows all locations for which we have at least one observation.


For ease of organization we divide the data into thematic groups. These are not mutually exclusive. For example, the first place to look for crop response to fertilizer data would be in the “agronomy” group. However, the “survey”, and “varieties” groups may also contain fertilizer application data. Likewise, the “varieties” group has data for comparing crop varieties, but variety names are also reported in the “agronomy” group. This means that you may want to consider using data from multiple groups.

The table below shows the current groups and the number of original datasets and records in each group. We also show these numbers for the datasets that have a Creative Commons (CC) license.

Group Datasets Records CC-Datasets CC-Records
agronomy_latest 287 435358 217 284645
pest_disease_latest 8 3225 6 2593
soil_samples_latest 35 24355 11 13333
survey_latest 82 441595 56 347104
varieties_cassava_latest 1467 229179 1467 229173
varieties_cowpea_latest 76 14629 76 14629
varieties_latest 72 196686 68 195710
varieties_maize_latest 79 79527 62 69703
varieties_potato_latest 56 30290 53 29471
varieties_wheat_latest 220 1003484 4 19234


Below, you can download the compiled standardized data that come with a Creative Commons license. You can create the full datasets yourself by following these instructions.

You can download data by group, or, if you want all available data, select “everything”. If you want data for a single data set, you can find these here. You can use R package caramba to integrate the data download into an R workflow.

Please note that for most survey data, we have currently only partially processed the data, and the original data sources may contains many more variables. The data available here are our first attempt to standardize widely variable data with lots of data quality issues. The data still contain errors from the original data that remain, and likely also errors that we have introduced.


Download



.csv (Comma Separated Values)
.xlsx (Excel)