Cereal Yield Forecasting with Satellite Drought-Based Indices, Weather Data and Regional Climate Indices Using Machine Learning in Morocco

International audience Accurate seasonal forecasting of cereal yields is an important decision support tool for countries, such as Morocco, that are not self-sufficient in order to predict, as early as possible, importation needs. This study aims to develop an early forecasting model of cereal yield...

Full description

Bibliographic Details
Published in:Remote Sensing
Main Authors: Bouras, El Houssaine, Jarlan, Lionel, Er-Raki, Salah, Balaghi, Riad, Amazirh, Abdelhakim, Richard, Bastien, Khabba, Saïd
Other Authors: Université Cadi Ayyad Marrakech (UCA), Centre d'études spatiales de la biosphère (CESBIO), Institut de Recherche pour le Développement (IRD)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Université de Toulouse (UT)-Institut national des sciences de l'Univers (INSU - CNRS)-Observatoire Midi-Pyrénées (OMP), Université de Toulouse (UT)-Université de Toulouse (UT)-Institut national des sciences de l'Univers (INSU - CNRS)-Centre National d'Études Spatiales Toulouse (CNES)-Centre National de la Recherche Scientifique (CNRS)-Météo-France-Institut national des sciences de l'Univers (INSU - CNRS)-Centre National d'Études Spatiales Toulouse (CNES)-Centre National de la Recherche Scientifique (CNRS)-Météo-France-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement (INRAE), Institut national de la recherche agronomique Maroc (INRA Maroc), Université Mohammed VI Polytechnique Ben Guerir (UM6P), Gestion de l'Eau, Acteurs, Usages (UMR G-EAU), Centre de Coopération Internationale en Recherche Agronomique pour le Développement (Cirad)-Institut de Recherche pour le Développement (IRD)-AgroParisTech-Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement (INRAE)-Institut Agro - Montpellier SupAgro, Institut national d'enseignement supérieur pour l'agriculture, l'alimentation et l'environnement (Institut Agro)-Institut national d'enseignement supérieur pour l'agriculture, l'alimentation et l'environnement (Institut Agro), ARTS program from IRD, France, H2020 PRIMA ALTOS project, MISTRALS/SICMED2, PHC Toubkal - 39064WG/2018, PRIMA-IDEWA project, CHAAMS project - ERANETMED03-62, ACCWA project - 823965, SAGESSE - PPR/2015/48
Format: Article in Journal/Newspaper
Language:English
Published: HAL CCSD 2021
Subjects:
Online Access:https://hal.inrae.fr/hal-03326563
https://hal.inrae.fr/hal-03326563/document
https://hal.inrae.fr/hal-03326563/file/Bouras_2021_RS.pdf
https://doi.org/10.3390/rs13163101
Description
Summary:International audience Accurate seasonal forecasting of cereal yields is an important decision support tool for countries, such as Morocco, that are not self-sufficient in order to predict, as early as possible, importation needs. This study aims to develop an early forecasting model of cereal yields (soft wheat, barley and durum wheat) at the scale of the agricultural province considering the 15 most productive over 2000–2017 (i.e., 15 × 18 = 270 yields values). To this objective, we built on previous works that showed a tight linkage between cereal yields and various datasets including weather data (rainfall and air temperature), regional climate indices (North Atlantic Oscillation in particular), and drought indices derived from satellite observations in different wavelengths. The combination of the latter three data sets is assessed to predict cereal yields using linear (Multiple Linear Regression, MLR) and non-linear (Support Vector Machine, SVM; Random Forest, RF, and eXtreme Gradient Boost, XGBoost) machine learning algorithms. The calibration of the algorithmic parameters of the different approaches are carried out using a 5-fold cross validation technique and a leave-one-out method is implemented for model validation. The statistical metrics of the models are first analyzed as a function of the input datasets that are used, and as a function of the lead times, from 4 months to 2 months before harvest. The results show that combining data from multiple sources outperformed models based on one dataset only. In addition, the satellite drought indices are a major source of information for cereal prediction when the forecasting is carried out close to harvest (2 months before), while weather data and, to a lesser extent, climate indices, are key variables for earlier predictions. The best models can accurately predict yield in January (4 months before harvest) with an R2 = 0.88 and RMSE around 0.22 t. ha−1. The XGBoost method exhibited the best metrics. Finally, training a specific model separately for ...