Publications des agents du Cirad

Cirad

Multivariate regional deep learning prediction of soil properties from near-infrared, mid-infrared and their combined spectra

Nyawasha R., Wadoux A.M.J.C., Todoroff P., Chikowo R., Falconnier G., Lagorsse M., Corbeels M., Cardinael R.. 2024. Geoderma Regional, 37 : 11 p..

DOI: 10.1016/j.geodrs.2024.e00805

Artificial neural network (ANN) models have been successfully used in infrared spectroscopy research for the prediction of soil properties. They often show better performance than conventional methods such as partial least squares regression (PLSR). In this paper we develop and evaluate a multivariate extension of ANN for predicting correlated soil properties: total carbon (C), total nitrogen (N), clay, silt, and sand contents, using visible near-infrared (vis-NIR), mid-infrared (MIR) or combined spectra (vis-NIR + MIR). We hypothesize that accounting for the correlation through joint modelling of soil properties with a single model can eliminate “pedological chimera”: unrealistic values that may arise when properties are predicted independently such as when calculating ratio or soil texture values. We tested two types of ANN models, a univariate (ANN-UV) and a multivariate model (ANN-MV), using a dataset of 228 soil samples collected from Murehwa district in Zimbabwe at two soil depth intervals (0–20 and 20–40 cm). The models were compared with results from a univariate PLSR (PLSR-UV) model. We found that the multivariate ANN model was better at conserving the observed correlations between properties and consequently gave realistic soil C:N and C:Clay ratios, but that there was no improvement in prediction accuracy over using a univariate model (ANN or PLSR). The use of combined spectra (vis-NIR + MIR) did not make any significant improvements in prediction accuracy of the multivariate ANN model compared to using the vis-NIR or MIR only. We conclude that the multivariate ANN model is better suited for the prediction of multiple correlated soil properties and that it is flexible and can account for compositional constrains. The multivariate ANN model helps to keep realistic ratio values – with strong implications for assessment studies that make use of such predicted soil values.

Mots-clés : réseau de neurones; spectroscopie; caractéristiques du sol; apprentissage machine; technique de prévision; conservation des sols; sol argileux; zimbabwe; la réunion; france

Documents associés

Article (a-revue à facteur d'impact)

Agents Cirad, auteurs de cette publication :