Geostatistical indexes for quantifying spatial information on hyperspectral imaging: an application for the evaluation of regression models
Herrero-Langreo A., Gorretta N., Tisseyre B., Gowen A., Jun-Li Xu, Chaix G., Roger J.M. 2019. In: Livret des résumés des 20èmes Rencontres HélioSPIR. Montpellier: HélioSPIR, p. 20-21. Rencontres HélioSPIR. 20, 2019-10-14/2019-10-15, Montpellier (France).
Hyperspectral (HS) images contain both spectral and spatial information about a sample. Typically, spectral information can be related to chemical and physical properties through multivariate regression models. Applying these models to HS images produces prediction maps, which provide an estimate of the modelled chemical information for each pixel of the image. This approach is widely applied in the food processing industries for online monitoring of product quality and process control. One of the main difficulties of an imaging setup is that the size of a pixel is usually much smaller than the area required to obtain a wet chemical reference measurement. This means that, as opposed to point spectroscopy, the performance of the estimates cannot be evaluated by directly comparing observed and estimated values for each pixel. Moreover, the choice of regression model parameters, such as the number of latent variables (LV) in a partial least squares (PLS) model, cannot be assessed on a pixel basis either. Nonetheless, unlike point spectroscopy, HS imaging does provide information on the spatial distribution of the predicted values. The objective of this work is to propose a quantitative approach that uses the spatial information of prediction maps to support the evaluation of regression models applied to HS images. This approach is based on geostatistical indexes, which decompose the total variance of the prediction maps into two components: non-spatially structured and spatially structured variance, represented respectively by the nugget effect (C0) and the partial sill (C1). This strategy was tested on a simulated dataset and two real case studies. Geostatistical indexes of the prediction maps were compared to model performance metrics for PLS models with an increasing number of LVs. As a result, this work establishes a connection between linear regression model performance estimates and the spatial
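The variance decomposition described in the abstract can be illustrated with a minimal sketch. It assumes the prediction map is a 2D NumPy array on a regular pixel grid. The function names, the linear extrapolation used for the nugget (C0) and the plateau average used for the sill are illustrative assumptions, not the authors' procedure; in practice a variogram model would normally be fitted to the empirical semivariogram.

```python
import numpy as np


def empirical_semivariogram(z, max_lag):
    """Semivariance gamma(h) for h = 1..max_lag pixels, pooling row and column directions."""
    gammas = []
    for h in range(1, max_lag + 1):
        d_rows = z[h:, :] - z[:-h, :]   # pixel pairs separated by h rows
        d_cols = z[:, h:] - z[:, :-h]   # pixel pairs separated by h columns
        sq = np.concatenate([d_rows.ravel() ** 2, d_cols.ravel() ** 2])
        gammas.append(0.5 * sq.mean())
    return np.array(gammas)


def nugget_and_partial_sill(gamma, n_plateau=5):
    """Crude (assumed) estimates of the nugget C0 and the partial sill C1.

    C0: linear extrapolation of gamma(1), gamma(2) back to lag 0, clipped at 0.
    Sill: mean semivariance over the last n_plateau lags; C1 = sill - C0.
    """
    c0 = max(2.0 * gamma[0] - gamma[1], 0.0)
    sill = gamma[-n_plateau:].mean()
    c1 = max(sill - c0, 0.0)
    return c0, c1


def nugget_ratio(z, max_lag=30):
    """Share of non-spatially structured variance, C0 / (C0 + C1)."""
    c0, c1 = nugget_and_partial_sill(empirical_semivariogram(z, max_lag))
    return c0 / (c0 + c1)


# Hypothetical maps: a smooth, spatially structured field and pure white noise.
rng = np.random.default_rng(0)
ii, jj = np.meshgrid(np.arange(100), np.arange(100), indexing="ij")
smooth_map = np.sin(ii / 10.0) + np.cos(jj / 10.0)
noise_map = rng.normal(0.0, 1.0, size=(100, 100))

ratio_smooth = nugget_ratio(smooth_map)
ratio_noise = nugget_ratio(noise_map)
ratio_mixed = nugget_ratio(smooth_map + 0.5 * noise_map)
print(ratio_smooth, ratio_mixed, ratio_noise)
```

In this sketch a spatially structured map yields a low nugget ratio and an unstructured (noisy) map a high one, which is the kind of contrast the geostatistical indexes are meant to quantify on prediction maps.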
Associated documents
Conference paper
Cirad staff who authored this publication:
- Chaix Gilles — Bios / UMR AGAP