How to combine spatio-temporal information and danger theory for animal disease surveillance?
Boudoua B., Richard M., Roche M., Teisseire M., Tran A.. 2023. In : Conte Annamaria (ed.), Ippoliti Carla (ed.), Savini Lara (ed.). Abstract book of the GeoVet 2023 International Conference. Expanding boundaries: interdisciplinary Geospatial Research for the One Health Era, held in Silvi Marina, Teramo, Italy (September 19th-21st 2023). Teramo : Edizioni IZSTe-press, p. 8-9. International Conference of Spatial Epidemiology, Geostatistics and GIS applied to animal health, public health and food safety (GeoVet 2023), 2023-09-19/2023-09-21, Silvi Marina (Italie).
Event Based Surveillance (EBS) systems such as HealthMap, Promed and PADI-web are used daily to detect outbreak events reported in web articles. Once the articles are collected, these systems rely on human moderation, and implement supervised classification algorithms to classify articles according to their relevance (Valentin et al., 2020). Applying such supervised methods can be challenging, as epidemiological datasets have an imbalanced class distribution, and because the annotation task, which is critical to the success of these models, can be expensive and time consuming. Another important limitation of EBS systems is that the drivers of disease transmission (e.g. disease characteristics, environmental and epidemiological risk factors) are not always found in textual data and are therefore not taken into account by EBS systems (Kim & Ahn, 2021). In this context, we propose an unsupervised approach that relies on the spatio-temporal information of the reported epidemiological events, to classify articles while taking into account the environmental factors associated with disease onset through risk mapping. This method, called EpiDCA, is an adaptation of the Dendritic Cells Algorithm (DCA), inspired by the danger theory (Greensmith et al., 2008). EpiDCA is characterized by expert-defined parameters, making it applicable to different diseases and environmental contexts. The proposed method was first tested and evaluated using PADI- web and HealthMap datasets related to avian influenza (AI) in Asia between 2018 and 2019, and a suitability map for AI produced for the same area. To measure the accuracy of the model, we calculated the precision, recall and F-score. EpiDCA achieved a very good performance with an F-score of 0.70 and 0.90 for an imbalanced and a balanced dataset respectively. The results confirmed that considering disease risk factors is a good approach in event classification. EpiDCA was then compared with state-of-the-art supervised machine learning m
Documents associés
Communication de congrès
Agents Cirad, auteurs de cette publication :
- Roche Mathieu — Es / UMR TETIS
- Tran Annelise — Es / UMR TETIS