Publications des agents du Cirad

Cirad

Identifying associations between epidemiological entities in news data for animal disease surveillance

Valentin S., Lancelot R., Roche M.. 2021. Artificial Intelligence in Agriculture, 5 : p. 163-174.

DOI: 10.1016/j.aiia.2021.07.003

Event-based surveillance systems are at the crossroads of human and animal (and plant and ecosystem) health, epidemiology, statistics, and informatics. Thus, their deployment faces many challenges specific to each domain and their intersections, such as relations among automation, artificial intelligence, and expertise. In this context, our work pertins to the extraction of epidemiological events in textual data (i.e. news) by unsupervised methods. We define the event extraction task as detecting pairs of epidemiological entities (e.g. a disease name and location). The quality of the ranked lists of pairs was evaluated using specific ranking evaluation metrics. We used a publicly available annotated corpus of 438 documents (i.e. news articles) related to animal disease events. The statistical approach was able to detect event-related pairs of epidemiological features with a good trade-off between precision and recall. Our results showed that using a window of words outperformed document-based and sentence-based approaches, while reducing the probability of detecting false pairs. Our results indicated that Mutual Information was less adapted than the Dice coefficient for ranking pairs of features in the event extraction framework. We believe that Mutual Information would be more relevant for rare pair detection (i.e. weak signals), but requires higher manual curation to avoid false positive extraction pairs. Moreover, generalising the country-level spatial features enabled better discrimination (i.e. ranking) of relevant disease-location pairs for event extraction.

Mots-clés : épidémiologie; surveillance épidémiologique; fouille de textes; maladie des animaux; santé animale; analyse de données; données spatiales; one health; données textuelles

Documents associés

Article (b-revue à comité de lecture)

Agents Cirad, auteurs de cette publication :