Publications des agents du Cirad


Disease outbreak documents as a source of queries for detection of signals of disease emergence on the Internet. [391]

Arsevska E., Lefrançois T., Lancelot R., Roche M., Chavernac D., Falala S., Hendrikx P., Dufour B.. 2015. In : 14th Conference of the International Society for Veterinary Epidemiology and Economics: planning our future. Mérida : ISVEE, 1 p.. ISVEE : Veterinary epidemiology and economics: Planning our future. 14, 2015-11-03/2015-11-07, Mérida (Mexique).

Timeliness and precision in detecting exotic animal infectious disease outbreaks is crucial for preventing their spread. In 2013, the French national platform for animal disease surveillance has set up an international epidemiological intelligence team (so-called VSI team) aiming at detecting, verifying and monitoring signals of disease emergence from different sources of information, including the Internet. We propose an innovative method for monitoring disease emergence on the Internet. It is based on 3sequential steps:1) web crawling,2) automatic classification of disease outbreak documents by machine learning approaches,3) extraction of information from documents(e.g., disease, number of cases, location, etc.).To query the web, the choice of relevant terms is crucial. For this purpose, we used text mining together with a collective domain expertise following a Delphi method. This approach allowed highlighting the relevant terms to detect signals of disease emergence on the Internet. We have applied it to detect documents addressing African swine fever (ASF) outbreaks(i.e. 123 dispatches from Google, and 45 from PubMed) written in English language, obtained for the period 2011-2014 with the baseline query ¿African swine fever outbreak¿. Based on 2400 terms extracted with the text-mining approach, our automatic search system associated with the collective domain expertise (i.e. evaluation of 20 groups of terms by 21 specialists) identified 3 groups of highly specific terms to detect signals of ASF emergence:1) haemorrhagic fever in Suidae, 2) mortality in Suidae and 3) swine fever. Implemented as complex queries, these groups of terms allowed finding previously undetected ASF outbreak articles with the baseline query (period 2011-14):3for each of groups 1 and 2, vs.54 for group 3.Monitoring disease emergence on the Internet is a promising method towards improved disease introduction risk assessment. Nevertheless, domain experts still play a central role. Our method is generic: we intend to evaluate it on data from other exotic infectious diseases and with real-time data stream. Should this evaluation be successful, the method might be routinely used by the VSI team. (Texte intégral)

Documents associés

Communication de congrès

Agents Cirad, auteurs de cette publication :