Publications des agents du Cirad

Cirad

How to exploit paralinguistic features to identify acronyms in texts?

Roche M.. 2014. In : Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, B. Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC'14): ISO Workshop on interopable semantic annotation, 2014, may 26-31, Reykjavik, Iceland. Paris : ELRA, p. 69-72. International Conference on Language Resources and Evaluation. 9, 2014-05-26/2014-05-31, Reykjavik (Islande).

This paper addresses the issue of acronym dictionary building. The first step of the process identifies acronym/definition candidates, the second one selects candidates based on a letter alignment method. This approach has two advantages because it enables (1) to annotate documents, (2) to build specific dictionaries. More precisely, this paper discusses the use of a specific linguistic concept, the gloss, in order to identify candidates. The proposed method based on paralinguistic markers is independent of languages.

Documents associés

Communication de congrès

Agents Cirad, auteurs de cette publication :