Publications des agents du Cirad

Cirad

Towards a (semi-)automatic urban planning rule identification in the french language

Koptelov M., Holveck M., Crémilleux B., Reynaud J., Roche M., Teisseire M.. 2023. In : 2023 IEEE 10th International Conference on Data Science and Advanced Analytics (DSAA 2023). New York : IEEE, p. 396-405. IEEE International Conference on Data Science and Advanced Analytics (DSAA). 10, 2023-10-09/2023-10-13, Thessaloniki (Grèce).

DOI: 10.57745/DWYGMB

DOI: 10.1109/DSAA60987.2023.10302561

ne of the objectives of the Hérelles project is to find new mechanisms to facilitate the labeling (or semantization) of clusters from time series of satellite images. To achieve this, a proposed solution is to associate textual elements of interest with satellite data. The first step in this process consists of an automatic extraction of the information in the form of rules from urban planning documents composed in the French language. To address this challenge, we propose a method which is based on the multi-label classification of textual segments. It includes a special format for representing segments, in which each segment has a title and a subtitle. In addition, we propose a cascade approach aiming to deal with hierarchy of class labels. Finally, we develop several text augmentation techniques for the texts in French, which are able to improve the prediction results. We demonstrate experimentally that the resulting framework correctly classifies each type of segment with more than 90% of accuracy.

Documents associés

Communication de congrès

Agents Cirad, auteurs de cette publication :