Publications by Cirad staff

Evaluating formal concept analysis software for anomaly detection and correction

Saab N., Huchard M., Martin P. 2022. In: Cordero Pablo (ed.), Krídlo Ondrej (ed.). Proceedings of the Sixteenth International Conference on Concept Lattices and Their Applications (CLA 2022). Aachen: CEUR-WS, p. 217-218. (CEUR Workshop Proceedings, 3308). International Conference on Concept Lattices and their Applications (CLA 2022). 16, 2022-06-20/2022-06-22, Tallinn (Estonia).

Data cleaning is a process that precedes data mining. In our dataset on pesticidal plant use in particular, several types of anomalies were identified, ranging from incorrect values to missing data likely to lead users to draw wrong conclusions when exploring the dataset. The literature presents three methods based on Formal Concept Analysis (FCA) that may allow anomalies to be detected and corrected: implication rule computation, association rule computation, and attribute exploration. This paper evaluates 30 FCA-based software tools and the features they offer that are relevant to developing an anomaly detection and correction method applicable to our dataset. Results show that only ConExp and its reimplementations provide all three methods. Since the data model on plant use is relational but ConExp only accepts formal contexts as input, this paper concludes that integrating Relational Concept Analysis (RCA) with ConExp is an important direction for future work.

Associated documents

Conference paper

Cirad staff who authored this publication: