Publications des agents du Cirad

Cirad

Using features based on elongation to enhance sentiment analysis

Rafae A., Erritali M., Madani Y., Roche M.. 2023. In : Lossio-Ventura Juan Antonio (ed.), Valverde-Rebaza Jorge (ed.), Díaz Eduardo (ed.) , Alatrista-Salas Hugo (ed.). Information management and big data. Cham : Springer, p. 70-81. (Communications in Computer and Information Science, 1837). Annual International Conference on Information Management and Big Dat (SIMBig 2022). 9, 2022-11-16/2022-11-18, Lima (Pérou).

DOI: 10.1007/978-3-031-35445-8_6

Elongated words such as “heellloo” or “heyyy” are a frequent feature of oral communication and are often used to underline or exaggerate the hidden message of the root word. Although elongated words are rarely found in written languages and dictionaries, they are common in social networks. They can be considered in the analysis of users' sentiments. In this paper, we analyze the impact of elongation on the classification of sentiments, in addition to an in-depth study at the level of lexical forms of elongation. In this work we present a method to improve the accuracy of sentiment classification based on elongations of features. Experimental results conducted on Twitter data show that our model achieves an accuracy of 0.79 in 7-fold cross-validation experiments.

Documents associés

Communication de congrès

Agents Cirad, auteurs de cette publication :