Scientific journal paper Q1
Towards cyberbullying detection: Building, benchmarking and longitudinal analysis of aggressiveness and conflicts/attacks datasets from Twitter
Paula Alexandra Nunes da Costa Ferreira (Ferreira, P.); Nádia Salgado Pereira (Pereira, N.); Hugo Rosa (Rosa, H.); Sofia Oliveira (Oliveira, S.); Luísa Coheur (Coheur, L.); Sofia Mateus Francisco (Francisco, S.); Sidclay Souza (Souza, S.); Ricardo Ribeiro (Ribeiro, R.); João Paulo Carvalho (Carvalho, J. P.); Paula Paulino (Paulino, P.); Isabel Trancoso (Trancoso, I.); Ana Margarida Veiga Simão (Veiga-Simão, A. M.); et al.
Journal Title
IEEE Transactions on Affective Computing
Year (definitive publication)
2024
Language
English
Country
United States of America
More Information
Web of Science®: not indexed
Scopus: Times Cited: 0 (Last checked: 2025-03-03 10:15)
Google Scholar: Times Cited: 0 (Last checked: 2025-03-03 17:35)
Overton: not indexed

Abstract
Offense and hate speech are a source of online conflicts, which have become common in social media; as such, their study is a growing topic of research in machine learning and natural language processing. This article presents two Portuguese-language offense-related datasets that deepen the study of the subject: an Aggressiveness dataset and a Conflicts/Attacks dataset. While the former is similar to other offense-detection datasets, the latter is novel in that it uses the history of the interaction between users. Several studies were carried out to construct and analyze the data in the datasets. The first study gathered expressions of verbal aggression witnessed by adolescents to guide data extraction for the datasets. The second study extracted data from Twitter (in Portuguese) that matched the most frequent expressions, words, and sentences identified in the first study. The third study consisted of the development of the Aggressiveness dataset, the Conflicts/Attacks dataset, and classification models. In our fourth study, we set out to examine, with a sample of 86 adolescents, whether online aggression and conflicts/attacks revealed any trend changes over time. With this study, we also set out to investigate whether the number of tweets sent over a period of 273 days was related to online aggression and conflicts/attacks. Lastly, we analyzed the percentage of participants who engaged in the aggressions and/or conflicts/attacks.
Keywords
Aggression, Offense, Hate speech, Social networks, Natural language processing, Dataset
  • Computer and Information Sciences - Natural Sciences
Funding Records
Funding Reference         Funding Entity
PTDC/MHC/PED/3297/2014    Fundação para a Ciência e a Tecnologia
UIDB/50021/2020           Fundação para a Ciência e a Tecnologia
UIDP/04527/2020           Fundação para a Ciência e a Tecnologia
UIDB/04527/2020           Fundação para a Ciência e a Tecnologia
PTDC/PSI-GER/1918/2020    Fundação para a Ciência e a Tecnologia
