Towards cyberbullying detection: Building, benchmarking and longitudinal analysis of aggressiveness and conflicts/attacks datasets from Twitter

Paula Alexandra Nunes da Costa Ferreira; Nádia Salgado Pereira; Hugo Rosa; Sofia Oliveira; Luísa Coheur; Sofia Mateus Francisco; Sidclay Souza; Ricardo Ribeiro; João Paulo Carvalho; Paula Paulino; Isabel Trancoso; Ana Margarida Veiga Simão

Ciência_Iscte Publicações Descrição Detalhada da Publicação

Artigo em revista científica Q1

Towards cyberbullying detection: Building, benchmarking and longitudinal analysis of aggressiveness and conflicts/attacks datasets from Twitter

Paula Alexandra Nunes da Costa Ferreira (Ferreira, P.); Nádia Salgado Pereira (Pereira, N.); Hugo Rosa (Rosa, H.); Sofia Oliveira (Oliveira, S.); Luísa Coheur (Coheur, L.); Sofia Mateus Francisco (Francisco, S.); Sidclay Souza (Souza, S.); Ricardo Ribeiro (Ribeiro, R.); João Paulo Carvalho (Carvalho, J. P.); Paula Paulino (Paulino, P.); Isabel Trancoso (Trancoso, I.); Ana Margarida Veiga Simão (Veiga-Simão, A. M.); et al.

Título Revista

IEEE Transactions on Affective Computing

Ano (publicação definitiva)

2024

Língua

Inglês

País

Estados Unidos da América

Mais Informação

Visitar Link

Web of Science®

Esta publicação não está indexada na Web of Science®

Scopus

N.º de citações: 0

(Última verificação: 2025-04-15 00:27)

Ver o registo na Scopus

Google Scholar

N.º de citações: 0

(Última verificação: 2025-04-13 15:55)

Ver o registo no Google Scholar

Overton

Esta publicação não está indexada no Overton

Abstract/Resumo

Offense and hate speech are a source of online conflicts which have become common in social media and, as such, their study is a growing topic of research in machine learning and natural language processing. This article presents two Portuguese language offense-related datasets that deepen the study of the subject: an Aggressiveness dataset and a Conflicts/Attacks dataset. While the former is similar to other offense detection related datasets, the latter constitutes a novelty due to the use of the history of the interaction between users. Several studies were carried out to construct and analyze the data in the datasets. The first study included gathering expressions of verbal aggression witnessed by adolescents to guide data extraction for the datasets. The second study included extracting data from Twitter (in Portuguese) that matched the most frequent expressions/words/sentences that were identified in the previous study. The third study consisted in the development of the Aggressiveness dataset, the Conflicts/Attacks dataset, and classification models. In our fourth study, we proposed to examine whether online aggression and conflicts/attacks revealed any trend changes over time with a sample of 86 adolescents. With this study, we also proposed to investigate whether the amount of tweets sent over a period of 273 days was related to online aggression and conflicts/attacks. Lastly, we analyzed the percentage of participants who participated in the aggressions and/or attacks/conflicts.

Agradecimentos/Acknowledgements

Palavras-chave

Aggression,Offense,Hate speech,Social networks,Natural language processing,Dataset

Classificação Fields of Science and Technology

Ciências da Computação e da Informação - Ciências Naturais

Registos de financiamentos

Referência de financiamento	Entidade Financiadora
PTDC/MHC/PED/3297/2014	Fundação para a Ciência e a Tecnologia
PTDC/PSI-GER/1918/2020	Fundação para a Ciência e a Tecnologia
UIDB/04527/2020	Fundação para a Ciência e a Tecnologia
UIDP/04527/2020	Fundação para a Ciência e a Tecnologia
UIDB/50021/2020	Fundação para a Ciência e a Tecnologia

Contribuições para os Objetivos do Desenvolvimento Sustentável das Nações Unidas

Com o objetivo de aumentar a investigação direcionada para o cumprimento dos Objetivos do Desenvolvimento Sustentável para 2030 das Nações Unidas, é disponibilizada no Ciência_Iscte a possibilidade de associação, quando aplicável, dos artigos científicos aos Objetivos do Desenvolvimento Sustentável. Estes são os Objetivos do Desenvolvimento Sustentável identificados pelo(s) autor(es) para esta publicação. Para uma informação detalhada dos Objetivos do Desenvolvimento Sustentável, clique aqui.

Identificadores da Publicação

Scopus (fonte: autor)	2-s2.0-85212842471
DOI (fonte: autor)	10.1109/TAFFC.2024.3518587
Scopus (fonte: Ciência_Iscte)	2-s2.0-85212842471
ID Ciência_Iscte	ci-pub-107041

Outros Detalhes da Publicação

Ano Publicação Online	2024
Editora	IEEE
Indexação	Scopus;
ISSN	1949-3045 (print) 1949-3045 (online)
ISBN	--
Factor de Impacto	--
Volume	N/A	Número
Série
Número Artigo
Páginas	--
Avaliado Cientificamente	Sim
Meio de Divulgação	Digital
Data Publicação (online)
Data Publicação (print)

Altmetric

Dimensions

PlumX Metrics

Captures

Readers: 9

see details