Artigo em revista científica Q1
A comprehensive review on automatic hate speech detection in the age of the transformer
Gil Ramos (Ramos, G.); Fernando Batista (Batista, F.); Ricardo Ribeiro (Ribeiro, R.); Pedro Fialho (Fialho, P.); Sérgio Moro (Moro, S.); António Fonseca (Fonseca, A.); Rita Guerra (Guerra, R.); Paula Carvalho (Carvalho, P.); Catarina Marques (Marques, C.); Cláudia Silva (Silva, C.); et al.
Título Revista
Social Network Analysis and Mining
Ano (publicação definitiva)
2024
Língua
Inglês
País
Reino Unido
Mais Informação
Web of Science®

N.º de citações: 0

(Última verificação: 2024-11-20 20:19)

Ver o registo na Web of Science®

Scopus

N.º de citações: 0

(Última verificação: 2024-11-21 07:25)

Ver o registo na Scopus

Google Scholar

N.º de citações: 0

(Última verificação: 2024-11-19 15:57)

Ver o registo no Google Scholar

Abstract/Resumo
The rapid proliferation of hate speech on social media poses significant challenges to maintaining a safe and inclusive digital environment. This paper presents a comprehensive review of automatic hate speech detection methods, with a particular focus on the evolution of approaches from traditional machine learning and deep learning models to the more advanced Transformer-based architectures. We systematically analyze over 100 studies, comparing the effectiveness, computational requirements, and applicability of various techniques, including Support Vector Machines, Long Short-Term Memory networks, Convolutional Neural Networks, and Transformer models like BERT and its multilingual variants. The review also explores the datasets, languages, and sources used for hate speech detection, noting the predominance of English-focused research while highlighting emerging efforts in low-resource languages and cross-lingual detection using multilingual Transformers. Additionally, we discuss the role of generative and multi-task learning models as promising avenues for future development. While Transformer-based models consistently achieve state-of-the-art performance, this review underscores the trade-offs between performance and computational cost, emphasizing the need for context-specific solutions. Key challenges such as algorithmic bias, data scarcity, and the need for more standardized benchmarks are also identified. This review provides crucial insights for advancing the field of hate speech detection and shaping future research directions.
Agradecimentos/Acknowledgements
--
Palavras-chave
Hate speech detection,Machine learning,Deep learning,Transfer learning,Transformers,Literature review
  • Ciências da Computação e da Informação - Ciências Naturais
  • Engenharia Civil - Engenharia e Tecnologia
  • Ciências da Comunicação - Ciências Sociais
Registos de financiamentos
Referência de financiamento Entidade Financiadora
CERV-2021-EQUAL (101049306) Comissão Europeia