A comprehensive review on automatic hate speech detection in the age of the transformer

Gil Ramos; Fernando Batista; Ricardo Ribeiro; Pedro Fialho; Sérgio Moro; António Fonseca; Rita Guerra; Paula Carvalho; Catarina Marques; Cláudia Silva

Review article Q1

Gil Ramos (Ramos, G.); Fernando Batista (Batista, F.); Ricardo Ribeiro (Ribeiro, R.); Pedro Fialho (Fialho, P.); Sérgio Moro (Moro, S.); António Fonseca (Fonseca, A.); Rita Guerra (Guerra, R.); Paula Carvalho (Carvalho, P.); Catarina Marques (Marques, C.); Cláudia Silva (Silva, C.); et al.

Journal Title

Social Network Analysis and Mining

Year (final publication)

2024

Language

English

Country

United Kingdom

Web of Science®

Number of citations: 7

(Last checked: 2026-04-19 02:37)

Article Impact Index: 1.2

Scopus

Number of citations: 16

(Last checked: 2026-04-11 08:44)

Article Impact Index: 1.8

Google Scholar

Number of citations: 32

(Last checked: 2026-04-19 07:40)

Overton

This publication is not indexed in Overton

Abstract

The rapid proliferation of hate speech on social media poses significant challenges to maintaining a safe and inclusive digital environment. This paper presents a comprehensive review of automatic hate speech detection methods, with a particular focus on the evolution of approaches from traditional machine learning and deep learning models to the more advanced Transformer-based architectures. We systematically analyze over 100 studies, comparing the effectiveness, computational requirements, and applicability of various techniques, including Support Vector Machines, Long Short-Term Memory networks, Convolutional Neural Networks, and Transformer models like BERT and its multilingual variants. The review also explores the datasets, languages, and sources used for hate speech detection, noting the predominance of English-focused research while highlighting emerging efforts in low-resource languages and cross-lingual detection using multilingual Transformers. Additionally, we discuss the role of generative and multi-task learning models as promising avenues for future development. While Transformer-based models consistently achieve state-of-the-art performance, this review underscores the trade-offs between performance and computational cost, emphasizing the need for context-specific solutions. Key challenges such as algorithmic bias, data scarcity, and the need for more standardized benchmarks are also identified. This review provides crucial insights for advancing the field of hate speech detection and shaping future research directions.

Keywords

Hate speech detection, Machine learning, Deep learning, Transfer learning, Transformers, Literature review

Fields of Science and Technology Classification

Computer and Information Sciences - Natural Sciences
Civil Engineering - Engineering and Technology
Communication Sciences - Social Sciences

Funding Records

Funding Reference	Funding Entity
CERV-2021-EQUAL (101049306)	European Commission

Publication Identifiers

WoS (source: Ciência_Iscte)	WOS:001329384400001
Scopus (source: author)	2-s2.0-85206238543
DOI (source: author)	10.1007/s13278-024-01361-3
Scopus (source: Ciência_Iscte)	2-s2.0-85206238543
Handle (source: Ciência-IUL)	http://hdl.handle.net/10071/32472
WoS (source: author)	WOS:001329384400001
Ciência_Iscte ID	ci-pub-106025

Other Publication Details

Online Publication Year	2024
Publisher	Springer
Indexing	Web of Science®; Scopus
ISSN	1869-5450 (print) 1869-5469 (online)
ISBN	--
Impact Factor	--
Volume	14	Issue	1
Series
Article Number	204
Pages	--
Peer Reviewed	Yes
ISCTE-IUL Repository	Link to the repository
Publication Date (online)
Publication Date (print)
