A comprehensive review on automatic hate speech detection in the age of the transformer

Gil Ramos; Fernando Batista; Ricardo Ribeiro; Pedro Fialho; Sérgio Moro; António Fonseca; Rita Guerra; Paula Carvalho; Catarina Marques; Cláudia Silva

Ciência_Iscte Publications Publication Detailed Description

Review article Q1

A comprehensive review on automatic hate speech detection in the age of the transformer

Gil Ramos (Ramos, G.); Fernando Batista (Batista, F.); Ricardo Ribeiro (Ribeiro, R.); Pedro Fialho (Fialho, P.); Sérgio Moro (Moro, S.); António Fonseca (Fonseca, A.); Rita Guerra (Guerra, R.); Paula Carvalho (Carvalho, P.); Catarina Marques (Marques, C.); Cláudia Silva (Silva, C.); et al.

Journal Title

Social Network Analysis and Mining

Year (definitive publication)

2024

Language

English

Country

United Kingdom

More Information

Visit Link

Web of Science®

Times Cited: 2

(Last checked: 2025-12-18 17:06)

View record in Web of Science®

Article Impact Index: 0.7

Scopus

Times Cited: 2

(Last checked: 2025-12-12 01:53)

View record in Scopus

Article Impact Index: 0.4

Google Scholar

Times Cited: 13

(Last checked: 2025-12-19 18:44)

View record in Google Scholar

Overton

This publication is not indexed in Overton

Abstract

The rapid proliferation of hate speech on social media poses significant challenges to maintaining a safe and inclusive digital environment. This paper presents a comprehensive review of automatic hate speech detection methods, with a particular focus on the evolution of approaches from traditional machine learning and deep learning models to the more advanced Transformer-based architectures. We systematically analyze over 100 studies, comparing the effectiveness, computational requirements, and applicability of various techniques, including Support Vector Machines, Long Short-Term Memory networks, Convolutional Neural Networks, and Transformer models like BERT and its multilingual variants. The review also explores the datasets, languages, and sources used for hate speech detection, noting the predominance of English-focused research while highlighting emerging efforts in low-resource languages and cross-lingual detection using multilingual Transformers. Additionally, we discuss the role of generative and multi-task learning models as promising avenues for future development. While Transformer-based models consistently achieve state-of-the-art performance, this review underscores the trade-offs between performance and computational cost, emphasizing the need for context-specific solutions. Key challenges such as algorithmic bias, data scarcity, and the need for more standardized benchmarks are also identified. This review provides crucial insights for advancing the field of hate speech detection and shaping future research directions.

Acknowledgements

Keywords

Hate speech detection,Machine learning,Deep learning,Transfer learning,Transformers,Literature review

Fields of Science and Technology Classification

Computer and Information Sciences - Natural Sciences
Civil Engineering - Engineering and Technology
Media and Communications - Social Sciences

Funding Records

Funding Reference	Funding Entity
CERV-2021-EQUAL (101049306)	Comissão Europeia

Publication Identifiers

Scopus (source: Ciência_Iscte)	2-s2.0-85206238543
DOI (source: author)	10.1007/s13278-024-01361-3
WoS (source: Ciência_Iscte)	WOS:001329384400001
WoS (source: author)	WOS:001329384400001
Scopus (source: author)	2-s2.0-85206238543
Ciência_Iscte ID	ci-pub-106025
Handle (source: Ciência-IUL)	http://hdl.handle.net/10071/32472

Other Publication Details

Online Publication Year	2024
Publisher	Springer
Indexes	Web of Science©; Scopus;
ISSN	1869-5450 (print) 1869-5469 (online)
ISBN	--
Impact Factor	--
Volume	14	Number	1
Series
Article Number	204
Pages	--
Peer Reviewed	Yes
ISCTE-IUL Repository	Link to the repository
Publication Date (online)
Publication Date (print)

Altmetric

Dimensions

PlumX Metrics