Comparing different approaches for detecting hate speech in online Portuguese comments

Bernardo Cunha Matos; Raquel Bento Santos; Paula Carvalho; Ricardo Ribeiro; Fernando Batista

Publication in conference proceedings

Bernardo Cunha Matos (Matos, B. C.); Raquel Bento Santos (Santos, R. B.); Paula Carvalho (Carvalho, P.); Ricardo Ribeiro (Ribeiro, R.); Fernando Batista (Batista, F.)

OpenAccess Series in Informatics

Year (definitive publication)

2022

Language

English

Country

Germany

Web of Science®

This publication is not indexed in Web of Science®

Scopus

Times Cited: 3

(Last checked: 2025-04-02 02:22)

Google Scholar

Times Cited: 8

(Last checked: 2025-04-01 13:01)

Overton

This publication is not indexed in Overton

Abstract

Online Hate Speech (OHS) has been growing dramatically on social media, which has motivated researchers to develop a variety of methods for its automated detection. However, the detection of OHS in Portuguese remains little studied. To fill this gap, we explored different models that have proved successful in the literature for this task. In particular, we explored transfer learning approaches based on existing BERT-like pre-trained models. The experiments were based on CO-HATE, a corpus of YouTube comments posted by the Portuguese online community and manually labeled by different annotators. Among other categories, those comments were labeled for the presence of hate speech and for its type, specifically overt and covert hate speech. We assessed the impact of using annotations from different annotators on the performance of such models. In addition, we analyzed the impact of distinguishing overt and covert hate speech. The results show the importance of considering the annotator’s profile in the development of hate speech detection models. Regarding the hate speech type, the results do not allow us to draw any conclusion about which type is easier to detect. Finally, we show that pre-processing does not seem to have a significant impact on performance in this specific task.

Keywords

Hate speech, Text classification, Transfer learning, Supervised learning, Deep learning

Fields of Science and Technology Classification

Computer and Information Sciences - Natural Sciences
Languages and Literature - Humanities

Funding Records

Funding Reference	Funding Entity
HATE Covid-19 (Proj. 759274510)	Fundação para a Ciência e a Tecnologia
PTDC/CCI-CIF/32607/2017	Fundação para a Ciência e a Tecnologia
UIDB/50021/2020	Fundação para a Ciência e a Tecnologia

Publication Identifiers

Scopus (source: Ciência_Iscte)	2-s2.0-85136156722
DOI (source: author)	10.4230/OASIcs.SLATE.2022.10
Ciência_Iscte ID	ci-pub-89930
Handle (source: Ciência-IUL)	http://hdl.handle.net/10071/25974

Other Publication Details

Online Publication Year	2022
Publisher	Schloss Dagstuhl - Leibniz-Zentrum für Informatik GmbH, Dagstuhl Publishing
Indexes	Scopus
ISSN	2190-6807 (online)
ISBN	978-3-95977-245-7 (online)
Volume	104
Article Number	10
Pages	--
Total Pages	12
Peer Reviewed	Yes
Editors	Cordeiro, J., Pereira, M. J., Rodrigues, N. F., and Pais, S.
Event Title	11th Symposium on Languages, Applications and Technologies (SLATE 2022)
Event Organizer	Universidade da Beira Interior
City	Covilhã
Event Type	Conference
Event Classification	International
Event Year	2022
Event Publication Type	Full Paper

PlumX Metrics

Citations

Citation Indexes: 3

Captures

Readers: 13
