Comparing different approaches for detecting hate speech in online Portuguese comments

Bernardo Cunha Matos; Raquel Bento Santos; Paula Carvalho; Ricardo Ribeiro; Fernando Batista

Ciência-IUL Publicações Descrição Detalhada da Publicação

Publicação em atas de evento científico

Comparing different approaches for detecting hate speech in online Portuguese comments

Bernardo Cunha Matos (Matos, B. C.); Raquel Bento Santos (Santos, R. B.); Paula Carvalho (Carvalho, P.); Ricardo Ribeiro (Ribeiro, R.); Fernando Batista (Batista, F.);

OpenAccess Series in Informatics

Ano (publicação definitiva)

2022

Língua

Inglês

País

Alemanha

Mais Informação

Visitar Link

Web of Science®

Esta publicação não está indexada na Web of Science®

Scopus

N.º de citações: 0

(Última verificação: 2024-04-26 11:00)

Ver o registo na Scopus

Google Scholar

N.º de citações: 2

(Última verificação: 2024-04-30 14:31)

Ver o registo no Google Scholar

Abstract/Resumo

Online Hate Speech (OHS) has been growing dramatically on social media, which has motivated researchers to develop a diversity of methods for its automated detection. However, the detection of OHS in Portuguese is still little studied. To fill this gap, we explored different models that proved to be successful in the literature to address this task. In particular, we have explored transfer learning approaches, based on existing BERT-like pre-trained models. The performed experiments were based on CO-HATE, a corpus of YouTube comments posted by the Portuguese online community that was manually labeled by different annotators. Among other categories, those comments were labeled regarding the presence of hate speech and the type of hate speech, specifically overt and covert hate speech. We have assessed the impact of using annotations from different annotators on the performance of such models. In addition, we have analyzed the impact of distinguishing overt and and covert hate speech. The results achieved show the importance of considering the annotator’s profile in the development of hate speech detection models. Regarding the hate speech type, the results obtained do not allow to make any conclusion on what type is easier to detect. Finally, we show that pre-processing does not seem to have a significant impact on the performance of this specific task.

Agradecimentos/Acknowledgements

Palavras-chave

Hate speech,Text classification,Transfer learning,Supervised learning,Deep learning

Classificação Fields of Science and Technology

Ciências da Computação e da Informação - Ciências Naturais
Línguas e Literaturas - Humanidades

Registos de financiamentos

Referência de financiamento	Entidade Financiadora
HATE Covid-19 (Proj. 759274510)	Fundação para a Ciência e a Tecnologia
PTDC/CCI-CIF/32607/2017	Fundação para a Ciência e a Tecnologia
UIDB/50021/2020	Fundação para a Ciência e a Tecnologia

Contribuições para os Objetivos do Desenvolvimento Sustentável das Nações Unidas

Com o objetivo de aumentar a investigação direcionada para o cumprimento dos Objetivos do Desenvolvimento Sustentável para 2030 das Nações Unidas, é disponibilizada no Ciência-IUL a possibilidade de associação, quando aplicável, dos artigos científicos aos Objetivos do Desenvolvimento Sustentável. Estes são os Objetivos do Desenvolvimento Sustentável identificados pelo(s) autor(es) para esta publicação. Para uma informação detalhada dos Objetivos do Desenvolvimento Sustentável, clique aqui.

Identificadores da Publicação

DOI (fonte: autor)	10.4230/OASIcs.SLATE.2022.10
Scopus (fonte: Ciência-IUL)	2-s2.0-85136156722
Handle (fonte: Ciência-IUL)	http://hdl.handle.net/10071/25974
ID Ciência-IUL	ci-pub-89930

Outros Detalhes da Publicação

Ano Publicação Online	2022
Editora	Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing
Indexação	Scopus;
ISSN	2190-6807 (online)
ISBN	978-3-95977-245-7 (online)
Volume	104
Número Artigo	10
Páginas	--	Total Páginas	12
Avaliado Cientificamente	Sim
Editores	Cordeiro, J., Pereira, M. J., Rodrigues, N. F., and Pais, S.
Título do Evento	11th Symposium on Languages, Applications and Technologies (SLATE 2022)
Organizador do Evento	Universidade da Beira Interior
Cidade	Covilhã
Tipo de Evento	Conferência
Classificação do Evento	Internacional
Ano do Evento	2022
Tipo de Publicação no Evento	Artigo Completo
Repositório ISCTE-IUL	Link para o repositório
Data Publicação (online)
Data Publicação (print)

Altmetric

Dimensions

PlumX Metrics