Export Publication

The publication can be exported in the following formats: APA (American Psychological Association) reference format, IEEE (Institute of Electrical and Electronics Engineers) reference format, BibTeX and RIS.

Export Reference (APA)
Matos, B. C., Santos, R. B., Carvalho, P., Ribeiro, R. & Batista, F. (2022). Comparing different approaches for detecting hate speech in online Portuguese comments. In Cordeiro, J., Pereira, M. J., Rodrigues, N. F., and Pais, S. (Ed.), OpenAccess Series in Informatics. Covilhã: Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing.
Export Reference (IEEE)
M. B. Cunha et al.,  "Comparing different approaches for detecting hate speech in online Portuguese comments", in OpenAccess Series in Informatics, Cordeiro, J., Pereira, M. J., Rodrigues, N. F., and Pais, S., Ed., Covilhã, Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing, 2022, vol. 104
Export BibTeX
@inproceedings{cunha2022_1716030877024,
	author = "Matos, B. C. and Santos, R. B. and Carvalho, P. and Ribeiro, R. and Batista, F.",
	title = "Comparing different approaches for detecting hate speech in online Portuguese comments",
	booktitle = "OpenAccess Series in Informatics",
	year = "2022",
	editor = "Cordeiro, J., Pereira, M. J., Rodrigues, N. F., and Pais, S.",
	volume = "104",
	number = "",
	series = "",
	doi = "10.4230/OASIcs.SLATE.2022.10",
	publisher = "Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing",
	address = "Covilhã",
	organization = "Universidade da Beira Interior",
	url = "https://drops.dagstuhl.de/opus/portals/oasics/index.php?semnr=16249"
}
Export RIS
TY  - CPAPER
TI  - Comparing different approaches for detecting hate speech in online Portuguese comments
T2  - OpenAccess Series in Informatics
VL  - 104
AU  - Matos, B. C.
AU  - Santos, R. B.
AU  - Carvalho, P.
AU  - Ribeiro, R.
AU  - Batista, F.
PY  - 2022
DO  - 10.4230/OASIcs.SLATE.2022.10
CY  - Covilhã
UR  - https://drops.dagstuhl.de/opus/portals/oasics/index.php?semnr=16249
AB  - Online Hate Speech (OHS) has been growing dramatically on social media, which has motivated researchers to develop a diversity of methods for its automated detection. However, the detection of OHS in Portuguese is still little studied. To fill this gap, we explored different models that proved to be successful in the literature to address this task. In particular, we have explored transfer learning approaches, based on existing BERT-like pre-trained models. The performed experiments were based on CO-HATE, a corpus of YouTube comments posted by the Portuguese online community that was manually labeled by different annotators. Among other categories, those comments were labeled regarding the presence of hate speech and the type of hate speech, specifically overt and covert hate speech. We have assessed the impact of using annotations from different annotators on the performance of such models. In addition, we have analyzed the impact of distinguishing overt and and covert hate speech. The results achieved show the importance of considering the annotator’s profile in the development of hate speech detection models. Regarding the hate speech type, the results obtained do not allow to make any conclusion on what type is easier to detect. Finally, we show that pre-processing does not seem to have a significant impact on the performance of this specific task.
ER  -