Ciência_Iscte Comunicações Descrição Detalhada da Comunicação Exportar

Exportar Publicação

A publicação pode ser exportada nos seguintes formatos: referência da APA (American Psychological Association), referência do IEEE (Institute of Electrical and Electronics Engineers), BibTeX e RIS.

Exportar Referência (APA)

Ribeiro, E., Mamede, N. & J. Baptista (2024). Text Readability Assessment in European Portuguese: A Comparison of Classification and Regression Approaches. 16th International Conference on Computational Processing of Portuguese (PROPOR 2024).

Exportar Referência (IEEE)

E. A. Ribeiro et al.,  "Text Readability Assessment in European Portuguese: A Comparison of Classification and Regression Approaches", in 16th Int. Conf. on Computational Processing of Portuguese (PROPOR 2024), Santiago de Compostela, 2024

Exportar BibTeX

@misc{ribeiro2024_1777870593584,
	author = "Ribeiro, E. and Mamede, N. and J. Baptista",
	title = "Text Readability Assessment in European Portuguese: A Comparison of Classification and Regression Approaches",
	year = "2024",
	url = "https://propor2024.citius.gal/"
}

Exportar RIS

TY  - CPAPER
TI  - Text Readability Assessment in European Portuguese: A Comparison of Classification and Regression Approaches
T2  - 16th International Conference on Computational Processing of Portuguese (PROPOR 2024)
AU  - Ribeiro, E.
AU  - Mamede, N.
AU  - J. Baptista
PY  - 2024
CY  - Santiago de Compostela
UR  - https://propor2024.citius.gal/
AB  - The automatic assessment of text readability and the classification of texts by levels is essential for language education and language-related industries that rely on effective communication. In European Portuguese, most of the studies on this subject focus on identifying the level of texts used for proficiency evaluation purposes according to the Common European Framework of Reference for Languages (CEFR). However, the ordinal nature of the levels is not considered by the classification models used in those studies. In this paper, we address the problem as a regression task in an attempt to leverage that information. Our experiments using fine-tuned versions of a state-of-the-art foundation model for Portuguese show that addressing the problem as a regression task leads to improved performance in terms of adjacent accuracy and improved generalization ability to different kinds of textual data.
ER  -