Master's Dissertation
Anotação Morfossintáctica Desambiguada do Português
Ricardo Ribeiro (Ribeiro, R.);
Year (definitive publication)
2003
Language
Portuguese
Country
Portugal
More Information
Web of Science®

This publication is not indexed in Web of Science®

Scopus

This publication is not indexed in Scopus

Google Scholar

Times Cited: 34

(Last checked: 2024-11-17 14:40)

View record in Google Scholar

Abstract
In this thesis we present the development of a part-of-speech tagging system for Portuguese. The main motivation for the development of the system was the intention of using it as a component of a text-to-speech synthesis system. The architecture of the tagger comprehends a morphological analysis module and a morphossyntactic disambiguation module. The importance of the morphological analysis module draws from the fact that neolatin languages, such as Portuguese, are highly inflectional, which results in the lack of the necessary examples to develop reliable language models – the data sparseness problem. The morphossyntactic disambiguation module combines two different approaches: linguistic-oriented rule-based disambiguation and probabilistic disambiguation. The system was trained and tested using the annotated PAROLE corpus. The results achieved show that the presented architecture is well suited for European Portuguese. Although it is difficult to do a fundamented comparison between this and other taggers addressing the Portuguese language – since, for example, the tagsets are different and the used corpora were not the same – this system seems to achieve a better performance. Additionally, it is important to stress the efforts made to ensure the modularity of the system, allowing an easy interchange of modules and simplicity of integration in other systems.
Acknowledgements
--
Keywords
Natural language processing,Part-of-speech tagging,Morphossyntax,Corpus-based language modeling,Rule-based approach,Probabilistic approach
  • Computer and Information Sciences - Natural Sciences
  • Electrical Engineering, Electronic Engineering, Information Engineering - Engineering and Technology

With the objective to increase the research activity directed towards the achievement of the United Nations 2030 Sustainable Development Goals, the possibility of associating scientific publications with the Sustainable Development Goals is now available in Ciência-IUL. These are the Sustainable Development Goals identified by the author(s) for this publication. For more detailed information on the Sustainable Development Goals, click here.