Export Publication

The publication can be exported in the following formats: APA (American Psychological Association) reference format, IEEE (Institute of Electrical and Electronics Engineers) reference format, BibTeX and RIS.

Export Reference (APA)
Ferreira, J., Chesi, C., Baldewijns, D., Dias, J., Braga, D., Pinto, F...Amadeu, F. (2014). Casa de la Lhéngua: A set of language resources and natural language processing tools for Mirandese . In Nicoletta Calzolari (Ed.), Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014). (pp. 536-540). Reykjavik: European Language Resources Association (ELRA).
Export Reference (IEEE)
F. J.P. et al.,  "Casa de la Lhéngua: A set of language resources and natural language processing tools for Mirandese ", in Proc. of the Ninth Int. Conf. on Language Resources and Evaluation (LREC 2014), Nicoletta Calzolari, Ed., Reykjavik, European Language Resources Association (ELRA), 2014, pp. 536-540
Export BibTeX
@inproceedings{j.p.2014_1716157007487,
	author = "Ferreira, J. and Chesi, C. and Baldewijns, D. and Dias, J. and Braga, D. and Pinto, F and Cho, H. and Correia, M. and Amadeu, F.",
	title = "Casa de la Lhéngua: A set of language resources and natural language processing tools for Mirandese ",
	booktitle = "Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014)",
	year = "2014",
	editor = "Nicoletta Calzolari",
	volume = "",
	number = "",
	series = "",
	pages = "536-540",
	publisher = "European Language Resources Association (ELRA)",
	address = "Reykjavik",
	organization = "European Language Resources Association",
	url = "https://aclanthology.org/volumes/L14-1/"
}
Export RIS
TY  - CPAPER
TI  - Casa de la Lhéngua: A set of language resources and natural language processing tools for Mirandese 
T2  - Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014)
AU  - Ferreira, J.
AU  - Chesi, C.
AU  - Baldewijns, D.
AU  - Dias, J.
AU  - Braga, D.
AU  - Pinto, F
AU  - Cho, H.
AU  - Correia, M.
AU  - Amadeu, F.
PY  - 2014
SP  - 536-540
CY  - Reykjavik
UR  - https://aclanthology.org/volumes/L14-1/
AB  - This paper describes the efforts for the construction of Language Resources and NLP tools for Mirandese, a minority language
spoken in North-eastern Portugal, now available on a community-led portal, Casa de la Lhéngua. The resources were developed in
the context of a collaborative citizenship project led by Microsoft, in the context of the creation of the first TTS system for
Mirandese. Development efforts encompassed the compilation of a corpus with over 1M tokens, the construction of a GTP system,
syllable-division, inflection and a Part-of-Speech (POS) tagger modules, leading to the creation of an inflected lexicon of about
200.000 entries with phonetic transcription, detailed POS tagging, syllable division, and stress mark-up. Alongside these tasks,
which were made easier through the adaptation and reuse of existing tools for closely related languages, a casting for voice talents
among the speaking community was conducted and the first speech database for speech synthesis was recorded for Mirandese. These
resources were combined to fulfil the requirements of a well-tested statistical parameter synthesis model, leading to an intelligible
voice font. These language resources are available freely at Casa de la Lhéngua, aiming at promoting the development of real-life
applications and fostering linguistic research on Mirandese.
ER  -