Non-peer-reviewed papers
Towards Using Machine Translation Techniques to Induce Multilingual Lexica of Discourse Markers
António Lopes (Lopes, A.); David Martins de Matos (de Matos, D.); Vera Cabarrão (Cabarrão, V.); Ricardo Ribeiro (Ribeiro, R.); Helena Moniz (Moniz, H.); Isabel Trancoso (Trancoso, I.); Ana Isabel Mata (Mata, Ana I.); et al.
Journal Title
arXiv
Year (definitive publication)
2015
Language
English
Country
--
More Information
Web of Science®

This publication is not indexed in Web of Science®

Scopus

This publication is not indexed in Scopus

Google Scholar

Times Cited: 12

(Last checked: 2024-11-17 14:40)

Abstract
Discourse markers are universal linguistic events subject to language variation. Although an extensive literature has already reported language-specific traits of these events, little has been said about their cross-language behavior or about building an inventory of multilingual lexica of discourse markers. This work describes new methods and approaches for the description, classification, and annotation of discourse markers in the specific domain of the Europarl corpus. The study of discourse markers in the context of translation is crucial due to the idiomatic nature of these structures. Multilingual lexica, together with the functional analysis of such structures, are useful tools for the hard task of translating discourse markers into possible equivalents from one language to another. Using Daniel Marcu's validated discourse markers for English, extracted from the Brown Corpus, our purpose is to build multilingual lexica of discourse markers for other languages, based on machine translation techniques. The major assumption in this study is that the usage of a discourse marker is independent of the language, i.e., the rhetorical function of a discourse marker in a sentence in one language is equivalent to the rhetorical function of the same discourse marker in another language.
Acknowledgements
--
Keywords
  • Computer and Information Sciences - Natural Sciences
  • Electrical Engineering, Electronic Engineering, Information Engineering - Engineering and Technology
  • Languages and Literature - Humanities
