Export Publication

The publication can be exported in the following formats: APA (American Psychological Association) reference, IEEE (Institute of Electrical and Electronics Engineers) reference, BibTeX, and RIS.

Export Reference (APA)
Ribeiro, E., Ribeiro, R., & de Matos, D. M. (2018). End-to-end multi-level dialog act recognition. In Antonio Bonafonte, Jordi Luque, & Francesc Alías Pujol (Eds.), IberSPEECH 2018 (pp. 301-305). Barcelona: ISCA.
Export Reference (IEEE)
E. Ribeiro, R. Ribeiro, and D. M. de Matos, "End-to-end multi-level dialog act recognition," in IberSPEECH 2018, Antonio Bonafonte, Jordi Luque, and Francesc Alías Pujol, Eds., Barcelona, ISCA, 2018, pp. 301-305.
Export BibTeX
@inproceedings{ribeiro2018_1714811214742,
	author = "Ribeiro, E. and Ribeiro, R. and de Matos, D. M.",
	title = "End-to-end multi-level dialog act recognition",
	booktitle = "IberSPEECH 2018",
	year = "2018",
	editor = "Antonio Bonafonte and Jordi Luque and Francesc Alías Pujol",
	volume = "",
	number = "",
	series = "",
	doi = "10.21437/IberSPEECH.2018-63",
	pages = "301--305",
	publisher = "ISCA",
	address = "Barcelona",
	organization = "",
	url = "https://www.isca-speech.org/archive/IberSPEECH_2018/abstracts/IberS18_O5-5_Ribeiro.html"
}
Export RIS
TY  - CPAPER
TI  - End-to-end multi-level dialog act recognition
T2  - IberSPEECH 2018
AU  - Ribeiro, E.
AU  - Ribeiro, R.
AU  - de Matos, D. M.
PY  - 2018
SP  - 301
EP  - 305
DO  - 10.21437/IberSPEECH.2018-63
CY  - Barcelona
UR  - https://www.isca-speech.org/archive/IberSPEECH_2018/abstracts/IberS18_O5-5_Ribeiro.html
AB  - The three-level dialog act annotation scheme of the DIHANA corpus poses a multi-level classification problem in which the bottom levels allow multiple or no labels for a single segment. We approach automatic dialog act recognition on the three levels using an end-to-end approach, in order to implicitly capture relations between them. Our deep neural network classifier uses a combination of word- and character-based segment representation approaches, together with a summary of the dialog history and information concerning speaker changes. We show that it is important to specialize the generic segment representation in order to capture the most relevant information for each level. On the other hand, the summary of the dialog history should combine information from the three levels to capture dependencies between them. Furthermore, the labels generated for each level help in the prediction of those of the lower levels. Overall, we achieve results which surpass those of our previous approach using the hierarchical combination of three independent per-level classifiers. Furthermore, the results even surpass the results achieved on the simplified version of the problem approached by previous studies, which neglected the multi-label nature of the bottom levels and only considered the label combinations present in the corpus.
ER  -
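
An RIS record like the one above is line-oriented: each line holds a two-character tag, two spaces, a hyphen, and a value, and the record ends at the `ER` tag. As a minimal sketch of how such a record could be read, the Python below parses one record into a dictionary. The names `parse_ris_record`, `RIS_LINE`, and `REPEATABLE` are hypothetical and chosen for illustration, and the choice to treat only `AU`, `ED`, and `KW` as repeatable tags is an assumption, not something this page specifies.

```python
import re

# One RIS line: two-character tag, two spaces, a hyphen, then an optional value.
RIS_LINE = re.compile(r"^([A-Z][A-Z0-9])  -(?: (.*))?$")

# Assumption for this sketch: only these tags are collected into lists.
REPEATABLE = {"AU", "ED", "KW"}


def parse_ris_record(text):
    """Parse a single RIS record into a dict; repeatable tags become lists."""
    record = {}
    for line in text.splitlines():
        match = RIS_LINE.match(line.rstrip())
        if match is None:
            continue  # skip blank or malformed lines
        tag = match.group(1)
        value = (match.group(2) or "").strip()
        if tag == "ER":
            break  # end-of-record marker
        if tag in REPEATABLE:
            record.setdefault(tag, []).append(value)
        else:
            record[tag] = value
    return record


# A shortened copy of the record above, used only to exercise the parser.
sample = """TY  - CPAPER
TI  - End-to-end multi-level dialog act recognition
AU  - Ribeiro, E.
AU  - Ribeiro, R.
AU  - de Matos, D. M.
PY  - 2018
ER  -
"""

record = parse_ris_record(sample)
print(record["TI"])
print(record["AU"])
```

Collecting the author lines into a list keeps their order, which matters when the record is later rendered as a citation. Multi-line values and multiple records per file are not handled in this sketch.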