Export Publication

The publication can be exported in the following formats: APA (American Psychological Association) reference format, IEEE (Institute of Electrical and Electronics Engineers) reference format, BibTeX and RIS.

Export Reference (APA)
Hämäläinen, A., Pinto, F. M., Rodrigues, S., Júdice, A., Silva, S. M., Calado, A., & Dias, M. S. (2013). A multimodal educational game for 3-10-year-old children: Collecting and automatically recognising European Portuguese children’s speech. In P. Badin, T. Hueber, G. Bailly, D. Demolin, & F. Raby (Eds.), 2013 ISCA International Workshop on Speech and Language Technology in Education (SLaTE 2013) (pp. 31-36). Grenoble, France: International Speech Communication Association (ISCA).
Export Reference (IEEE)
A. Hämäläinen et al., "A multimodal educational game for 3-10-year-old children: Collecting and automatically recognising European Portuguese children’s speech," in 2013 ISCA Int. Workshop on Speech and Language Technology in Education (SLaTE 2013), P. Badin, T. Hueber, G. Bailly, D. Demolin, and F. Raby, Eds., Grenoble, France: International Speech Communication Association (ISCA), 2013, pp. 31-36.
Export BibTeX
@inproceedings{hämäläinen2013_1716157642387,
	author = "Hämäläinen, A. and Pinto, F. M. and Rodrigues, S. and Júdice, A. and Silva, S. M. and Calado, A. and Dias, M. S.",
	title = "A multimodal educational game for 3-10-year-old children: Collecting and automatically recognising European Portuguese children’s speech",
	booktitle = "2013 ISCA International Workshop on Speech and Language Technology in Education (SLaTE 2013)",
	year = "2013",
	editor = "Badin, P. and Hueber, T. and Bailly, G. and Demolin, D. and Raby, F.",
	pages = "31--36",
	publisher = "International Speech Communication Association (ISCA)",
	address = "Grenoble, France",
	organization = "GIPSA-lab and LIDILEM with the ISCA-SLaTE group",
	url = "https://www.isca-speech.org/archive/slate_2013/"
}
Export RIS
TY  - CPAPER
TI  - A multimodal educational game for 3-10-year-old children: Collecting and automatically recognising European Portuguese children’s speech
T2  - 2013 ISCA International Workshop on Speech and Language Technology in Education (SLaTE 2013)
AU  - Hämäläinen, A.
AU  - Pinto, F. M.
AU  - Rodrigues, S.
AU  - Júdice, A.
AU  - Silva, S. M.
AU  - Calado, A.
AU  - Dias, M. S.
PY  - 2013
SP  - 31
EP  - 36
CY  - Grenoble, France
UR  - https://www.isca-speech.org/archive/slate_2013/
AB  - Speech interfaces have tremendous potential in education. In this paper, we present our work in the Contents for Next Generation Networks project, an ongoing Portuguese industry-academia collaboration developing a multimodal educational game aimed at improving the physical coordination and the basic mathematical and musical skills of 3-10-year-old children. We focus on our work in the area of children's speech recognition: designing, collecting, transcribing and annotating a 21-hour corpus of prompted European Portuguese children's speech, as well as our first experiments with different acoustic modelling approaches. Our speech recognition results suggest that training children's speech models from scratch is a more promising approach than retraining adult speech models using children's speech when a sufficient amount of training data is available from the targeted age group. This finding also holds for adult female speech models retrained using children's speech. As compared with a baseline recogniser comprising gender-dependent adult speech models, the best-performing children's speech models that we have trained so far – gender-independent cross-word triphones trained with 17.5 hours of speech from 3-10-year-old children – resulted in a 45-percent (relative) decrease in word error rate in a task expecting isolated cardinal numbers, sequences of cardinal numbers or musical notes as speech input.
ER  -
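
For readers who want to consume the RIS export programmatically, the following is a minimal sketch of reading the tagged lines into a dictionary. It assumes the common RIS line layout (a two-character tag, two spaces, a hyphen, then the value). The function name `parse_ris` and the shortened sample record are illustrative assumptions, not part of the export service.

```python
import re
from collections import defaultdict

# Assumed RIS line layout: two-character tag, two spaces, hyphen, optional value.
RIS_LINE = re.compile(r"^([A-Z][A-Z0-9])  -(?: (.*))?$")


def parse_ris(text):
    """Hypothetical helper: parse RIS text into a list of {tag: [values]} dicts."""
    records = []
    current = defaultdict(list)
    for line in text.splitlines():
        match = RIS_LINE.match(line.rstrip())
        if match is None:
            continue  # skip blank or non-tag lines in this simple sketch
        tag, value = match.group(1), match.group(2) or ""
        if tag == "ER":
            # "ER" marks the end of a record
            records.append(dict(current))
            current = defaultdict(list)
        else:
            # repeated tags such as AU accumulate in order
            current[tag].append(value)
    return records


# Shortened sample taken from the record above
sample = """TY  - CPAPER
AU  - Hämäläinen, A.
AU  - Pinto, F. M.
PY  - 2013
CY  - Grenoble, France
ER  -
"""

records = parse_ris(sample)
print(records[0]["AU"])
```

Each tag maps to a list because tags such as AU may repeat within one record; single-valued tags like PY simply yield a one-element list.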