Ciência-IUL
Publicações
Descrição Detalhada da Publicação
Improving speech recognition through automatic selection of age group – specific acoustic models
Título Livro
Computational Processing of the Portuguese Language. PROPOR 2014. Lecture Notes in Computer Science
Ano
2014
Língua
Inglês
País
Suíça
Mais Informação
Web of Science®
Scopus
Abstract/Resumo
The acoustic models used by automatic speech recognisers are usually trained with speech collected from young to middle-aged adults. As the characteristics of speech change with age, such acoustic models tend to perform poorly on children’s and elderly people’s speech. In this study, we investigate whether the automatic age group classification of speakers, together with age group –specific acoustic models, could improve automatic speech recognition performance. We train an age group classifier with an accuracy of about 95% and show that using the results of the classifier to select age group –specific acoustic models for children and the elderly leads to considerable gains in automatic speech recognition performance, as compared with using acoustic models trained with young to middle-aged adults’ speech for recognising their speech, as well.
Agradecimentos/Acknowledgements
--
Palavras-chave
Acoustic modelling,Age group classification,Automatic speech recognition,Children,Elderly,Paralinguistic information
Classificação Fields of Science and Technology
- Matemáticas - Ciências Naturais
- Ciências da Computação e da Informação - Ciências Naturais

English