Scientific journal paper Q1
Automatic transcription system for parliamentary debates in the context of assembly of the republic of Portugal
Pedro Nascimento (Nascimento, P.); Joao C Ferreira or Joao Ferreira (Ferreira, J. C.); Fernando Batista (Batista, F.);
Journal Title
International Journal of Speech Technology
Year (definitive publication)
2024
Language
English
Country
United Kingdom
More Information
Web of Science®

This publication is not indexed in Web of Science®

Scopus

Times Cited: 0

(Last checked: 2024-11-14 15:02)

View record in Scopus

Google Scholar

Times Cited: 0

(Last checked: 2024-11-17 15:57)

View record in Google Scholar

Abstract
The transcription of parliamentary proceedings is essential for democratic governance. Traditional methods are manual and time-consuming. This work introduces an Automatic Transcription System for the Assembly of the Republic of Portugal (STAAR) that uses an automatic speech recognition model and speaker diarization technologies. STAAR was developed after analyzing existing technologies and the Assembly’s specific needs, leading to an effective solution that integrates with current processes. STAAR stands out for its efficiency in transcribing debates and adapting to parliamentary language nuances. It significantly exceeded expectations by presenting a low transcription error rate, ranging from 1.7 to 11.3%, depending on the context and speech style, reducing the time required to produce the official parliamentary debates journal, and improving overall transcription efficiency. Additionally, STAAR enabled the transcription of previously undocumented parliamentary committee meetings, enhancing the documentation of parliamentary activities. This achievement marks a significant step in modernizing parliamentary processes, increasing transparency and accessibility of political information, and positions the Portuguese Parliament at the forefront of technological innovation in parliamentary debates transcription.
Acknowledgements
--
Keywords
Automatic transcription,Parliamentary debates,Automatic speech recognition,Natural language processing,Machine learning,Large language model,Speaker diarization
  • Computer and Information Sciences - Natural Sciences
  • Languages and Literature - Humanities

With the objective to increase the research activity directed towards the achievement of the United Nations 2030 Sustainable Development Goals, the possibility of associating scientific publications with the Sustainable Development Goals is now available in Ciência-IUL. These are the Sustainable Development Goals identified by the author(s) for this publication. For more detailed information on the Sustainable Development Goals, click here.