DocumentCode
2818536
Title
Automatic transcription and speech recognition of Romanian corpus RO-GRID
Author
Giurgiu, Mircea ; Kabir, Ahsanul
Author_Institution
Telecommun. Dept., Tech. Univ. of Cluj-Napoca, Cluj-Napoca, Romania
fYear
2012
fDate
3-4 July 2012
Firstpage
465
Lastpage
468
Abstract
The results reported in this paper assess the ability of Hidden Markov Model (HMM) based method to generate accurate and reliable automatic phone-level transcriptions for a small vocabulary speech corpus such as RO-GRID. The system requires only orthographic transcription of the target corpus, and can be bootstrapped from models trained just on few amount of data in the transcribed corpus. For this purpose, an automatic time-aligned phone transcription toolbox has been developed and tested on the Romanian corpus and also validated on an English corpus. The quality of transcriptions is judged by evaluating the statistical parameters of the error between the automatic and manual transcription. The transcriptions generated from the most reliable system deviate from the average manual transcription by an average of 20 ms. The system is also able to convert the generated transcription from HTK format into PRAAT format for further manipulation of the speech signal.
Keywords
hidden Markov models; natural language processing; speech processing; speech recognition; English corpus; HMM based method; HTK format; PRAAT format; RO-GRID; Romanian corpus; automatic phone-level transcriptions; automatic time-aligned phone transcription toolbox; bootstrapping; hidden Markov model; manual transcription; orthographic transcription; speech recognition; speech signal manipulation; statistical error parameter evaluation; transcription quality; vocabulary speech corpus; Adaptation models; Hidden Markov models; Manuals; Speech; Speech recognition; Standards; Training; Automatic speech transcription; Hidden Markov Models;
fLanguage
English
Publisher
ieee
Conference_Titel
Telecommunications and Signal Processing (TSP), 2012 35th International Conference on
Conference_Location
Prague
Print_ISBN
978-1-4673-1117-5
Type
conf
DOI
10.1109/TSP.2012.6256337
Filename
6256337
Link To Document