Title :
Robust automatic transcription of English speech corpora
Author :
Kabir, Ahsanul ; Giurgiu, Mircea ; Barker, Jon
Author_Institution :
Dept. of Telecommun., Tech. Univ. of Cluj-Napoca, Cluj-Napoca, Romania
Abstract :
This research assesses the ability of a Hidden Markov Model (HMM) based method to generate an accurate and reliable automatic phone-level transcriptions for a small vocabulary speech corpus. In particular, we are interested in a system that requires only orthographic transcription of the target corpus, and can be bootstrapped from models trained on an independent phonetically transcribed corpus. The question we ask is whether reliable results can be achieved despite a large mismatch between the bootstrapping corpus (US English) and the target corpus (British English). Quality of the automatic transcriptions is judged by comparison with manual transcriptions produced by several independent transcribers. Different training strategies are compared for handling the interspeaker variability in the target corpus. The transcriptions generated from the most reliable system deviate from the average manual transcription by an average of 20 ms.
Keywords :
hidden Markov models; natural language processing; speech processing; English speech corpora; hidden Markov model; interspeaker variability; orthographic transcription; robust automatic phone-level transcriptions; vocabulary speech corpus; Automatic control; Computer science; Dictionaries; Hidden Markov models; Natural languages; Robustness; Speech analysis; State estimation; Viterbi algorithm; Vocabulary; Automatic Transcription; GRID Corpus; Hidden Markov Model; TIMIT Corpus;
Conference_Titel :
Communications (COMM), 2010 8th International Conference on
Conference_Location :
Bucharest
Print_ISBN :
978-1-4244-6360-2
DOI :
10.1109/ICCOMM.2010.5509116