Title :
Phonetic Transcription using Speech Recognition Technique Considering Variations in Pronunciation
Author :
Min-Siong Liang ; Ren-Yuan Lyu ; Yuang-Chin Chiang
Author_Institution :
Dept. of Electr. Eng., Nat. Tsing Hua Univ., Hsinchu, Taiwan
Abstract :
We propose a new approach to phonetic transcription of speech and text that combines automatic speech recognition (ASR) with grapheme-to-phoneme (G2P) techniques. The text is paired with the corresponding human speech utterance, and ASR is run over a sausage search network constructed from the multiple candidate pronunciations of the text, which reduces the manual effort required for phonetic transcription. With a multiple-pronunciation lexicon, a transcription error rate of 12.74% was achieved. Adapting the pronunciation lexicon with pronunciation variation (PV) rules improved the result further, giving an error rate reduction of 17.11%.
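The sausage search network mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The lexicon entries, the phone symbols, the PV rule, and the function names (`apply_pv_rules`, `build_sausage`, `count_paths`, `decode`) are all hypothetical, and the `score` callback is a toy stand-in for the acoustic scoring a real recognizer would compute from the speech signal.

```python
# Toy multi-pronunciation lexicon (hypothetical): word -> list of phone strings.
TOY_LEXICON = {
    "w1": ["a b", "a c"],
    "w2": ["d"],
}


def apply_pv_rules(lexicon, rules):
    """Return a new lexicon extended with variants produced by PV rules.

    Each rule is an (old_phone, new_phone) pair; a variant is added when
    old_phone occurs in an existing pronunciation.
    """
    adapted = {}
    for word, prons in lexicon.items():
        variants = list(prons)
        for pron in prons:
            phones = pron.split()
            for old, new in rules:
                if old in phones:
                    variant = " ".join(new if p == old else p for p in phones)
                    if variant not in variants:
                        variants.append(variant)
        adapted[word] = variants
    return adapted


def build_sausage(words, lexicon):
    """One slot per word; each slot lists that word's alternative pronunciations."""
    return [list(lexicon[word]) for word in words]


def count_paths(sausage):
    """Number of distinct transcriptions the network allows."""
    total = 1
    for slot in sausage:
        total *= len(slot)
    return total


def decode(sausage, score):
    """Pick the best-scoring pronunciation in each slot.

    score(slot_index, pronunciation) is an assumed stand-in for acoustic
    evidence; a real system would align the audio and run Viterbi search.
    """
    return [max(slot, key=lambda pron, i=i: score(i, pron))
            for i, slot in enumerate(sausage)]
```

For example, `apply_pv_rules(TOY_LEXICON, [("b", "p")])` adds the variant `"a p"` to `w1`, and `build_sausage(["w1", "w2", "w1"], ...)` over the adapted lexicon yields three slots with 3, 1, and 3 alternatives. Because the text constrains the search to these few alternatives per word, the recognizer only has to choose among listed pronunciations rather than recognize freely.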
Keywords :
natural language processing; speech processing; speech recognition; automatic speech recognition; grapheme-to-phoneme techniques; human speech utterance; multiple text pronunciations; phonetic transcription; pronunciation variation; speech recognition technique; Automatic speech recognition; Computer science; Error analysis; Flowcharts; Humans; Spatial databases; Speech processing; Speech recognition; Statistics; Vocabulary; Automatic Phonetic Transcription; Chinese; Dialect; Pronunciation Variation; Taiwanese;
Conference_Title :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
DOI :
10.1109/ICASSP.2007.367175