DocumentCode :
3410873
Title :
Phonetic pronunciations for Arabic speech-to-text systems
Author :
Diehl, F. ; Gales, M.J.F. ; Tomalin, M. ; Woodland, P.C.
Author_Institution :
Eng. Dept., Cambridge Univ., Cambridge
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
1573
Lastpage :
1576
Abstract :
In this paper, two aspects of generating and using phonetic Arabic dictionaries are described. First, the use of single pronunciation acoustic models in the context of Arabic large vocabulary automatic speech recognition (ASR) is investigated. These have been found to be useful for English ASR systems when combined with standard multiple pronunciation systems. The second area examined is automatically deriving phonetic "pronunciations" for words that standard approaches, such as the Buckwalter morphological analyzer, cannot handle. Without pronunciations for these words, the out-of-vocabulary (OOV) rates for various Arabic tasks increase significantly. Here, pronunciations are automatically found by first deriving grapheme-to-phone rules and associated rule probabilities. These are then used to produce the most likely pronunciation, or pronunciations, for any word. These approaches are evaluated on a large vocabulary Arabic broadcast news and broadcast conversation transcription task. Both schemes are found to yield gains with a multi-pass/combination framework.
Keywords :
speech processing; speech recognition; speech synthesis; Arabic large vocabulary; Arabic speech-to-text systems; automatic speech recognition; multiple pronunciation systems; phonetic Arabic dictionaries; phonetic pronunciations; single pronunciation acoustic models; Acoustical engineering; Automatic speech recognition; Broadcasting; Context modeling; Dictionaries; Frequency; Hidden Markov models; Speech recognition; Training data; Vocabulary; Arabic; Single Pronunciation Modelling; Speech Recognition;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4517924
Filename :
4517924