Title :
Turkish Large Vocabulary Continuous Speech Recognition by using limited audio corpus
Author :
Susman, Derya ; Köprü, Selçuk ; Yazici, Adnan
Author_Institution :
Bilgisayar Muhendisligi, ODTU, Ankara, Turkey
Abstract :
In this paper, the recognition performances of several methodologies proposed in the context of Turkish Large Vocabulary Continuous Speech Recognition are retrieved by using a limited audio corpus. Word based, stem based, stem-ending based, and morph based language models are utilized with different n-gram orders. Word based and stem-ending based language models are extended by using several approaches. Also, a hybrid language model which is based on word based and stem-ending based language models is proposed. Word based language model is observed to outperform sub-word language models when limited audio corpus is used.
Keywords :
speech recognition; Turkish large vocabulary continuous speech recognition; hybrid language model; limited audio corpus; morph based language models; n-gram orders; stem-ending based language models; word based language models; Abstracts; Context; Context modeling; Hidden Markov models; Microphones; Speech recognition; Vocabulary; agglutinative; hidden markov model; large vocabulary continuous speech recognition; limited corpus; n-gram language model;
Conference_Titel :
Signal Processing and Communications Applications Conference (SIU), 2012 20th
Conference_Location :
Mugla
Print_ISBN :
978-1-4673-0055-1
Electronic_ISBN :
978-1-4673-0054-4
DOI :
10.1109/SIU.2012.6204601