DocumentCode :
3081333
Title :
Malayalam Speech Recognition system and its application for visually impaired people
Author :
Anand, Anu V. ; Devi, P. Shobana ; Stephen, Jose ; Bhadran, V.K.
Author_Institution :
Centre for Dev. of Adv. Comput., Trivandrum, India
fYear :
2012
fDate :
7-9 Dec. 2012
Firstpage :
619
Lastpage :
624
Abstract :
This paper describes the development of state-of-the-art large vocabulary continuous speech recognition (LVCSR) system for the Malayalam language with an application for visually challenged. For an LVCSR, building a high accurate acoustic models and large-scale language models are the challenging task. Speech corpus for training the system is collected from 80 native speakers in room environment ensuring the speaker variance. Mel-frequency Cepstral Coefficients (MFCC) method is used as a front-end to extract acoustic features from the input signal. Acoustic model is built on 30 hours of speech data based on Hidden Markov Model (HMM). A hybrid model, integrating rule based and statistical method is used to handle pronunciation variations in the dictionary. The best configuration of the system achieved word accuracy of 75% in average. Accuracy of the system is further increased up to 80% in average, by implementing speaker adaptation technique. The developed system is integrated to OpenOffice Writer together with TTS for making it user friendly editor for visually challenged people.
Keywords :
cepstral analysis; feature extraction; hidden Markov models; natural language processing; speech recognition; HMM; LVCSR; MFCC; Malayalam speech recognition system; OpenOffice Writer; acoustic feature extraction; acoustic models; hidden Markov model; language models; large vocabulary continuous speech recognition; mel frequency cepstral coefficients; speaker adaptation technique; speech corpus; visually impaired people; Acoustics; Computational modeling; Dictionaries; Hidden Markov models; Speech; Speech recognition; Training; CMU SPHINX; HMM; LVCSR; Speech Recognition; Speech Recognition application; Voice enabled OpenOffice;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
India Conference (INDICON), 2012 Annual IEEE
Conference_Location :
Kochi
Print_ISBN :
978-1-4673-2270-6
Type :
conf
DOI :
10.1109/INDCON.2012.6420692
Filename :
6420692
Link To Document :
بازگشت