DocumentCode :
3113009
Title :
Development of Consonant-Vowel Recognition Systems for Indian languages: Bengali and Odia
Author :
Manjunath, K.E. ; Kumar, S. B. Sunil ; Pati, Debadatta ; Satapathy, Biswajit ; Rao, K. Sreenivasa
Author_Institution :
Sch. of Inf. Technol., Indian Inst. of Technol. Kharagpur, Kharagpur, India
fYear :
2013
fDate :
13-15 Dec. 2013
Firstpage :
1
Lastpage :
6
Abstract :
The basic goal of this work is to develop a Consonant-Vowel Recognition System (CVRS) for determining a sequence of Consonant-Vowel (CV) units present in a given speech utterance. In this work, we are focusing on developing CVRSs for Indian languages namely Bengali and Odia. This framework of developing CVRSs can be extended to any Indian languages. We have developed two separate CVRSs for Bengali and Odia languages. The CVRS is developed using read speech corpus. In this study, 67 CV classes for Bengali and 58 CV classes for Odia are used. Mel Frequency Cepstral Coefficients (MFCCs) are used as features for building models. The Vowel Onset Points (VOPs) are used as anchor points for marking syllable boundaries and for feature extraction. Support Vector Machines (SVMs) are used for building CV models. The performance of the developed CVRSs are evaluated in speaker dependent and speaker independent modes. In speaker dependent case, the best percentage accuracies of Bengali and Odia CVRSs are 49.48 and 69.66 respectively whereas in speaker independent case, the best percentage accuracies of Bengali and Odia CVRSs are 40.26 and 41.59 respectively.
Keywords :
cepstral analysis; feature extraction; natural language processing; speech recognition; support vector machines; Bengali language; CV classes; CV models; CV unit sequence; CVRS development; CVRS performance evaluation; Indian languages; MFCC; Mel frequency cepstral coefficients; Odia language; SVMs; VOP; anchor points; consonant-vowel recognition system development; consonant-vowel unit sequence; feature extraction; read speech corpus; speaker dependent modes; speaker independent modes; speech utterance; support vector machines; syllable boundary marking; vowel onset points; Accuracy; Feature extraction; Hidden Markov models; Speech; Support vector machines; Training; Vectors; Consonant-Vowel recognition; International Phonetic Alphabet; Support Vector Machine; Syllable recognition; Vowel Onset Point;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
India Conference (INDICON), 2013 Annual IEEE
Conference_Location :
Mumbai
Print_ISBN :
978-1-4799-2274-1
Type :
conf
DOI :
10.1109/INDCON.2013.6726109
Filename :
6726109
Link To Document :
بازگشت