مرکز منطقه ای اطلاع رساني علوم و فناوري - Development of Consonant-Vowel Recognition Systems for Indian languages: Bengali and Odia

DocumentCode :

3113009

Title :

Development of Consonant-Vowel Recognition Systems for Indian languages: Bengali and Odia

Author :

Manjunath, K.E. ; Kumar, S. B. Sunil ; Pati, Debadatta ; Satapathy, Biswajit ; Rao, K. Sreenivasa

Author_Institution :

Sch. of Inf. Technol., Indian Inst. of Technol. Kharagpur, Kharagpur, India

fYear :

2013

fDate :

13-15 Dec. 2013

Firstpage :

Lastpage :

Abstract :

The basic goal of this work is to develop a Consonant-Vowel Recognition System (CVRS) for determining a sequence of Consonant-Vowel (CV) units present in a given speech utterance. In this work, we are focusing on developing CVRSs for Indian languages namely Bengali and Odia. This framework of developing CVRSs can be extended to any Indian languages. We have developed two separate CVRSs for Bengali and Odia languages. The CVRS is developed using read speech corpus. In this study, 67 CV classes for Bengali and 58 CV classes for Odia are used. Mel Frequency Cepstral Coefficients (MFCCs) are used as features for building models. The Vowel Onset Points (VOPs) are used as anchor points for marking syllable boundaries and for feature extraction. Support Vector Machines (SVMs) are used for building CV models. The performance of the developed CVRSs are evaluated in speaker dependent and speaker independent modes. In speaker dependent case, the best percentage accuracies of Bengali and Odia CVRSs are 49.48 and 69.66 respectively whereas in speaker independent case, the best percentage accuracies of Bengali and Odia CVRSs are 40.26 and 41.59 respectively.

Keywords :

cepstral analysis; feature extraction; natural language processing; speech recognition; support vector machines; Bengali language; CV classes; CV models; CV unit sequence; CVRS development; CVRS performance evaluation; Indian languages; MFCC; Mel frequency cepstral coefficients; Odia language; SVMs; VOP; anchor points; consonant-vowel recognition system development; consonant-vowel unit sequence; feature extraction; read speech corpus; speaker dependent modes; speaker independent modes; speech utterance; support vector machines; syllable boundary marking; vowel onset points; Accuracy; Feature extraction; Hidden Markov models; Speech; Support vector machines; Training; Vectors; Consonant-Vowel recognition; International Phonetic Alphabet; Support Vector Machine; Syllable recognition; Vowel Onset Point;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

India Conference (INDICON), 2013 Annual IEEE

Conference_Location :

Mumbai

Print_ISBN :

978-1-4799-2274-1

Type :

conf

DOI :

10.1109/INDCON.2013.6726109

Filename :

6726109

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3113009