DocumentCode :
3570563
Title :
Development of vocal tract length normalized phonetic engine for Gujarati and Marathi languages
Author :
Sharma, Shubham ; Madhavi, Maulik C. ; Patil, Hemant A.
Author_Institution :
Dhirubhai Ambani Inst. of Inf. & Commun. Technol., Gandhinagar, India
fYear :
2014
Firstpage :
1
Lastpage :
6
Abstract :
Phonetic engine (PE) is a system that converts speech sound units into symbols without any higher-level information (such as semantic or linguistic details). This paper presents the development of PE in two Indian languages, viz., Gujarati and Marathi. To investigate the performance of PE, speech recorded in three different modes, viz., read, spontaneous and lecture is considered. Database consists of a large number of speakers in each mode for these languages. In order to reduce the effects of speaker differences in the databases, Vocal Tract Length Normalization (VTLN) using Lee-Rose method is incorporated. Here, performances of PEs are tested using state-of-the-art Mel frequency cepstral coefficients (MFCC) and vocal tract length normalized features. Hidden Markov model (HMM)-based approach is used for modeling the phonetic units. On an average, improvement of 3.12 % and 1.32 % is achieved using vocal tract length normalized PE over MFCCs for Gujarati and Marathi, respectively.
Keywords :
hidden Markov models; natural language processing; speech processing; Gujarati languages; HMM; Indian languages; Lee-Rose method; MFCC; Marathi languages; Mel frequency cepstral coefficients; PE; VTLN; hidden Markov model; higher-level information; normalized phonetic engine; speech recording; speech sound units; vocal tract length normalization; Lee-Rose method; MFCC; Phonetic engine; VTLN; hidden Markov model;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA), 2014 17th Oriental Chapter of the International Committee for the
Type :
conf
DOI :
10.1109/ICSDA.2014.7051439
Filename :
7051439
Link To Document :
بازگشت