DocumentCode
2660431
Title
Automatic identification of gender & accent in spoken Hindi utterances with regional Indian accents
Author
Malhotra, Kamini ; Khosla, Anu
Author_Institution
Sci. Anal. Group, DRDO, Delhi
fYear
2008
fDate
15-19 Dec. 2008
Firstpage
309
Lastpage
312
Abstract
In the past significant effort has been focused on automatic extraction of information from speech signals. Most techniques have aimed at automatic speech recognition or speaker identification. Automatic accent identification (AID) has received far less attention. This paper gives an approach to identify gender and accent of a speaker using Gaussian mixture modeling technique. The proposed approach is text independent and identifies accent among four regional Indian accents in spoken Hindi and also identifies the gender. The accents worked upon are Kashmiri, Manipuri, Bengali and neutral Hindi. The Gaussian mixture model (GMM) approach precludes the need of speech segmentation for training and makes the implementation of the system very simple. When gender dependent GMMs are used, the accent identification score is enhanced and gender is also correctly recognized. The results show that the GMMs lend themselves to accent and gender identification task very well. In this approach spectral features have been incorporated in the form of mel frequency cepstral coefficients (MFCC). The approach has a wide scope of expansion to incorporate other regional accents in a very simple way.
Keywords
Gaussian processes; cepstral analysis; gender issues; natural language processing; speaker recognition; Bengali language; Gaussian mixture modeling; Kashmiri language; Manipuri language; automatic accent identification; automatic gender identification; mel frequency cepstral coefficient; regional Indian accent; speech segmentation; spoken Hindi utterance; Automatic speech recognition; Hidden Markov models; Information analysis; Loudspeakers; Mel frequency cepstral coefficient; Signal analysis; Signal processing; Speech analysis; Speech processing; Speech recognition; Pattern recognition; Spectral analysis; Speech analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Spoken Language Technology Workshop, 2008. SLT 2008. IEEE
Conference_Location
Goa
Print_ISBN
978-1-4244-3471-8
Electronic_ISBN
978-1-4244-3472-5
Type
conf
DOI
10.1109/SLT.2008.4777902
Filename
4777902
Link To Document