DocumentCode
152247
Title
Comparison of MFCC, LPCC and PLP features for the determination of a speaker´s gender
Author
Yucesoy, E. ; Nabiyev, V.V.
Author_Institution
Teknik Bilimler Meslek Yuksekokulu, Ordu Univ., Ordu, Turkey
fYear
2014
fDate
23-25 April 2014
Firstpage
321
Lastpage
324
Abstract
Gender information is a distinctive and the most important property in a speech. Determination of this information from a speech signal is a substantial subject. Gender information used for various purposes in many applications, provides the less error rate by defining the gender-dependent speech/speaker models. In this study, a system determining the gender of a speaker with no dependency from a text is proposed. In the proposed system, speech records represented by Mel Frequency Cepstral Coefficients (MFCC), Linear Prediction Cepstral Coefficients (LPCC) and Perceptual Linear Prediction Coefficients (PLP) features are classified according to the genders by using GMM model. In the study, the effect of feature type and its dimension and the number of GMM components on the success are comparatively investigated. In the experiments, the best result is obtained as 99.37% with 16-coefficient MFCC and 8-component GMM.
Keywords
gender issues; speech processing; LPCC; LPCC features; MFCC features; Mel frequency cepstral coefficients; PLP features; gender dependent speech-speaker models; gender information; linear prediction cepstral coefficients; perceptual linear prediction coefficients; speaker gender; speech signal; substantial subject; Conferences; Mel frequency cepstral coefficient; Multimedia communication; Signal processing; Speech; Speech recognition; Tutorials; Gaussian Mixture Model (GMM); Gender identification; LPCC; MFCC; PLP;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing and Communications Applications Conference (SIU), 2014 22nd
Conference_Location
Trabzon
Type
conf
DOI
10.1109/SIU.2014.6830230
Filename
6830230
Link To Document