DocumentCode :
3410996
Title :
Age and gender recognition for telephone applications based on GMM supervectors and support vector machines
Author :
Bocklet, Tobias ; Maier, Andreas ; Bauer, Josef G. ; Burkhardt, Felix ; Nöth, Elmar
Author_Institution :
Inst. of Pattern Recognition, Erlangen-Nuremberg Univ., Nuremberg
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
1605
Lastpage :
1608
Abstract :
This paper compares two approaches of automatic age and gender classification with 7 classes. The first approach are Gaussian mixture models (GMMs) with universal background models (UBMs), which is well known for the task of speaker identification/verification. The training is performed by the EM algorithm or MAP adaptation respectively. For the second approach for each speaker of the test and training set a GMM model is trained. The means of each model are extracted and concatenated, which results in a GMM supervector for each speaker. These supervectors are then used in a support vector machine (SVM). Three different kernels were employed for the SVM approach: a polynomial kernel (with different polynomials), an RBF kernel and a linear GMM distance kernel, based on the KL divergence. With the SVM approach we improved the recognition rate to 74% (p < 0.001) and are in the same range as humans.
Keywords :
Gaussian processes; Karhunen-Loeve transforms; acoustic signal processing; expectation-maximisation algorithm; gender issues; polynomials; radial basis function networks; speaker recognition; support vector machines; Gaussian mixture model supervectors; KL divergence; acoustic signal processing; age recognition; expectation-maximisation algorithm; gender recognition; maximum a posteriori algorithm; polynomial kernel; radial basis function kernel; speaker classification; speaker identification; speaker verification; support vector machines; telephone; universal background models; Automatic speech recognition; Humans; Kernel; Loudspeakers; Pattern recognition; Polynomials; Support vector machine classification; Support vector machines; System testing; Telephony; Acoustic signal analysis; Gaussian mixture models (GMM); age; gender; speaker classification; support vector machine (SVM);
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4517932
Filename :
4517932
Link To Document :
بازگشت