مرکز منطقه ای اطلاع رساني علوم و فناوري - Speaker´s gender classification and segmentation using spectral and cepstral feature averaging

DocumentCode :

3642965

Title :

Speaker´s gender classification and segmentation using spectral and cepstral feature averaging

Author :

Marko Kos;Damjan Vlaj;Zdravko Kačič

Author_Institution :

Faculty of Electrical Engineering and Computer Science, University of Maribor, SI-2000 Maribor, Slovenia

fYear :

2011

fDate :

6/1/2011 12:00:00 AM

Firstpage :

Lastpage :

Abstract :

This paper presents speaker gender classification and segmentation. Such classification is frequently used in broadcast news domain. Because pitch is a feature that is difficult to calculate reliably in noisy environment, and because telephone speech is present in broadcast material, we focused on using general acoustic features for gender discrimination task. We also averaged the feature values to emphasize general speaker´s properties to discard short-time properties of speech production. Test show that Average Mel-Frequency Cepstral Coefficients (AMFCC) perform best. The AMFCC features are very convenient gender discriminator for automatic speech recognition system where MFCC features are used, as they perform better than classic MFCC features and only one additional calculation step is needed.

Keywords :

"Speech","Mel frequency cepstral coefficient","Accuracy","Speech recognition","Databases","Materials"

Publisher :

ieee

Conference_Titel :

Systems, Signals and Image Processing (IWSSIP), 2011 18th International Conference on

ISSN :

2157-8672

Print_ISBN :

978-1-4577-0074-3

Electronic_ISBN :

2157-8702

Type :

conf

Filename :

5977407

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3642965