DocumentCode :
3642965
Title :
Speaker´s gender classification and segmentation using spectral and cepstral feature averaging
Author :
Marko Kos;Damjan Vlaj;Zdravko Kačič
Author_Institution :
Faculty of Electrical Engineering and Computer Science, University of Maribor, SI-2000 Maribor, Slovenia
fYear :
2011
fDate :
6/1/2011 12:00:00 AM
Firstpage :
1
Lastpage :
4
Abstract :
This paper presents speaker gender classification and segmentation. Such classification is frequently used in broadcast news domain. Because pitch is a feature that is difficult to calculate reliably in noisy environment, and because telephone speech is present in broadcast material, we focused on using general acoustic features for gender discrimination task. We also averaged the feature values to emphasize general speaker´s properties to discard short-time properties of speech production. Test show that Average Mel-Frequency Cepstral Coefficients (AMFCC) perform best. The AMFCC features are very convenient gender discriminator for automatic speech recognition system where MFCC features are used, as they perform better than classic MFCC features and only one additional calculation step is needed.
Keywords :
"Speech","Mel frequency cepstral coefficient","Accuracy","Speech recognition","Databases","Materials"
Publisher :
ieee
Conference_Titel :
Systems, Signals and Image Processing (IWSSIP), 2011 18th International Conference on
ISSN :
2157-8672
Print_ISBN :
978-1-4577-0074-3
Electronic_ISBN :
2157-8702
Type :
conf
Filename :
5977407
Link To Document :
بازگشت