Title :
Speaker Characterization with MLSFs
Author :
Cordeiro, Hugo ; Ribeiro, Carlos Meneses
Author_Institution :
Dept. of Electron. Telecommun. & Comput. Eng., Inst. Superior de Engenharia de Lisboa
Abstract :
This paper presents an analysis of an alternative feature for speaker characterization in the context of speaker recognition: line spectrum frequencies (LSFs) derived from mel-filter bank energies. This new feature, which we call mel-LSFs (MLSFs), performs similarly to MFCCs, one of the most common features in speaker recognition, for male speakers, and better than MFCCs for female speakers. When combined with mel-LSF differences, the MLSF feature outperforms MFCCs for both male and female speakers, even when the temporal deltas (ΔMFCCs) are included. Performance is measured in the context of speaker verification, using the equal error rate (EER) and the minimum half total error rate (HTER). Detection error tradeoff (DET) curves are also presented, as well as HTER curves. The main objective of this study is to compare the performance of different features within a common framework, for which a standard support vector machine recogniser was developed. Tests are based on the cellular component of the "2002 NIST Speaker Recognition Evaluation Corpus".
Keywords :
channel bank filters; error detection; speaker recognition; spectral analysis; support vector machines; DET curves; MFCC; NIST Speaker Recognition Evaluation Corpus; detection error threshold; line spectrum frequency; mel-LSF; mel-filter bank energy; speaker characterization; speaker recognition; speaker verification; support vector machine; Autocorrelation; Error analysis; Frequency; Machine learning; NIST; Speaker recognition; Standards development; Support vector machines; Telecommunication computing; Testing;
Conference_Title :
IEEE Odyssey 2006: The Speaker and Language Recognition Workshop
Conference_Location :
San Juan
Print_ISBN :
1-4244-0471-1
Electronic_ISBN :
1-4244-0472-X
DOI :
10.1109/ODYSSEY.2006.248113