DocumentCode :
134261
Title :
Exploiting Variable length Teager Energy Operator in melcepstral features for person recognition from humming
Author :
Madhavi, Maulik C. ; Patil, Hemant A.
Author_Institution :
Dhirubhai Ambani Inst. of Inf. & Commun. Technol. (DA-IICT), Gandhinagar, India
fYear :
2014
fDate :
12-14 Sept. 2014
Firstpage :
624
Lastpage :
628
Abstract :
In this paper, we attempt voice biometrics problem using only humming signal rather than normal speech. This paper adapts a new feature extraction technique which exploits Variable length Teager Energy Operator (VTEO) onto subband filtered signal of Mel filterbank. This feature modifies structure of state-of-the-art feature set, viz., Mel Frequency Cepstral Coefficients (MFCC). In particular, a new energy measure, viz., VTEO is employed to compute subband energies of different time-domain subband signals. The features derived MFCCs to capture magnitude and phase spectrum information via VTEO are termed as MFCC-VTMP. Discriminatively-trained polynomial classifier of 2nd order approximations is used as the basis for all experiments. MFCC-VTMP feature set is found to be better than MFCC for various evaluation factors such as order of polynomial classifier, dimension of feature vector, signal degradation conditions and class separability. % EER of MFCC and MFCC-VTMP are found to be 12.20% and 12.01%, respectively using 2nd order polynomial classification.
Keywords :
biometrics (access control); cepstral analysis; channel bank filters; feature extraction; mathematical operators; polynomials; signal classification; speech recognition; 2nd order approximations; MFCC-VTMP feature set; VTEO; class separability; discriminatively-trained polynomial classifier; feature extraction technique; feature vector dimension; humming signal; mel cepstral features; mel filterbank; mel frequency cepstral coefficients; person recognition; phase spectrum information; polynomial classifier order; signal degradation conditions; subband filtered signal; time-domain subband signals; variable length teager energy operator; voice biometrics problem; Feature extraction; Mel frequency cepstral coefficient; Noise; Polynomials; Speech; Training; Vectors; Humming; MFCC-VTMP; Melcepstrum; VTEO; person recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
Conference_Location :
Singapore
Type :
conf
DOI :
10.1109/ISCSLP.2014.6936654
Filename :
6936654
Link To Document :
بازگشت