Title :
Classification of normal and pathological voices using TEO phase and Mel cepstral features
Author :
Patil, Hemant A. ; Baljekar, Pallavi N.
Author_Institution :
Dhirubhai Ambani Inst. of Inf. & Commun. Technol. (DA-IICT), Gandhinagar, India
Abstract :
In this paper, a new feature-set, viz., Teager Energy Operator (TEO) phase has been proposed for automatic classification of normal vs. pathological voices. Development of TEO phase has been motivated from recently proposed linear prediction (LP) residual phase for speaker recognition. Classification was performed using a discriminatively-trained 2nd order polynomial classifier on a subset of the Massachusetts Ear and Eye Infirmary (MEEI) database. Score-level fusion of TEO phase and state-of-the-art Mel frequency cepstral coefficients (MFCC) gave reduction in equal error rate (EER) by 1.86 % than EER of MFCC alone. Proposed TEO phase feature set is also evaluated under degraded conditions using the NOISEX-92 database for the case of additive car noise.
Keywords :
polynomials; signal classification; speaker recognition; EER; LP residual phase; MEEI database; MFCC; Massachusetts Ear and Eye Infirmary database; Mel cepstral features; Mel frequency cepstral coefficients; NOISEX-92 database; TEO phase; Teager energy operator phase; additive car noise; discriminatively-trained second order polynomial classifier; equal error rate; linear prediction residual phase; normal voice classification; pathological voice automatic classification; speaker recognition; Accuracy; Databases; Feature extraction; Mel frequency cepstral coefficient; Noise; Pathology; Speech; Mel cepstrum; Teager Energy Operator (TEO) Phase; Voice pathology; discriminative training; score-level fusion;
Conference_Titel :
Signal Processing and Communications (SPCOM), 2012 International Conference on
Conference_Location :
Bangalore
Print_ISBN :
978-1-4673-2013-9
DOI :
10.1109/SPCOM.2012.6289991