Title :
Improving the recognition of pathological voice using the discriminant HLDA transformation
Author :
Lachhab, Othman ; Di Martino, Joseph ; Ibn Elhaj, El Hassane ; Hammouch, Ahmed
Author_Institution :
ENSET, Mohammed V Univ., Rabat, Morocco
Abstract :
In this paper, we propose a simple and fast method for evaluating the pathological voice (esophageal) by applying the continuous speech recognition in a speaker dependent mode, on our own database of the pathological voice, we call FPSD (French Pathological Speech Database). The recognition system used is implemented using the HTK platform, based on HMM/GMM monophone models. The acoustic vectors are linearly transformed by the HLDA (Heteroscedastic Linear Discriminant Analysis) method to reduce their size in a smaller space with good discriminative properties. The obtained phone recognition rate (63.59 %) is very promising when we know that esophageal voice contains unnatural sounds, difficult to understand.
Keywords :
Gaussian processes; hidden Markov models; mixture models; speaker recognition; FPSD; French pathological speech database; GMM monophone model; HMM monophone model; HTK platform; acoustic vector; continuous speech recognition; discriminant HLDA transformation; esophageal voice; heteroscedastic linear discriminant analysis; pathological voice recognition; phone recognition rate; speaker dependent mode; Databases; Hidden Markov models; Mel frequency cepstral coefficient; Pathology; Speech; Speech recognition; Vectors; Automatic Speech Recognition(ASR); GMM; HLDA; HMM; HTK; MFCC; Pathological voices;
Conference_Titel :
Information Science and Technology (CIST), 2014 Third IEEE International Colloquium in
Conference_Location :
Tetouan
Print_ISBN :
978-1-4799-5978-5
DOI :
10.1109/CIST.2014.7016648