DocumentCode
234388
Title
Improving the recognition of pathological voice using the discriminant HLDA transformation
Author
Lachhab, Othman ; Di Martino, Joseph ; Ibn Elhaj, El Hassane ; Hammouch, Ahmed
Author_Institution
ENSET, Mohammed V Univ., Rabat, Morocco
fYear
2014
fDate
20-22 Oct. 2014
Firstpage
370
Lastpage
373
Abstract
In this paper, we propose a simple and fast method for evaluating the pathological voice (esophageal) by applying the continuous speech recognition in a speaker dependent mode, on our own database of the pathological voice, we call FPSD (French Pathological Speech Database). The recognition system used is implemented using the HTK platform, based on HMM/GMM monophone models. The acoustic vectors are linearly transformed by the HLDA (Heteroscedastic Linear Discriminant Analysis) method to reduce their size in a smaller space with good discriminative properties. The obtained phone recognition rate (63.59 %) is very promising when we know that esophageal voice contains unnatural sounds, difficult to understand.
Keywords
Gaussian processes; hidden Markov models; mixture models; speaker recognition; FPSD; French pathological speech database; GMM monophone model; HMM monophone model; HTK platform; acoustic vector; continuous speech recognition; discriminant HLDA transformation; esophageal voice; heteroscedastic linear discriminant analysis; pathological voice recognition; phone recognition rate; speaker dependent mode; Databases; Hidden Markov models; Mel frequency cepstral coefficient; Pathology; Speech; Speech recognition; Vectors; Automatic Speech Recognition(ASR); GMM; HLDA; HMM; HTK; MFCC; Pathological voices;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Science and Technology (CIST), 2014 Third IEEE International Colloquium in
Conference_Location
Tetouan
Print_ISBN
978-1-4799-5978-5
Type
conf
DOI
10.1109/CIST.2014.7016648
Filename
7016648
Link To Document