Improving the recognition of pathological voice using the discriminant HLDA transformation

Author

Lachhab, Othman ; Di Martino, Joseph ; Ibn Elhaj, El Hassane ; Hammouch, Ahmed

Author_Institution

ENSET, Mohammed V Univ., Rabat, Morocco

fYear

2014

fDate

20-22 Oct. 2014

Firstpage

370

Lastpage

373

Abstract

In this paper, we propose a simple and fast method for evaluating the pathological voice (esophageal) by applying the continuous speech recognition in a speaker dependent mode, on our own database of the pathological voice, we call FPSD (French Pathological Speech Database). The recognition system used is implemented using the HTK platform, based on HMM/GMM monophone models. The acoustic vectors are linearly transformed by the HLDA (Heteroscedastic Linear Discriminant Analysis) method to reduce their size in a smaller space with good discriminative properties. The obtained phone recognition rate (63.59 %) is very promising when we know that esophageal voice contains unnatural sounds, difficult to understand.

Keywords

Gaussian processes; hidden Markov models; mixture models; speaker recognition; FPSD; French pathological speech database; GMM monophone model; HMM monophone model; HTK platform; acoustic vector; continuous speech recognition; discriminant HLDA transformation; esophageal voice; heteroscedastic linear discriminant analysis; pathological voice recognition; phone recognition rate; speaker dependent mode; Databases; Hidden Markov models; Mel frequency cepstral coefficient; Pathology; Speech; Speech recognition; Vectors; Automatic Speech Recognition(ASR); GMM; HLDA; HMM; HTK; MFCC; Pathological voices;

fLanguage

English

Publisher

ieee

Conference_Titel

Information Science and Technology (CIST), 2014 Third IEEE International Colloquium in

Conference_Location

Tetouan

Print_ISBN

978-1-4799-5978-5

Type

conf

DOI

10.1109/CIST.2014.7016648

Filename

7016648