Title :
Minimum cost based phoneme class detection for improved iterative speech enhancement
Author :
Arslan, Levent M. ; Hansen, John H L
Author_Institution :
Dept. of Electr. Eng., Duke Univ., Durham, NC, USA
Abstract :
It is known that degrading acoustic noise influences speech quality across phoneme classes in a non-uniform manner. This results in variable quality performance for many speech enhancement algorithms in noisy environments. To address this, a hidden-Markov-mode phoneme classification procedure is proposed which directs single channel speech enhancement across individual phoneme classes. The procedure performs broad phoneme class partitioning of noisy speech frames using a continuous-mixture hidden-Markov-model recognizer in conjunction with a cost based decision process. Cost functions are assigned which weigh errors between phoneme classes that are perceptually different (e.g., vowels versus fricatives, etc.). Once noisy speech frames are partitioned, iterative speech enhancement based on all-pole parameter estimation with inter and intra-frame spectral constraints (Auto:I,LSP:T) is employed. The phoneme class directed enhancement algorithm is evaluated using TIMIT speech data, and shown to result in substantial improvement in objective speech quality over a range of signal-to-noise ratios and individual phoneme classes. The algorithm is also shown to possess consistent quality improvement in a speaker independent scenario
Keywords :
acoustic noise; decision theory; hidden Markov models; iterative methods; speech enhancement; TIMIT speech data; algorithm; all-pole parameter estimation; cost based decision process; degrading acoustic noise; fricatives; hidden-Markov-mode phoneme classification procedure; iterative speech enhancement; minimum cost; noisy environments; phoneme class detection; signal-to-noise ratios; single channel speech enhancement; speaker independent scenario; spectral constraints; speech quality; variable quality performance; vowels; Acoustic noise; Acoustic signal detection; Costs; Degradation; Iterative algorithms; Partitioning algorithms; Speech analysis; Speech enhancement; Speech processing; Working environment noise;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location :
Adelaide, SA
Print_ISBN :
0-7803-1775-0
DOI :
10.1109/ICASSP.1994.389722