Title :
Smooth Gmm Based Multi-Talker Spectral Conversion for Spectrally Degraded Speech
Author :
Liu, Chuping ; Fu, Qian-Jie ; Narayanan, Shrikanth S.
Author_Institution :
Dept. of Electr. Eng., Southern California Univ., Los Angeles, CA
Abstract :
Because of the limited spectro-temporal resolution associated with the implant device, cochlear implant (CI) patients are more susceptible to talker variability than normal hearing (NH) listeners. In the present study, the effect of a smooth GMM based spectral conversion algorithm on multi-talker sentence recognition was tested in CI patients. In a model of CI speech processing (4-16 channels of spectrally degraded speech), talker distortion was significantly reduced with relatively few (~64) GMM components. CI patients´ sentence recognition was measured for one male (M1) and one female (F1) talker, as well as for spectrally converted speech (from M1 to F1 and from F1 to M1). Overall, CI users were sensitive to talker differences; some subjects performed better with M1, others with F1. After converting the spectrum of the less-understood talker to that of the better-understood talker, recognition of the less-understood talker´s speech was significantly improved. The results suggest that smooth GMM-based spectral conversion may improve CI patients´ multi-talker speech recognition
Keywords :
Markov processes; ear; medical signal processing; prosthetics; speech processing; speech recognition; cochlear implant; multitalker sentence recognition; multitalker spectral conversion; smooth GMM; spectrally degraded speech; spectro-temporal resolution; speech processing; Acoustic distortion; Auditory implants; Auditory system; Cochlear implants; Degradation; Frequency; Pattern matching; Speech processing; Speech recognition; Testing;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
Print_ISBN :
1-4244-0469-X
DOI :
10.1109/ICASSP.2006.1661232