DocumentCode :
845139
Title :
Speaker normalization for Chinese vowel recognition in cochlear implants
Author :
Luo, Xin ; Fu, Qian-Jie
Author_Institution :
Dept. of Auditory Implants & Perception, House Ear Inst., Los Angeles, CA, USA
Volume :
52
Issue :
7
fYear :
2005
fDate :
7/1/2005 12:00:00 AM
Firstpage :
1358
Lastpage :
1361
Abstract :
Because of the limited spectra-temporal resolution associated with cochlear implants, implant patients often have greater difficulty with multitalker speech recognition. The present study investigated whether multitalker speech recognition can be improved by applying speaker normalization techniques to cochlear implant speech processing. Multitalker Chinese vowel recognition was tested with normal-hearing Chinese-speaking subjects listening to a 4-channel cochlear implant simulation, with and without speaker normalization. For each subject, speaker normalization was referenced to the speaker that produced the best recognition performance under conditions without speaker normalization. To match the remaining speakers to this "optimal" output pattern, the overall frequency range of the analysis filter bank was adjusted for each speaker according to the ratio of the mean third formant frequency values between the specific speaker and the reference speaker. Results showed that speaker normalization provided a small but significant improvement in subjects\´ overall recognition performance. After speaker normalization, subjects\´ patterns of recognition performance across speakers changed, demonstrating the potential for speaker-dependent effects with the proposed normalization technique.
Keywords :
ear; medical signal processing; prosthetics; speech processing; speech recognition; Chinese vowel recognition; analysis filter bank; cochlear implants; limited spectra-temporal resolution; multitalker speech recognition; normal-hearing Chinese-speaking subjects; speaker normalization; speech processing; Auditory implants; Cochlear implants; Filter bank; Frequency; Pattern analysis; Pattern matching; Pattern recognition; Speech processing; Speech recognition; Testing; Cochlear implants; speaker normalization; vowel recognition; Artificial Intelligence; China; Cochlear Implants; Computer-Aided Design; Equipment Failure Analysis; Humans; Phonation; Prosthesis Design; Sound Spectrography; Speech Acoustics; Speech Perception; Speech Recognition Software;
fLanguage :
English
Journal_Title :
Biomedical Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9294
Type :
jour
DOI :
10.1109/TBME.2005.847530
Filename :
1440618
Link To Document :
بازگشت