Title :
Analysis of effect of compensation parameter estimation for CMN on speech/speaker recognition
Author :
Wang, Longbiao ; Kitaoka, Norihide ; Nakagawa, Seiichi
Author_Institution :
Dept. of Inf. & Comput. Sci., Toyohashi Univ. of Technol., Toyohashi
Abstract :
In a distant environment, channel distortion may drastically degrade speech recognition and speaker recognition performance. In this paper, we analyze the effect of compensation parameter estimation for Cepstral Mean Normalization (CMN) on speech and speaker recognition. We first investigate the differences between intra-speaker variation and inter-speaker variation by analyzing the cepstrum distances of Japanese vowels. The analysis indicates that compensating for transmission characteristics affects the speech recognition task and the speaker recognition task differently. Position-Dependent Cepstral Mean Normalization (PDCMN), which compensates for channel distortion depending on the speaker position, is then used to evaluate speech recognition and speaker recognition performance in a distant environment. We conducted experiments on small-vocabulary (100 words) distant isolated word recognition in both simulated and real environments. The results indicate that the proposed PDCMN is more effective for speaker recognition than for speech recognition. We also investigate the effects of factors such as the experimental environment, the utterance length, and the distance between the sound source and the microphone on speech/speaker recognition, and discuss solutions for the degradation these factors cause. The analysis allows us to decide which recognition method and processing could be effective and necessary for a specific recognition task under a given experimental setup.
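The contrast the abstract draws between CMN and PDCMN can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes only the textbook form of CMN (subtract the per-coefficient cepstral mean from every frame) and assumes that the position-dependent variant takes its mean from a table prepared in advance for each speaker position. The function names, the position labels "near" and "far", and all numbers are hypothetical.

```python
from typing import Dict, List, Sequence

Frame = List[float]


def cepstral_mean(frames: Sequence[Sequence[float]]) -> Frame:
    """Average each cepstral coefficient over all frames."""
    if not frames:
        raise ValueError("need at least one frame")
    n = len(frames)
    dim = len(frames[0])
    return [sum(f[k] for f in frames) / n for k in range(dim)]


def subtract_mean(frames: Sequence[Sequence[float]],
                  mean: Sequence[float]) -> List[Frame]:
    """Subtract the same mean vector from every frame."""
    return [[c - m for c, m in zip(f, mean)] for f in frames]


def cmn(frames: Sequence[Sequence[float]]) -> List[Frame]:
    """Conventional CMN: the mean is estimated from the utterance itself."""
    return subtract_mean(frames, cepstral_mean(frames))


def pdcmn(frames: Sequence[Sequence[float]],
          position: str,
          position_means: Dict[str, Frame]) -> List[Frame]:
    """Position-dependent sketch: use a mean stored for the given position.

    Assumption: position_means was estimated beforehand, one vector per
    speaker position; how the position itself is found is out of scope here.
    """
    return subtract_mean(frames, position_means[position])


# Toy 2-dimensional cepstra for a three-frame utterance (made-up numbers).
utterance = [[1.0, 4.0], [2.0, 5.0], [3.0, 6.0]]
normalized = cmn(utterance)

# Hypothetical table of compensation parameters, one mean per position.
position_means = {"near": [0.5, 1.0], "far": [2.0, 5.0]}
compensated = pdcmn(utterance, "far", position_means)
```

In this sketch the two functions differ only in where the compensation parameter comes from: conventional CMN estimates it from the utterance being recognized, while the position-dependent version looks it up by position.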
Keywords :
cepstral analysis; distortion; parameter estimation; speaker recognition; CMN; channel distortion; compensation parameter estimation; interspeaker variation; intraspeaker variation; microphone; position-dependent cepstral mean normalization; sound source; speaker recognition; speech recognition; word recognition; Cepstral analysis; Cepstrum; Degradation; Microphones; Parameter estimation; Performance evaluation; Speaker recognition; Speech analysis; Speech recognition; Vocabulary;
Conference_Title :
Signal Processing and Its Applications, 2007. ISSPA 2007. 9th International Symposium on
Conference_Location :
Sharjah
Print_ISBN :
978-1-4244-0778-1
Electronic_ISBN :
978-1-4244-1779-8
DOI :
10.1109/ISSPA.2007.4555505