Title :
Switching linear dynamic transducer for stereo data based speech feature mapping
Author :
Han, Chang Woo ; Kang, Tae Gyoon ; Hong, Doo Hwa ; Kim, Nam Soo ; Eom, Kiwan ; Lee, Jaewon
Author_Institution :
Sch. of Electr. Eng., Seoul Nat. Univ., Seoul, South Korea
Abstract :
The performance of a speech recognition system may be degraded even without any background noise because of the linear or non-linear distortions incurred by recording devices or reverberations. One of the well-known approaches to reduce this channel distortion is feature mapping which maps the distorted speech feature to its clean counterpart. The feature mapping rule is usually trained based on a set of stereo data which consists of the simultaneous recordings obtained in both the reference and target conditions. In this paper, we propose a novel approach to speech feature sequence mapping based on the switching linear dynamic transducer (SLDT). The proposed algorithm enables us a sequence-to-sequence mapping in a systematic way, instead of the traditional vector-to-vector mapping. The proposed approach is applied to compensate channel distortion in speech recognition and shows improvement in recognition performance.
Keywords :
distortion; speech recognition; SLDT; channel distortion; distorted speech feature; linear distortions; nonlinear distortions; sequence-to-sequence mapping; speech feature sequence mapping; speech recognition; speech recognition system; stereo data based speech feature mapping; switching linear dynamic transducer; vector-to-vector mapping; Hidden Markov models; Performance evaluation; Speech; Speech recognition; Switches; Transducers; Vectors; Switching linear dynamic transducer; channel compensation; feature mapping; stereo data;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947423