Regional Information Center for Science and Technology - Switching linear dynamic transducer for stereo data based speech feature mapping

DocumentCode :

2175766

Title :

Switching linear dynamic transducer for stereo data based speech feature mapping

Author :

Han, Chang Woo ; Kang, Tae Gyoon ; Hong, Doo Hwa ; Kim, Nam Soo ; Eom, Kiwan ; Lee, Jaewon

Author_Institution :

Sch. of Electr. Eng., Seoul Nat. Univ., Seoul, South Korea

fYear :

2011

fDate :

22-27 May 2011

Firstpage :

4776

Lastpage :

4779

Abstract :

The performance of a speech recognition system may degrade even in the absence of background noise because of linear or nonlinear distortions introduced by recording devices or reverberation. One well-known approach to reducing this channel distortion is feature mapping, which maps the distorted speech feature to its clean counterpart. The feature mapping rule is usually trained on a set of stereo data, which consists of simultaneous recordings obtained in both the reference and target conditions. In this paper, we propose a novel approach to speech feature sequence mapping based on the switching linear dynamic transducer (SLDT). The proposed algorithm enables sequence-to-sequence mapping in a systematic way, instead of the traditional vector-to-vector mapping. The proposed approach is applied to compensate for channel distortion in speech recognition and shows improvement in recognition performance.

Keywords :

distortion; speech recognition; SLDT; channel distortion; distorted speech feature; linear distortions; nonlinear distortions; sequence-to-sequence mapping; speech feature sequence mapping; speech recognition; speech recognition system; stereo data based speech feature mapping; switching linear dynamic transducer; vector-to-vector mapping; Hidden Markov models; Performance evaluation; Speech; Speech recognition; Switches; Transducers; Vectors; Switching linear dynamic transducer; channel compensation; feature mapping; stereo data;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on

Conference_Location :

Prague

ISSN :

1520-6149

Print_ISBN :

978-1-4577-0538-0

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2011.5947423

Filename :

5947423

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2175766