Title :
Noise adaptive stream fusion based on feature component rejection for robust multi-stream speech recognition
Author :
Jun Zhang ; Yizhi Feng ; Gengxin Ning ; Fei Ji
Author_Institution :
Sch. of Electron. & Inf. Eng., South China Univ. of Technol., Guangzhou, China
Abstract :
Weighting the stream outputs according to their reliability levels is one of the most common stream fusion methods in the multi-stream automatic speech recognition (MS ASR). However, when a MS ASR system works in noisy environments, there are distortion level differences among not only the data streams, but also the feature components inside a stream. In this paper, we first propose a feature component rejection approach that can provide the similar function as the missing data techniques while is much easier to be applied to different features. Then a new stream fusion method that can make use of the reliability information of both inter- and intra-streams is developed by incorporating the proposed feature component rejection approach into the conventional MS HMM. The proposed stream fusion method shows good noise adaptive ability and achieves similar recognition accuracy as the missing data based stream fusion method for additive noises in the experiments of the Ti digits connected word recognition task.
Keywords :
hidden Markov models; sensor fusion; speech recognition; MS ASR system; MS HMM; Ti digit-connected word recognition task; additive noises; distortion level differences; feature component rejection approach; interstreams; intrastreams; missing data based stream fusion method; noise adaptive ability; noise adaptive stream fusion; noisy environments; recognition accuracy; reliability levels; robust multistream automatic speech recognition; similar function; stream output weighting; Accuracy; Hidden Markov models; Reliability;
Conference_Titel :
Advanced Computational Intelligence (ICACI), 2015 Seventh International Conference on
Conference_Location :
Wuyi
Print_ISBN :
978-1-4799-7257-9
DOI :
10.1109/ICACI.2015.7184714