Title :
A novel stream-weight method for the multi-stream speech recognition system
Author :
Guo, Hongyu ; Zhao, Xiaoqun ; Guo, Hongmiao
Author_Institution :
Sch. of Electron. & Inf. Eng., Tongji Univ., Shanghai, China
Abstract :
A multi-stream speech recognition system is based on the combination of multiple complementary feature streams. Utilizing the fusion scheme of multi-stream, better performance was achieved in speech recognition system. The stream-weight method plays a very important role in the fusion collaborative scheme. The stream weights should be selected to be proportional to the feature stream reliability and informativeness. The posterior probability estimate is a measure of reliability, and the classification error is a measure of informativeness. The larger separation between class distributions in a given stream implies better discriminative power. The intra-class distances are an estimate of the class variance. The inter- and intra-class distances are combined to yield and estimate of the misclassification error for each stream. An unsupervised stream weight estimation method for multi-stream speech recognition system based on the computation of intra-and inter-class distances in each stream is proposed here. Experiments are conducted using Chinese Academy of Science speech database. Applying the new stream-weigh algorithm, we achieve better fusion performance compared with some traditional fusion methods, and the word error rate was decreased by 6%.
Keywords :
estimation theory; pattern classification; probability; sensor fusion; speech recognition; classification error; feature stream reliability; fusion collaborative scheme; intra-class distances; multiple complementary feature streams; multistream speech recognition system; posterior probability estimate; unsupervised stream weight estimation method; word error rate; Approximation methods; Artificial intelligence; Hidden Markov models; Measurement uncertainty; Mel frequency cepstral coefficient; Signal to noise ratio; Training; hidden markov models; inter-class distance; intra-class distance; linear predictive cepstral coefficient; mel-frequency cepstral coefficient; multistream framework; stream weighst;
Conference_Titel :
Intelligent Computing and Intelligent Systems (ICIS), 2010 IEEE International Conference on
Conference_Location :
Xiamen
Print_ISBN :
978-1-4244-6582-8
DOI :
10.1109/ICICISYS.2010.5658488