• DocumentCode
    3443377
  • Title

    A novel stream-weight method for the multi-stream speech recognition system

  • Author

    Guo, Hongyu ; Zhao, Xiaoqun ; Guo, Hongmiao

  • Author_Institution
    Sch. of Electron. & Inf. Eng., Tongji Univ., Shanghai, China
  • Volume
    3
  • fYear
    2010
  • fDate
    29-31 Oct. 2010
  • Firstpage
    179
  • Lastpage
    182
  • Abstract
    A multi-stream speech recognition system is based on the combination of multiple complementary feature streams. Utilizing the fusion scheme of multi-stream, better performance was achieved in speech recognition system. The stream-weight method plays a very important role in the fusion collaborative scheme. The stream weights should be selected to be proportional to the feature stream reliability and informativeness. The posterior probability estimate is a measure of reliability, and the classification error is a measure of informativeness. The larger separation between class distributions in a given stream implies better discriminative power. The intra-class distances are an estimate of the class variance. The inter- and intra-class distances are combined to yield and estimate of the misclassification error for each stream. An unsupervised stream weight estimation method for multi-stream speech recognition system based on the computation of intra-and inter-class distances in each stream is proposed here. Experiments are conducted using Chinese Academy of Science speech database. Applying the new stream-weigh algorithm, we achieve better fusion performance compared with some traditional fusion methods, and the word error rate was decreased by 6%.
  • Keywords
    estimation theory; pattern classification; probability; sensor fusion; speech recognition; classification error; feature stream reliability; fusion collaborative scheme; intra-class distances; multiple complementary feature streams; multistream speech recognition system; posterior probability estimate; unsupervised stream weight estimation method; word error rate; Approximation methods; Artificial intelligence; Hidden Markov models; Measurement uncertainty; Mel frequency cepstral coefficient; Signal to noise ratio; Training; hidden markov models; inter-class distance; intra-class distance; linear predictive cepstral coefficient; mel-frequency cepstral coefficient; multistream framework; stream weighst;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Computing and Intelligent Systems (ICIS), 2010 IEEE International Conference on
  • Conference_Location
    Xiamen
  • Print_ISBN
    978-1-4244-6582-8
  • Type

    conf

  • DOI
    10.1109/ICICISYS.2010.5658488
  • Filename
    5658488