• DocumentCode
    417271
  • Title

    A factorial HMM approach to simultaneous recognition of isolated digits spoken by multiple talkers on one audio channel

  • Author

    Deoras, Ameya Nitin ; Hasegawa-Johnson, Mark

  • Author_Institution
    Illinois Univ., Urbana, IL, USA
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    This paper addresses the novel problem of recognizing digits spoken simultaneously by two different talkers. A factorial hidden Markov model architecture is proposed to accurately model the simultaneous utterance of two digits. Nadas´ (1999) MIXMAX approximation is extended to a mixture of Gaussians observation PDF which enables the implementation of the proposed system. The multiple digit recognizer is found to successfully recognize pairs of simultaneous utterances of digits at 0db SNR with up to 89% accuracy.
  • Keywords
    Gaussian distribution; hidden Markov models; speech recognition; MIXMAX approximation; audio channel; factorial HMM; hidden Markov model architecture; isolated digits; mixture of Gaussians; multiple digit recognizer; multiple talkers; observation PDF; simultaneous recognition; simultaneous utterance; Acoustic noise; Additive noise; Frequency; Gaussian approximation; Hidden Markov models; Image analysis; Loudspeakers; Noise figure; Parameter estimation; Random processes;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326122
  • Filename
    1326122