• DocumentCode
    294600
  • Title

    Auditory scene analysis and hidden Markov model recognition of speech in noise

  • Author

    Green, P.D. ; Cooke, M.P. ; Crawford, M.D.

  • Author_Institution
    Dept. of Comput. Sci., Sheffield Univ., UK
  • Volume
    1
  • fYear
    1995
  • fDate
    9-12 May 1995
  • Firstpage
    401
  • Abstract
    We describe a novel paradigm for automatic speech recognition in noisy environments in which an initial stage of auditory scene analysis separates out the evidence for the speech to be recognised from the evidence for other sounds. In general, this evidence will be incomplete, since intruding sound sources will dominate some spectro-temporal regions. We generalise continuous-density hidden Markov model recognition to this `occluded speech´ case. The technique is based on estimating the probability that a Gaussian mixture density distribution for an auditory firing rate map will generate an observation such that the separated components are at their observed values and the remaining components are not greater than their values in the acoustic mixture. Experiments on isolated digit recognition in noise demonstrate the potential of the new approach to yield performance comparable to that of listeners
  • Keywords
    Gaussian distribution; Gaussian processes; acoustic noise; acoustic signal processing; hearing; hidden Markov models; speech processing; speech recognition; Gaussian mixture density distribution; acoustic mixture; auditory firing rate map; auditory scene analysis; automatic speech recognition; continuous-density hidden Markov model; experiments; hidden Markov model recognition; isolated digit recognition; noise; noisy environments; occluded speech; performance; probability; sound sources; spectro-temporal regions; Acoustic noise; Automatic speech recognition; Computational modeling; Hidden Markov models; Image analysis; Noise robustness; Signal to noise ratio; Speech analysis; Speech enhancement; Speech recognition; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
  • Conference_Location
    Detroit, MI
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-2431-5
  • Type

    conf

  • DOI
    10.1109/ICASSP.1995.479606
  • Filename
    479606