• DocumentCode
    1749634
  • Title

    A microphone array-based 3-D N-best search algorithm for the simultaneous recognition of multiple sound sources in real environments

  • Author

    Heracleous, Panikos ; Nakamura, Satoshi ; Shikano, K.

  • Volume
    1
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    193
  • Abstract
    Deals with the recognition of distant talking speech and, in particular, with the simultaneous recognition of multiple sound sources. A problem that must be solved in the recognition of distant talking speech is talker localization. In some approaches, the talker is localized by using short- and long-term power. The 3-D Viterbi search-based method proposed by Yamada et al. (1998) integrates talker localization and speech recognition. This method provides high recognition rates, but its application is restricted to a single talker. In order to deal with multiple talkers, we extended the 3-D Viterbi search method to a 3-D N-best search method, enabling the recognition of multiple sound sources. The paper describes our baseline 3-D N-best search-based system and two additional techniques, namely, a likelihood normalization technique and a path distance-based clustering technique. The paper also describes experiments carried out in order to evaluate the performance of the system.
  • Keywords
    acoustic generators; array signal processing; hidden Markov models; microphones; search problems; speech recognition; statistical analysis; 3D Viterbi search based method; distant talking speech; likelihood normalization technique; long-term power; microphone array-based 3D N-best search algorithm; multiple sound sources; short-term power; simultaneous recognition; talker localization; Acoustic beams; Acoustic noise; Feature extraction; Information science; Microphone arrays; Natural languages; Search methods; Speech recognition; Viterbi algorithm; Working environment noise;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
  • Conference_Location
    Salt Lake City, UT
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7041-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2001.940800
  • Filename
    940800