• DocumentCode
    3166127
  • Title

    A two-microphone based voice activity detection for distant-talking speech in wide range of direction of arrival

  • Author

    Guo, Yanmeng ; Li, Kai ; Fu, Qiang ; Yan, Yonghong

  • Author_Institution
    Key Lab. of Speech Acoust. & Content Understanding, Inst. of Acoust., Beijing, China
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    4901
  • Lastpage
    4904
  • Abstract
    In this paper, a two-microphone based voice activity detection (VAD) algorithm is proposed to detect the distant-talking speech coming randomly from a wide range of direction of arrival (DOA). The long-term information of inter-channel phase difference (LTIPD) is introduced as a target speech existence measure, which describes the concentration degree of DOA estimations on a sound source with harmonic structure. The proposed algorithm performs robustly on distant-talking speech recorded in several real environments.
  • Keywords
    direction-of-arrival estimation; microphones; speech processing; DOA estimations; LTIPD; direction of arrival estimation; distant-talking speech; harmonic structure; long-term information of interchannel phase difference; sound source; target speech existence measure; two-microphone based VAD algorithm; two-microphone based voice activity detection; Databases; Direction of arrival estimation; Estimation; Harmonic analysis; Robustness; Speech; Time frequency analysis; Voice activity detection; direction of arrival; harmonic structure; inter-channel phase difference;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6289018
  • Filename
    6289018