• DocumentCode
    178305
  • Title

    Robust DOA estimation of multiple speech sources

  • Author

    Nguyen Thi Ngoc Tho ; Shengkui Zhao ; Jones, Douglas L.

  • Author_Institution
    Adv. Digital Sci. Center (ADSC), Singapore, Singapore
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    2287
  • Lastpage
    2291
  • Abstract
    It is challenging to determine the directions of arrival of speech signals when there are fewer sensors than sources, particularly in noisy and reverberant environments. The coherence test by Mohan et al. exploits the time-frequency sparseness of non-stationary speech signals to select more relevant time-frequency bins to estimate directions of arrival. With no prior knowledge about the incoming sources, this work proposes a combination of noise-floor tracking, onset detection and a coherence test to robustly identify time-frequency bins where only one source is dominant. After that, the largest eigenvectors of covariance matrices corresponding to these bins are clustered and the directions of arrival of the sources are estimated based on the cluster centroids. Simulation and experimental results show that this method is able to localize 8 sources with small errors using only 3 omnidirectional microphones. The proposed method is robust to background noise and reverberation.
  • Keywords
    covariance matrices; direction-of-arrival estimation; eigenvalues and eigenfunctions; microphone arrays; reverberation; time-frequency analysis; background noise; cluster centroids; coherence test; covariance matrices; directions of arrival estimation; eigenvectors; multiple speech sources; noise-floor tracking; noisy environment; nonstationary speech signals; omnidirectional microphones; onset detection; reverberant environment; reverberation; robust DOA estimation; time-frequency bins; time-frequency sparseness; Arrays; Coherence; Covariance matrices; Direction-of-arrival estimation; Estimation; Microphones; Speech; coherence test; direction of arrival estimation; eigenvector; microphone array; time-frequency;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6854007
  • Filename
    6854007