• DocumentCode
    730435
  • Title

    A learning-based approach to direction of arrival estimation in noisy and reverberant environments

  • Author

    Xiong Xiao ; Shengkui Zhao ; Xionghu Zhong ; Jones, Douglas L. ; Eng Siong Chng ; Haizhou Li

  • Author_Institution
    Temasek Lab., Nanyang Technol. Univ., Singapore, Singapore
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    2814
  • Lastpage
    2818
  • Abstract
    This paper presents a learning-based approach to the task of direction of arrival estimation (DOA) from microphone array input. Traditional signal processing methods such as the classic least square (LS) method rely on strong assumptions on signal models and accurate estimations of time delay of arrival (TDOA) . They only work well in relatively clean conditions, but suffer from noise and reverberation distortions. In this paper, we propose a learning-based approach that can learn from a large amount of simulated noisy and reverberant microphone array inputs for robust DOA estimation. Specifically, we extract features from the generalised cross correlation (GCC) vectors and use a multilayer perceptron neural network to learn the nonlinear mapping from such features to the DOA. One advantage of the learning based method is that as more and more training data becomes available, the DOA estimation will become more and more accurate. Experimental results on simulated data show that the proposed learning based method produces much better results than the state-of-the-art LS method. The testing results on real data recorded in meeting rooms show improved root-mean-square error (RMSE) compared to the LS method.
  • Keywords
    acoustic signal processing; direction-of-arrival estimation; feature extraction; learning (artificial intelligence); microphone arrays; multilayer perceptrons; reverberation; vectors; GCC; direction of arrival estimation; feature extraction; generalised cross correlation vectors; learning-based approach; multilayer perceptron neural network; noisy environment; nonlinear mapping; reverberant environment; reverberant microphone array; robust DOA estimation; Arrays; Direction-of-arrival estimation; Estimation; Robustness; Speech; Training; Training data; direction of arrival; least squares; machine learning; microphone arrays; neural networks;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178484
  • Filename
    7178484