• DocumentCode
    730112
  • Title

    A joint audio-visual approach to audio localization

  • Author

    Jensen, Jesper Rindom ; Christensen, Mads Graesboll

  • Author_Institution
    Audio Anal. Lab., Aalborg Univ., Aalborg, Denmark
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    454
  • Lastpage
    458
  • Abstract
    Localization of audio sources is an important research problem, e.g., to facilitate noise reduction. In the recent years, the problem has been tackled using distributed microphone arrays (DMA). A common approach is to apply direction-of-arrival (DOA) estimation on each array (denoted as nodes), and then map the DOA estimates to a location. In practice, however, the individual nodes contain few microphones, limiting the DOA estimation accuracy and, thereby, also the localization performance. We investigate a new approach, where range estimates are also obtained and utilized from each node, e.g., using time-of-flight cameras. Moreover, we propose an optimal method for weighting such DOA and range information for audio localization. Our experiments on both synthetic and real data show that there is a clear, potential advantage of using the joint audio-visual localization framework.
  • Keywords
    audio signal processing; audio-visual systems; cameras; direction-of-arrival estimation; interference suppression; microphone arrays; DMA; DOA estimation; audio sources; direction-of-arrival estimation; distributed microphone arrays; joint audio-visual localization; noise reduction; time-of-flight cameras; Accuracy; Cameras; Direction-of-arrival estimation; Microphone arrays; Noise; Sensors; DOA; Localization; distributed microphone arrays; optimal weighting; range; time-of-flight camera;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178010
  • Filename
    7178010