• DocumentCode
    698003
  • Title
    Multimodal speaker localization from omnidirectional videos
  • Author
    Reuse, Pascal ; Gurban, Mihai ; Austvoll, Ivar ; Thiran, Jean-Philippe
  • Author_Institution
    Signal Process. Lab. 5, Ecole Polytech. Fed. de Lausanne, Lausanne, Switzerland
  • fYear
    2009
  • fDate
    24-28 Aug. 2009
  • Firstpage
    735
  • Lastpage
    739
  • Abstract
    The use of omnidirectional cameras for videoconferencing promises to simplify the hardware setup necessary for large groups of participants. We investigate the use of a multimodal speaker detection algorithm on audio-visual sequences captured with such a camera, specifically an algorithm that combines the audio energy with the optical flow. We analyze several optical flow methods to determine which one is appropriate for the omnidirectional context.
  • Keywords
    audio-visual systems; cameras; image sequences; speaker recognition; teleconferencing; audio-visual sequence capture; multimodal speaker detection algorithm; multimodal speaker localization; omnidirectional camera; omnidirectional videoconferencing; optical flow method; Cameras; Computer vision; Image motion analysis; Integrated optics; Optical distortion; Optical filters; Optical imaging
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Signal Processing Conference, 2009 17th European
  • Conference_Location
    Glasgow
  • Print_ISBN
    978-161-7388-76-7
  • Type
    conf
  • Filename
    7077577