• DocumentCode
    257795
  • Title
    Improving overlapping speaker detection using multiple speaker tracking information
  • Author
    Oualil, Youssef ; Toroghi, Rahil Mahdian ; Klakow, Dietrich
  • Author_Institution
    Spoken Language Syst., Saarland Univ., Saarbrücken, Germany
  • fYear
    2014
  • fDate
    3-5 Dec. 2014
  • Firstpage
    552
  • Lastpage
    556
  • Abstract
    Traditionally, multiple speaker tracking consists of two stages, namely, 1) detection of location measurements, followed by 2) a multiple object tracking approach. In general, these two steps are performed separately, and the tracking performance is highly dependent on the measurement detection rate. The performance of the widely used Steered Response Power (SRP)-based measurement detectors, however, drastically decreases in the overlapping speech scenario, where the dominant speaker frequently masks the low-energy speakers. To overcome this problem, we propose an approach that enhances the probabilistic SRP-based measurement detector, using the multiple speaker information obtained in the tracking step. In doing so, this approach tightly couples the two stages, and increases the detection rate of low-energy speakers during overlapping speech segments. Experiments conducted on the AV16.3 corpus showed a significant improvement of the detection and tracking performance, when the proposed approach is integrated into a Kalman-based multiple speaker tracking framework.
  • Keywords
    object tracking; speaker recognition; AV16.3 corpus; Kalman-based multiple speaker tracking framework; SRP-based measurement detector; location measurement detection; low-energy speakers; multiple object tracking approach; multiple speaker tracking information; overlapping speaker detection; overlapping speech segment; probabilistic SRP-based measurement detector; steered response power; Bayes methods; Detectors; Microphones; Noise; Speech; Speech processing; Target tracking; Kalman filter; Speaker overlap; conversational speech; multiple speaker tracking
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Signal and Information Processing (GlobalSIP), 2014 IEEE Global Conference on
  • Conference_Location
    Atlanta, GA
  • Type
    conf
  • DOI
    10.1109/GlobalSIP.2014.7032178
  • Filename
    7032178