• DocumentCode
    337486
  • Title

    Detection of target speakers in audio databases

  • Author

    Magrin-Chagnolleau, Ivan ; Rosenberg, Aaron E. ; Parthasarathy, S.

  • Author_Institution
    AT&T Bell Labs., Forham Park, NJ, USA
  • Volume
    2
  • fYear
    1999
  • fDate
    15-19 Mar 1999
  • Firstpage
    821
  • Abstract
    The problem of speaker detection in audio databases is addressed in this paper. Gaussian mixture modeling is used to build target speaker and background models. A detection algorithm based on a likelihood ratio calculation is applied to estimate target speaker segments. Evaluation procedures are defined in detail for this task. Results are given for different subsets of the HUB4 broadcast news database. For one target speaker, with the data restricted to high quality speech segments, the segment miss rate is approximately 7%. For unrestricted data, the segment miss rate is approximately 27%. In both cases the segment false alarm rate is 4 or 5 per hour. For two target speakers with unrestricted data, the segment miss rate is approximately 63% with about 27 segment false alarms per hour. The decrease in performance for two target speakers is largely associated with short speech segments in the two target speaker test data which are undetectable in the current configuration of the detection algorithm
  • Keywords
    Gaussian processes; maximum likelihood estimation; speaker recognition; Gaussian mixture modeling; HUB4 broadcast news database; audio databases; high quality speech segment; likelihood ratio calculation; performance; segment false alarm rate; segment miss rate; speaker detection; target speakers; unrestricted data; Audio databases; Broadcasting; Concatenated codes; Detection algorithms; Speech; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
  • Conference_Location
    Phoenix, AZ
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-5041-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1999.759797
  • Filename
    759797