  • DocumentCode
    3422831
  • Title
    Discriminative learning for optimizing detection performance in spoken language recognition
  • Author
    Zhu, Donglai; Li, Haizhou; Ma, Bin; Lee, Chin-Hui
  • Author_Institution
    Inst. for Infocomm Res., Singapore
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    4161
  • Lastpage
    4164
  • Abstract
    We propose novel approaches for optimizing the detection performance in spoken language recognition. Two objective functions are designed to directly relate model parameters to two performance metrics of interest, the detection cost function and the area under the detection-error-tradeoff curve, respectively. Both metrics are approximated with differentiable functions of model parameters by using a smoothing function based on a class misclassification measure. The model parameters are optimized by using the generalized probabilistic descent algorithm. We conduct experiments on the NIST 2003 and 2005 Language Recognition Evaluation corpora. Results show that the proposed approaches effectively improve the performance over the maximum likelihood training approach.
  • Keywords
    Gaussian processes; learning (artificial intelligence); natural language processing; speech recognition; Gaussian mixture model; NIST Language Recognition Evaluation corpora; class misclassification measure; detection cost function; detection performance optimization; detection-error-tradeoff curve; differentiable functions; discriminative learning; generalized probabilistic descent algorithm; maximum likelihood training approach; model parameters; smoothing function; spoken language recognition; Cost function; Detectors; Error analysis; Maximum likelihood detection; NIST; Natural languages; Pattern recognition; Smoothing methods; Support vector machines; detection error tradeoff
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type
    conf
  • DOI
    10.1109/ICASSP.2008.4518571
  • Filename
    4518571