• DocumentCode
    3132597
  • Title

    Noisy channel adaptation in language identification

  • Author

    Ganapathy, Shrikanth ; Omar, Murad ; Pelecanos, Jason

  • Author_Institution
    IBM T.J Watson Res. Center, Yorktown Heights, NY, USA
  • fYear
    2012
  • fDate
    2-5 Dec. 2012
  • Firstpage
    307
  • Lastpage
    312
  • Abstract
    Language identification (LID) of speech data recorded over noisy communication channels is a challenging problem especially when the LID system is tested on speech data from an unseen communication channel (not seen in training). In this paper, we consider the scenario in which a small amount of adaptation data is available from a new communication channel. Various approaches are investigated for efficient utilization of the adaptation data in a supervised as well as unsupervised setting. In a supervised adaptation framework, we show that support vector machines (SVMs) with higher order polynomial kernels (HO-SVM) trained using lower dimensional representations of the the Gaussian mixture model supervectors (GSVs) provide significant performance improvements over the baseline SVM-GSV system. In these LID experiments, we obtain 30% reduction in error-rate with 6 hours of adaptation data for a new channel. For unsupervised adaptation, we develop an iterative procedure for re-labeling the development data using a co-training framework. In these experiments, we obtain considerable improvements(relative improvements of 13 %) over a self-training framework with the HO-SVM models.
  • Keywords
    Gaussian processes; natural language processing; polynomials; speech processing; support vector machines; vectors; Gaussian mixture model supervector; HO-SVM; LID; SVM-GSV system; co-training framework; higher order polynomial kernels; iterative procedure; language identification; noisy channel adaptation; noisy communication channels; self-training framework; speech data; supervised adaptation framework; support vector machines; Adaptation models; Kernel; Noise measurement; Principal component analysis; Speech; Support vector machines; Training; Language Identification; Noisy Communication Channel; Supervised and Unsupervised Adaptation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language Technology Workshop (SLT), 2012 IEEE
  • Conference_Location
    Miami, FL
  • Print_ISBN
    978-1-4673-5125-6
  • Electronic_ISBN
    978-1-4673-5124-9
  • Type

    conf

  • DOI
    10.1109/SLT.2012.6424241
  • Filename
    6424241