• DocumentCode
    1858186
  • Title

    Statistical language identification based on untranscribed training

  • Author

    Lund, Michael A. ; Ma, Kristine ; Gish, Herbert

  • Author_Institution
    BBN Syst. & Technols., Cambridge, MA, USA
  • Volume
    2
  • fYear
    1996
  • fDate
    7-10 May 1996
  • Firstpage
    793
  • Abstract
    BBN´s baseline language identification (LID) system tokenizes utterances based on an English hidden Markov model (HMM) phone recognizer and uses language-dependent phone-bigram models to discriminate between languages. This is clearly a suboptimal procedure, as English phone models may fail to provide a meaningful tokenization of non-English speech. We address this problem through the use of parametric acoustic segment models derived from untranscribed target-language training. The paper describes some promising exploratory experiments related to model-segment selection and LID based on nearest neighbor classification, as well as an LID system in which language-specific HMMs are trained from unsupervised clustering of parametric acoustic segments. In addition, it describes an experiment in which phone HMMs trained on CallHome Mandarin are added to the baseline system, resulting in an error reduction of 30% on pairwise language discrimination on the OGI-TS corpus
  • Keywords
    acoustic signal processing; hidden Markov models; natural languages; speech recognition; statistical analysis; CallHome Mandarin; English hidden Markov model; English phone models; HMM phone recognizer; OGI-TS corpus; baseline language identification; baseline system; error reduction; exploratory experiments; language discrimination; language-dependent phone-bigram models; model segment selection; nearest neighbor classification; nonEnglish speech; pairwise language discrimination; parametric acoustic segment models; parametric acoustic segments; statistical language identification; suboptimal procedure; unsupervised clustering; untranscribed target-language training; untranscribed training; Acoustic measurements; Clustering algorithms; Databases; Error analysis; Hidden Markov models; Natural languages; Nearest neighbor searches; Parametric statistics; Speech recognition; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
  • Conference_Location
    Atlanta, GA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-3192-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1996.543240
  • Filename
    543240