• DocumentCode
    1857194
  • Title

    Automatic dialect identification of extemporaneous conversational, Latin American Spanish speech

  • Author

    Zissman, Marc A. ; Gleason, Terry P. ; Rekart, Deborah M. ; Losiewicz, Beth L.

  • Author_Institution
    Lincoln Lab., MIT, Lexington, MA, USA
  • Volume
    2
  • fYear
    1996
  • fDate
    7-10 May 1996
  • Firstpage
    777
  • Abstract
    A dialect identification technique is described that takes as input extemporaneous, conversational speech spoken in Latin American Spanish and produces as output a hypothesis of the dialect. The system has been trained to recognize Cuban and Peruvian dialects of Spanish, but could be extended easily to other dialects (and languages) as well. Building on our experience in automatic language identification, the dialect-ID system uses an English phone recognizer trained on the TIMIT corpus to tokenize training speech spoken in each Spanish dialect. Phonotactic language models generated from this tokenized training speech are used during testing to compute dialect likelihoods for each unknown message. This system has an error rate of 16% on the Cuban/Peruvian two-alternative forced-choice test. We introduce the new “Miami” Latin American Spanish speech corpus that is capable of supporting our research efforts into the future
  • Keywords
    natural languages; speech recognition; Cuban; English phone recognizer; Miami Latin American Spanish speech corpus; Peruvian; automatic dialect identification; dialect-ID system; error rate; extemporaneous conversational Latin American Spanish speech; phonotactic language models; tokenized training speech; training speech; Automatic speech recognition; Electronic mail; Error analysis; Humans; Laboratories; Natural languages; Routing; Speech recognition; Springs; System testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
  • Conference_Location
    Atlanta, GA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-3192-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1996.543236
  • Filename
    543236