• DocumentCode
    3423363
  • Title

    An empirical study of automatic accent classification

  • Author

    Choueiter, Ghinwa ; Zweig, Geoffrey ; Nguyen, Patrick

  • Author_Institution
    Massachusetts Inst. of Technol., Cambridge, MA
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    4265
  • Lastpage
    4268
  • Abstract
    This paper extends language identification (LID) techniques to a large scale accent classification task: 23-way classification of foreign-accented English. We find that a purely acoustic approach based on a combination of heteroscedastic linear discriminant analysis (HLDA) and maximum mutual information (MMI) training is very effective. In contrast to LID tasks, methods based on parallel language models prove much less effective. We focus on the Oregon Graduate Institute Foreign-Accented English dataset, and obtain a detection rate of 32%, which to our knowledge is the best reported result for 23-way accent classification.
  • Keywords
    acoustic signal processing; natural languages; signal classification; speech recognition; statistical analysis; acoustic approach; automatic accent classification; foreign-accented English; heteroscedastic linear discriminant analysis; language identification; maximum mutual information training; parallel language model; Acoustic signal detection; Advertising; Demography; Hidden Markov models; Large-scale systems; Linear discriminant analysis; Mutual information; Natural languages; Parallel languages; Statistics; Accent classifier; GMM; Gaussian tokenization; MMI; language identification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2008.4518597
  • Filename
    4518597