• DocumentCode
    835932
  • Title

    Advances in phone-based modeling for automatic accent classification

  • Author

    Angkititrakul, Pongtep ; Hansen, John H L

  • Author_Institution
    Center for Robust Speech Syst., Univ. of Texas, Richardson, TX, USA
  • Volume
    14
  • Issue
    2
  • fYear
    2006
  • fDate
    3/1/2006 12:00:00 AM
  • Firstpage
    634
  • Lastpage
    646
  • Abstract
    It is suggested that algorithms capable of estimating and characterizing accent knowledge would provide valuable information in the development of more effective speech systems such as speech recognition, speaker identification, audio stream tagging in spoken document retrieval, channel monitoring, or voice conversion. Accent knowledge could be used for selection of alternative pronunciations in a lexicon, engage adaptation for acoustic modeling, or provide information for biasing a language model in large vocabulary speech recognition. In this paper, we propose a text-independent automatic accent classification system using phone-based models. Algorithm formulation begins with a series of experiments focused on capturing the spectral evolution information as potential accent sensitive cues. Alternative subspace representations using principal component analysis and linear discriminant analysis with projected trajectories are considered. Finally, an experimental study is performed to compare the spectral trajectory model framework to a traditional hidden Markov model recognition framework using an accent sensitive word corpus. System evaluation is performed using a corpus representing five English speaker groups with native American English, and English spoken with Mandarin Chinese, French, Thai, and Turkish accents for both male and female speakers.
  • Keywords
    natural languages; principal component analysis; speech processing; speech recognition; automatic accent classification; linear discriminant analysis; phone-based modeling; principal component analysis; spectral evolution information; text-independent accent classification; Adaptation model; Hidden Markov models; Information retrieval; Loudspeakers; Monitoring; Natural languages; Speech recognition; Streaming media; Tagging; Vocabulary; Automatic accent classification; dialect modeling; open accent classification; phoneme recognition; spectral trajectory modeling; speech recognition;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TSA.2005.851980
  • Filename
    1597266