• DocumentCode
    705393
  • Title
    Dialect identification: Impact of differences between read versus spontaneous speech
  • Author
    Gang Liu ; Yun Lei ; Hansen, John H. L.
  • Author_Institution
    CRSS: Center for Robust Speech Syst., Univ. of Texas at Dallas, Richardson, TX, USA
  • fYear
    2010
  • fDate
    23-27 Aug. 2010
  • Firstpage
    2003
  • Lastpage
    2006
  • Abstract
    Automatic Dialect Classification (ADC) has recently gained substantial interest in the field of speech processing. Dialects of a language are normally reflected in their phoneme space, word pronunciation/selection, and prosodic traits. These traits are clearly visible in natural speaker-to-speaker spontaneous conversations. However, dialect cues in prompted/read speech are often neglected by the community. In this study, we conduct a systematic assessment of the differences between the acoustic characteristics of spontaneous and read speech and their effects on dialect identification performance. By examining both the model space and the phoneme space of read and spontaneous dialect speech, we observe that each spans a different dialect space with distinct characteristics that need to be addressed separately. From this comparison, we find useful clues for designing more efficient identification systems. Finally, we propose a novel feature extraction technique, PMVDR-SDC, and obtain a +26.4% relative improvement in dialect recognition rate.
  • Keywords
    feature extraction; speaker recognition; speech processing; ADC; PMVDR-SDC; acoustic characteristics; automatic dialect classification; dialect identification performance; dialect recognition rate; feature extraction technique; model space; natural speaker-to-speaker spontaneous conversations; phoneme space; prompted-read speech; prosodic traits; spontaneous dialect speech processing; systematic assessment; word pronunciation-selection; Acoustics; Data models; Feature extraction; Hidden Markov models; Speech; Speech recognition; Training;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Signal Processing Conference, 2010 18th European
  • Conference_Location
    Aalborg
  • ISSN
    2219-5491
  • Type
    conf
  • Filename
    7096666