• DocumentCode
    2875101
  • Title

    An integrated approach to the detection and classification of accents/dialects for a spoken document retrieval system

  • Author

    Gray, Sharmistha ; Hansen, John H L

  • Author_Institution
    Dept. of Speech, Language & Hearing Sci., Colorado Univ., Boulder, CO
  • fYear
    2005
  • fDate
    27-27 Nov. 2005
  • Firstpage
    35
  • Lastpage
    40
  • Abstract
    In this study, an integrated approach to accent/dialect detection and classification is proposed, which can be used for enhancing Rich indexing of historical spoken documents with accent/dialect information. A next generation spoken document retrieval (SDR) system would require a more diverse set of speech criteria including speaker, accent/dialect, language, stress/emotion and environment content. The proposed accent/dialect tagging system for SDR is based on several recent advances in a multi-dimensional space. Here, temporal and spectral based features including the stochastic trajectory model (STM), pitch structure, formant location and voice onset time (VOT) are considered. Mono-phone based STM (MP-STM) is shown to be the most successful for dialect classification with an average rate of 96.5% for read speech and 72.5% for spontaneous speech, while classifying four dialects. An example of next generation Rich transcript indexing for conversational speech to simulate SDR is also presented
  • Keywords
    classification; indexing; information retrieval; natural languages; speech processing; Rich transcript indexing; accent detection; accent-dialect tagging system; dialect classification; dialect detection; formant location; pitch structure; speech criteria; spoken document retrieval system; stochastic trajectory model; voice onset time; Auditory system; Automatic speech recognition; Content based retrieval; Indexing; Information retrieval; Natural languages; Robustness; Speech enhancement; Speech processing; Stress;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 2005 IEEE Workshop on
  • Conference_Location
    San Juan
  • Print_ISBN
    0-7803-9478-X
  • Electronic_ISBN
    0-7803-9479-8
  • Type

    conf

  • DOI
    10.1109/ASRU.2005.1566480
  • Filename
    1566480