• DocumentCode
    394250
  • Title

    Symbolic speaker adaptation with phone inventory expansion

  • Author

    Lee, Kyung-Tuk ; Melnar, L. ; Talley, Jim ; Wellekens, Christian J.

  • Author_Institution
    Inst. Eurecom, Sophia Antipolis, France
  • Volume
    1
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    This paper further develops a previously proposed adaptation method for speech recognition called symbolic speaker adaptation (SSA). The basic idea of SSA is to model a speaker\´s pronunciation as a blend of speech varieties (SVs) - regional dialects and foreign accents - for which the system has existing pronunciation models. The system determines during an adaptation process the relative applicability of those models, yielding a speech variety profile (SVP) for each speaker. Speaker-dependent lexica for recognition are determined from a speaker\´s SVP. In this paper, we discuss a series of experiments designed to analyze how the SSA method is affected by SV-balanced training, expanded phone inventories, reduced amounts of adaptation data, and speech from SVs not modeled by the system. The most dramatic improvements were obtained by using expanded ("SV-inclusive") phone inventories. SSA was also shown to be effective with a very small number of adaptation sentences. And, SSA\´s SV blending scheme yields higher accuracy than using a SV classification scheme for speakers of novel (unseen) SVs.
  • Keywords
    speech recognition; SV-balanced training; adaptation method; blending scheme; expanded phone inventories; foreign accents; pronunciation; regional dialects; speaker-dependent lexica; speech recognition; speech varieties; speech variety profile; symbolic speaker adaptation; Automatic speech recognition; Databases; Degradation; Design methodology; Error analysis; Natural languages; Speech analysis; Speech processing; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1198776
  • Filename
    1198776