• DocumentCode
    3346947
  • Title

    Automatic and language independent triphone training using phonetic tables [speech recognition]

  • Author

    Netsch, Lorin ; Bernard, Alexis

  • Author_Institution
    DSP Solutions R&D Center, Texas Instrum. Inc., Dallas, TX, USA
  • Volume
    5
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    Training triphone acoustic models for speech recognition is time-consuming and requires important manual intervention. We present an alternative solution, performing automatic training by use of a pronunciation phonetic table which summarizes the articulatory characteristics of the target language. The method is able to train triphones for any language, given an existing set of reference monophones in one or more languages, by automatically performing the tasks of monophone seeding, triphone clustering and other training steps. The automatic nature of the training algorithm lends itself to parameter optimization, which can further improve recognition accuracy with respect to manually trained models. In a continuous digit recognition experiment, it is shown that automatically generated triphone models gave a 1.26% error rate, compared to a 2.30% error rate for its manual counterpart.
  • Keywords
    speech processing; speech recognition; ASR; automatic speech recognition; automatic triphone training; language independent triphone training; monophone seeding; parameter optimization; pronunciation phonetic tables; recognition accuracy; reference monophones; speech recognition; target language articulatory characteristics; trained acoustic model parameters; triphone clustering; Automatic speech recognition; Clustering algorithms; Costs; Databases; Digital signal processing; Error analysis; Humans; Instruments; Manuals; Research and development;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1327106
  • Filename
    1327106