• DocumentCode
    2279282
  • Title

    Improved pronunciation modelling by inverse word frequency and pronunciation entropy

  • Author

    Tsai, Ming-Yi ; Chou, Fu-Chiang ; Lee, Lin-shan

  • Author_Institution
    Graduate Inst. of Commun. Eng., Nat. Taiwan Univ., Taipei, Taiwan
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    53
  • Lastpage
    56
  • Abstract
    We propose a new approach to rank the potential pronunciations for each word by their pronunciation frequency and inverse word frequency (pf-iwf) weights. The pronunciation set obtained in this way can then be pruned with different criteria. This approach not only considers the frequencies of occurrence of the pronunciations, but tries to minimize the extra confusion which may be introduced by pronunciation variations, such that the best overall performance can be achieved. A new entropy-based approach for pruning the pronunciation variations is also proposed. Experimental results showed that the proposed approach can not only improve the recognition performance, but make the performance more stable and less sensitive to various parameters, factors and options including the different pruning criteria. All the experiments were performed with the LDC Mandarin Call Home corpus, although the approaches and principles are definitely not limited to Mandarin Chinese.
  • Keywords
    entropy; speech recognition; statistical analysis; ASR; LDC Mandarin Call Home corpus; Mandarin Chinese; inverse word frequency; pronunciation entropy; pronunciation frequency; pruning criteria; speech recognition performance; Automatic speech recognition; Costs; Dynamic programming; Entropy; Frequency; Heuristic algorithms; Inverse problems; Natural languages; Training data; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on
  • Print_ISBN
    0-7803-7343-X
  • Type

    conf

  • DOI
    10.1109/ASRU.2001.1034587
  • Filename
    1034587