DocumentCode
2279282
Title
Improved pronunciation modelling by inverse word frequency and pronunciation entropy
Author
Tsai, Ming-Yi ; Chou, Fu-Chiang ; Lee, Lin-shan
Author_Institution
Graduate Inst. of Commun. Eng., Nat. Taiwan Univ., Taipei, Taiwan
fYear
2001
fDate
2001
Firstpage
53
Lastpage
56
Abstract
We propose a new approach to rank the potential pronunciations for each word by their pronunciation frequency and inverse word frequency (pf-iwf) weights. The pronunciation set obtained in this way can then be pruned with different criteria. This approach not only considers the frequencies of occurrence of the pronunciations, but tries to minimize the extra confusion which may be introduced by pronunciation variations, such that the best overall performance can be achieved. A new entropy-based approach for pruning the pronunciation variations is also proposed. Experimental results showed that the proposed approach can not only improve the recognition performance, but make the performance more stable and less sensitive to various parameters, factors and options including the different pruning criteria. All the experiments were performed with the LDC Mandarin Call Home corpus, although the approaches and principles are definitely not limited to Mandarin Chinese.
Keywords
entropy; speech recognition; statistical analysis; ASR; LDC Mandarin Call Home corpus; Mandarin Chinese; inverse word frequency; pronunciation entropy; pronunciation frequency; pruning criteria; speech recognition performance; Automatic speech recognition; Costs; Dynamic programming; Entropy; Frequency; Heuristic algorithms; Inverse problems; Natural languages; Training data; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on
Print_ISBN
0-7803-7343-X
Type
conf
DOI
10.1109/ASRU.2001.1034587
Filename
1034587
Link To Document