DocumentCode :
312167
Title :
Speech recognition based on acoustically derived segment units
Author :
Fukada, Toshiaki ; Bacchiani, Michiel ; Paliwal, Kuldip K. ; Sagisaka, Yoshinori
Author_Institution :
ATR Interpreting Telecommun. Res. Labs., Japan
Volume :
2
fYear :
1996
fDate :
3-6 Oct 1996
Firstpage :
1077
Abstract :
The paper describes a new method of word model generation based on acoustically derived segment units (henceforth ASUs). An ASU-based approach has the advantages of growing out of human pre-determined phonemes and of consistently generating acoustic units by using the maximum likelihood (ML) criterion. The former advantage is effective when it is difficult to map acoustics to a phone such as with highly co-articulated spontaneous speech. In order to implement an ASU-based modeling approach in a speech recognition system, one must first solve two points: (1) how does one design an inventory of acoustically-derived segmental units and (2) how does one model the pronunciations of lexical entries in terms of the ASUs. As for the second question, the authors propose an ASU-based word model generation method by composing the ASU statistics, that is, their means, variances and durations. The effectiveness of the proposed method is shown through spontaneous word recognition experiments
Keywords :
maximum likelihood estimation; speech recognition; statistics; acoustically derived segment units; durations; highly co-articulated spontaneous speech; human pre-determined phonemes; lexical entry pronunciation; maximum likelihood criterion; speech recognition; spontaneous word recognition experiments; statistics; variances; word model generation; Acoustical engineering; Acoustics; Character recognition; Context modeling; Databases; Humans; Speech recognition; Statistical distributions; Stochastic processes; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
Type :
conf
DOI :
10.1109/ICSLP.1996.607792
Filename :
607792
Link To Document :
بازگشت