Title :
Design of a speech recognition system based on acoustically derived segmental units
Author :
Bacchiani, M. ; Ostendorf, M. ; Sagisaka, Y. ; Paliwal, K.
Author_Institution :
ATR Interpreting Telecommun. Res. Labs., Kyoto, Japan
Abstract :
The design of a speech recognition system based on acoustically-derived, segmental units can be divided in three steps: unit design, lexicon building and pronunciation modeling. We formulate an iterative unit design procedure which consistently uses a maximum likelihood (ML) objective in successive application of resegmentation and model re-estimation. The lexicon building allows multi-word entries in the lexicon but restricts the number of these entries in order to avoid a too costly search. Selected multi-word lexical entries are those with high frequency (such as function words) and those which consistently exhibit cross-word phone assimilation. The stochastic pronunciation model represents the likelihood of a particular acoustic segment sequence given the phonetic baseform of a lexical item, where the sequence of baseform phones are treated as a Markov state sequence and each state can emit multiple segments
Keywords :
Markov processes; iterative methods; maximum likelihood estimation; sequences; speech recognition; Markov state sequence; acoustic segment sequence; acoustically derived segmental units; baseform phones; cross-word phone assimilation; function words; iterative unit design procedure; lexicon building; maximum likelihood; model re-estimation; multi-word entries; phonetic baseform; pronunciation modeling; resegmentation; speech recognition system; stochastic pronunciation model; Acoustical engineering; Buildings; Cepstral analysis; Degradation; Design engineering; Frequency; Maximum likelihood estimation; Polynomials; Speech recognition; Stochastic processes;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
0-7803-3192-3
DOI :
10.1109/ICASSP.1996.541128