Title :
Temporal decomposition and acoustic-phonetic decoding of speech
Author :
Bimbot, Frederic ; Chollet, Gerard ; Deleglise, Paul ; Montacie, Claude
Author_Institution :
CNRS, Paris, France
Abstract :
Automatic recognition of continuous speech can use a symbolic representation of the acoustic signal to facilitate lexical access. The allophones of the language form a practical set of symbols. A major issue is the reliable localisation and identification of these units in the speech stream. Localisation is obtained using a robust implementation of temporal decomposition, a technique originally proposed by Atal (1983) for speech coding. Speech is decomposed into overlapping events, each characterized by a spectral target and a time-limited interpolation function. An undershot target may be re-estimated using its neighbours and the associated interpolation functions. The possibility of undoing the effects of coarticulation is the major contribution of this work. The corrected targets can therefore be identified without further contextual rules. The method is evaluated on the recognition of spelled surnames (letters of the alphabet): 76% of phones are correctly identified, which yields 70% correct letters.
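The decomposition model in the abstract can be illustrated with a toy sketch. This is not the authors' implementation. It assumes triangular interpolation functions, a plain least-squares re-estimation of the targets, and invented two-dimensional "spectral" vectors; every function name below is hypothetical. Each frame is modelled as y(n) = sum over k of phi_k(n) * a_k, where a_k is the target of event k and phi_k its time-limited interpolation function.

```python
# Toy sketch of temporal decomposition: y(n) ~ sum_k phi_k(n) * a_k.
# Assumptions (not from the paper): triangular interpolation functions,
# least-squares target re-estimation, made-up 2-D spectral vectors.

def triangular(center, half_width, n_frames):
    """Time-limited interpolation function: 1 at `center`, 0 beyond half_width."""
    return [max(0.0, 1.0 - abs(n - center) / half_width) for n in range(n_frames)]

def reconstruct(targets, phis):
    """Build frames y(n) = sum_k phi_k(n) * a_k from targets and functions."""
    n_frames, dim = len(phis[0]), len(targets[0])
    return [[sum(phi[n] * a[d] for phi, a in zip(phis, targets)) for d in range(dim)]
            for n in range(n_frames)]

def solve(matrix, rhs):
    """Solve matrix * x = rhs by Gauss-Jordan elimination with partial pivoting."""
    k = len(matrix)
    aug = [row[:] + [r] for row, r in zip(matrix, rhs)]
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(k):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    return [aug[i][k] / aug[i][i] for i in range(k)]

def reestimate_targets(frames, phis):
    """Least-squares targets given the frames and all interpolation functions."""
    k, dim = len(phis), len(frames[0])
    # Gram matrix of the interpolation functions (normal equations).
    gram = [[sum(pi * pj for pi, pj in zip(phis[i], phis[j])) for j in range(k)]
            for i in range(k)]
    per_dim = []
    for d in range(dim):
        rhs = [sum(p * f[d] for p, f in zip(phis[i], frames)) for i in range(k)]
        per_dim.append(solve(gram, rhs))
    return [[per_dim[d][i] for d in range(dim)] for i in range(k)]

# Three overlapping events; the middle one is overlapped by both neighbours.
targets = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
phis = [triangular(c, 4, 11) for c in (2, 5, 8)]
frames = reconstruct(targets, phis)

observed_at_centre = frames[5]                 # blended with the neighbours
recovered = reestimate_targets(frames, phis)   # uses neighbours and functions
print(observed_at_centre, recovered[1])
```

In this toy setting the frame observed at the centre of the middle event is a blend of three targets, so reading it directly would misrepresent the event. Solving for the targets with all interpolation functions at once recovers the intended vector, which is the intuition behind correcting a target that the signal itself never reaches.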
Keywords :
acoustic signal processing; decoding; encoding; speech analysis and processing; speech recognition; acoustic signal; acoustic-phonetic decoding; allophones; automatic continuous speech recognition; coarticulation; identification; language; lexical access; localisation; spectral target; speech coding; speech decoding; symbolic representation; temporal decomposition; time-limited interpolation function; undershot target; Acoustical engineering; Decoding; Natural languages; Robustness; Speech coding; Speech processing; Speech recognition;
Conference_Title :
Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on
Conference_Location :
New York, NY
DOI :
10.1109/ICASSP.1988.196614