Title :
On the generation and use of a segment dictionary for speech coding, synthesis and recognition
Author :
Chollet, G. ; Galliano, J.F. ; Lefevre, J.P. ; Viara, E.
Author_Institution :
ENST, Paris, Cedex
Abstract :
A methodology is described to obtain a set of segments and rules that represents adequately the speech performance of a given speaker. This methodology proceeds from an initial set of diphones extracted from a neutral context and modify this set with larger and/or smaller segments depending on the match with natural utterances. Each segment is stored as a sequence of frames coded using LPC coefficients. An estimate of the likelihood of timescale distortion is associated with each frame. It represents knowledge on temporal variability that can be used by synthesis rules and/or pattern matching algorithms. It is then shown how such a segment data base can be used for 1) speech coding at very low bit rate ( ∼ 400 bit/sec), 2) synthesis from unrestricted text, 3) continuous speech recognition.
Keywords :
Acoustic distortion; Bit rate; Costs; Dictionaries; Linear predictive coding; Pattern matching; Speech coding; Speech processing; Speech recognition; Speech synthesis;
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '83.
DOI :
10.1109/ICASSP.1983.1172018