DocumentCode
1277402
Title
A time-varying analysis method for rapid transitions in speech
Author
Nathan, Krishna S. ; Lee, Yi-Teh ; Silverman, Harvey F.
Author_Institution
Div. of Eng., Brown Univ., Providence, RI, USA
Volume
39
Issue
4
fYear
1991
fDate
4/1/1991 12:00:00 AM
Firstpage
815
Lastpage
824
Abstract
A linear predictive coding (LPC) model based on time-dependent poles which has yielded promising results when applied to synthetic data is applied to real speech data. The data are processed pitch-synchronously using a simple procedure to identify regions of the data that best fit the model. The maximum-likelihood technique, which has been found to be robust in the presence of noise, is used to estimate the parameters. Resulting formant estimates for several diphthongs are presented. The algorithm tracks the formants well, both in stable regions and in regions of transition. This ability to track formant variation within analysis intervals is a definite advantage over traditional LPC. Results from speech data involving final stop consonants are presented. Rapid changes, particularly in the first and second formants, in the region immediately prior to the stop are detected. Such abrupt transitions are often not detected by traditional time-invariant methods
Keywords
parameter estimation; speech analysis and processing; LPC model; diphthongs; final stop consonants; formant estimates; linear predictive coding; maximum-likelihood technique; parameter estimation; pitch-synchronous processing; rapid transitions in speech; time-dependent poles; time-varying analysis method; Data mining; Difference equations; Linear predictive coding; Maximum likelihood detection; Maximum likelihood estimation; Noise robustness; Parameter estimation; Parameter extraction; Parametric statistics; Speech analysis;
fLanguage
English
Journal_Title
Signal Processing, IEEE Transactions on
Publisher
ieee
ISSN
1053-587X
Type
jour
DOI
10.1109/78.80903
Filename
80903
Link To Document