DocumentCode :
3001741
Title :
Recognition of lexical stress in a continuous speech understanding system - A pattern recognition approach
Author :
Waibel, Alex
Author_Institution :
Carnegie Mellon University, Pittsburgh, PA
Volume :
11
fYear :
1986
fDate :
31503
Firstpage :
2287
Lastpage :
2290
Abstract :
Stress is one of the key components in human speech perception. Its uses extend from the phonetic level over the lexical to the syntactic and semantic level. Several methods have been developed in the past to detect stress automatically from the signal. This paper takes a pattern recognition approach to the the problem of stress detection. The algorithm presented has three key features: (1) optimal combination of the evidence obtained from the acoustic correlates of stress is achieved by means of a Bayesian classifier assuming multivariate Gaussian distributions; (2) the algorithm detects lexical stress in continuously spoken English utterances; (3) rather than making hard decisions, the algorithm returns probabilities for each syllable, i.e., a measure of stressedness. The algorithm was tested over 4 databases of differing continuous speech data. When a forced decision is imposed by setting a threshold at stress probability 0.5, error rates of 7.79% to 14.85% missed stresses were obtained. Unlike in other languages (such as Japanese), amplitude integrals are the strongest predictor of English stress. Performance results and an analysis of errors are presented.
Keywords :
Acoustic measurements; Acoustic signal detection; Acoustic testing; Bayesian methods; Error analysis; Gaussian distribution; Humans; Pattern recognition; Speech recognition; Stress measurement;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '86.
Type :
conf
DOI :
10.1109/ICASSP.1986.1168788
Filename :
1168788
Link To Document :
بازگشت