مرکز منطقه ای اطلاع رساني علوم و فناوري - Non-steady state speech analysis method with dynamic feature enhancing effect

DocumentCode :

3054538

Title :

Non-steady state speech analysis method with dynamic feature enhancing effect

Author :

Nakajima, Takayuki ; Suzuki, Torazo ; Ohmura, Hiroshi

Author_Institution :

Electrotechnical Laboratory, Ibaraki-Ken, Japan

Volume :

fYear :

1982

fDate :

30072

Firstpage :

1299

Lastpage :

1302

Abstract :

Extraction of speaker independent features, which make the separation of /b/ /d/ /g/ or /p/ /t/ /k/ possible, is one of the most difficult problem in automatic speech recognition. The authors propose a new speech analysis method to handle these sounds characterized by high speed articulations. The method is based on an autoregressive model with linearly time variant parameters in the analysis window. Recursive method, which is achieved by solving simultaneous linear equations with same number of parameters as in LPC, is proposed assuming the framewise continuation of each parameter. An articulatory dynamic feature enhancing effect is created by the introduction of vocal tract reflection coefficients and the enhancement of the vocal tract (acoustic tube) shape change between adjacent frames. In experiments on Japanese CV-syllables, where C expresses stops, comparisons have been made between the proposed method and LPC-based method, and widely different results were obtained for the transient parts, especially in labio-dental sounds such as /d/.

Keywords :

Acoustic reflection; Automatic speech recognition; Cepstral analysis; Equations; Feature extraction; Linear predictive coding; Loudspeakers; Shape; Speech analysis; Speech processing;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '82.

Type :

conf

DOI :

10.1109/ICASSP.1982.1171639

Filename :

1171639

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3054538