DocumentCode
1749705
Title
Nonlinear dynamical system based acoustic modeling for ASR
Author
Warakagoda, Narada D. ; Johnsen, Magne H.
Author_Institution
Dept. of Telecommun., NTNU, Trondheim, Norway
Volume
1
fYear
2001
fDate
2001
Firstpage
525
Abstract
The work presented is centered around a speech production model called the chained dynamical system model (CDSM) which is motivated by the fundamental limitations of the mainstream ASR approaches. The CDSM is essentially a smoothly time varying continuous state nonlinear dynamical system, consisting of two sub dynamical systems coupled as a chain so that one system controls the parameters of the next system. The speech recognition problem is posed as inverting the CDSM, for which we propose a solution based on the theory of embedding. The resulting architecture, which we call inverted CDSM (ICDSM) is evaluated in a set of experiments involving a speaker independent, continuous speech recognition task on the TIMIT database. Results of these experiments which can be compared with the corresponding results in the literature, confirm the feasibility and advantages of the approach
Keywords
hidden Markov models; parameter estimation; pattern classification; speech recognition; TIMIT database; acoustic modeling; automatic speech recognition; chained dynamical system model; embedding theory; smoothly time varying continuous state nonlinear dynamical system; speaker independent continuous speech recognition; statistical pattern recognition; Automatic speech recognition; Control systems; Couplings; Databases; Nonlinear acoustics; Nonlinear control systems; Nonlinear dynamical systems; Production systems; Speech recognition; Time varying systems;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location
Salt Lake City, UT
ISSN
1520-6149
Print_ISBN
0-7803-7041-4
Type
conf
DOI
10.1109/ICASSP.2001.940883
Filename
940883
Link To Document