DocumentCode :
323583
Title :
ACID/HNN: clustering hierarchies of neural networks for context-dependent connectionist acoustic modeling
Author :
Fritsch, Jürgen ; Fïnke, Michael
Author_Institution :
Interactive Syst. Labs., Karlsruhe Univ., Germany
Volume :
1
fYear :
1998
fDate :
12-15 May 1998
Firstpage :
505
Abstract :
We present the ACID/HNN framework, a principled approach to hierarchical connectionist acoustic modeling in large vocabulary conversational speech recognition (LVCSR). Our approach consists of an agglomerative clustering algorithm based on information divergence (ACID) to automatically design and robustly estimate hierarchies of neural networks (HNN) for arbitrarily large sets of context-dependent decision tree clustered HMM states. We argue that a hierarchical approach is crucial in applying locally discriminative connectionist models to the typically very large state spaces observed in LVCSR systems. We evaluate the ACID/HNN framework on the Switchboard conversational telephone speech corpus. Furthermore, we focus on the benefits of the proposed connectionist acoustic model, namely exploiting the hierarchical structure for speaker adaptation and decoding speed-up algorithms
Keywords :
decoding; estimation theory; neural nets; speech recognition; ACID; ACID/HNN; HNN; LVCSR; Switchboard conversational telephone speech corpus; agglomerative clustering algorithm based on information divergence; clustering hierarchies; context-dependent connectionist acoustic modeling; context-dependent decision tree clustered HMM states; hierarchical connectionist acoustic modeling; hierarchical structure; large vocabulary conversational speech recognition; locally discriminative connectionist models; neural networks; speaker adaptation; speed-up algorithms decoding; Algorithm design and analysis; Clustering algorithms; Decision trees; Hidden Markov models; Neural networks; Robustness; Speech recognition; State estimation; State-space methods; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
ISSN :
1520-6149
Print_ISBN :
0-7803-4428-6
Type :
conf
DOI :
10.1109/ICASSP.1998.674478
Filename :
674478
Link To Document :
بازگشت