Title :
Improved clustered hierarchical tandem system with bottom-up processing
Author :
Chang, Shuo-Yiin ; Lee, Lin-shan
Author_Institution :
Grad. Inst. of Commun. Eng., Nat. Taiwan Univ., Taipei
Abstract :
The outputs of multi-layer perceptron (MLP) classifiers have been successfully used in tandem systems as features for HMM-based automatic speech recognition. In a previous paper, we proposed data-driven clustered hierarchical MLP (CHMLP) tandem system yielding improved performance by dividing the complicated global phone classification problem into simpler hierarchical tasks, in which specialized MLPs are trained to classify small clusters of confusing phones in a hierarchical structure. In this paper a bottom-up processing is further proposed to enhance the classification in the above CHMLP and offer even better performance. MLP rescoring for the tandem system is also investigated. The best result achieved 19.1% relative error reduction over the MFCC baseline.
Keywords :
hidden Markov models; multilayer perceptrons; speech recognition; HMM-based automatic speech recognition; bottom-up processing; data-driven clustered hierarchical MLP tandem system; global phone classification problem; improved clustered hierarchical tandem system; multi-layer perceptron classifiers; Artificial neural networks; Automatic speech recognition; Clustering algorithms; Feature extraction; Hidden Markov models; Lattices; Mel frequency cepstral coefficient; Multilayer perceptrons; Neural networks; LVCSR; Neural Network; Tandem system;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2009.4960615