DocumentCode
3529743
Title
Improved clustered hierarchical tandem system with bottom-up processing
Author
Chang, Shuo-Yiin ; Lee, Lin-shan
Author_Institution
Grad. Inst. of Commun. Eng., Nat. Taiwan Univ., Taipei
fYear
2009
fDate
19-24 April 2009
Firstpage
4441
Lastpage
4444
Abstract
The outputs of multi-layer perceptron (MLP) classifiers have been successfully used in tandem systems as features for HMM-based automatic speech recognition. In a previous paper, we proposed data-driven clustered hierarchical MLP (CHMLP) tandem system yielding improved performance by dividing the complicated global phone classification problem into simpler hierarchical tasks, in which specialized MLPs are trained to classify small clusters of confusing phones in a hierarchical structure. In this paper a bottom-up processing is further proposed to enhance the classification in the above CHMLP and offer even better performance. MLP rescoring for the tandem system is also investigated. The best result achieved 19.1% relative error reduction over the MFCC baseline.
Keywords
hidden Markov models; multilayer perceptrons; speech recognition; HMM-based automatic speech recognition; bottom-up processing; data-driven clustered hierarchical MLP tandem system; global phone classification problem; improved clustered hierarchical tandem system; multi-layer perceptron classifiers; Artificial neural networks; Automatic speech recognition; Clustering algorithms; Feature extraction; Hidden Markov models; Lattices; Mel frequency cepstral coefficient; Multilayer perceptrons; Neural networks; LVCSR; Neural Network; Tandem system;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location
Taipei
ISSN
1520-6149
Print_ISBN
978-1-4244-2353-8
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2009.4960615
Filename
4960615
Link To Document