Title :
Neural network acoustic model with decision tree clustered triphones
Author :
Tomas Pavelka;Pavel Kral
Author_Institution :
Dept. of Computer Science and Engineering, University of West Bohemia, Plzen, Czech Republic
Abstract :
This article tries to compare the performance of neural network and Gaussian mixture acoustic models (GMMs). We argue that using a multi layer perceptron as an emission probability estimator in hidden Markov model based automatic speech recognition can lead to better results than when the more traditional Gaussian mixtures are applied. We present a solution on how to model triphone phonetic units with neural networks and we show that this also leads to better performance in comparison with GMMs. The superior performance of the neural networks comes at a cost of extremely long training times.
Keywords :
"Neural networks","Decision trees","Hidden Markov models","Automatic speech recognition","Scheduling","Testing","Training data","Computer science","Acoustical engineering","Acoustic emission"
Conference_Titel :
Machine Learning for Signal Processing, 2008. MLSP 2008. IEEE Workshop on
Print_ISBN :
978-1-4244-2375-0
Electronic_ISBN :
2378-928X
DOI :
10.1109/MLSP.2008.4685482