DocumentCode :
3629696
Title :
Neural network acoustic model with decision tree clustered triphones
Author :
Tomas Pavelka;Pavel Kral
Author_Institution :
Dept. of Computer Science and Engineering, University of West Bohemia, Plzen, Czech Republic
fYear :
2008
Firstpage :
216
Lastpage :
220
Abstract :
This article tries to compare the performance of neural network and Gaussian mixture acoustic models (GMMs). We argue that using a multi layer perceptron as an emission probability estimator in hidden Markov model based automatic speech recognition can lead to better results than when the more traditional Gaussian mixtures are applied. We present a solution on how to model triphone phonetic units with neural networks and we show that this also leads to better performance in comparison with GMMs. The superior performance of the neural networks comes at a cost of extremely long training times.
Keywords :
"Neural networks","Decision trees","Hidden Markov models","Automatic speech recognition","Scheduling","Testing","Training data","Computer science","Acoustical engineering","Acoustic emission"
Publisher :
ieee
Conference_Titel :
Machine Learning for Signal Processing, 2008. MLSP 2008. IEEE Workshop on
ISSN :
1551-2541
Print_ISBN :
978-1-4244-2375-0
Electronic_ISBN :
2378-928X
Type :
conf
DOI :
10.1109/MLSP.2008.4685482
Filename :
4685482
Link To Document :
بازگشت