DocumentCode :
1843149
Title :
On context-dependent neural networks and speaker adaptation
Author :
Zelinka, J. ; Trmal, Jan ; Muller, Lukas
Author_Institution :
Dept. of Cybern., Univ. of West Bohemia, Plzeň, Czech Republic
Volume :
1
fYear :
2012
fDate :
21-25 Oct. 2012
Firstpage :
515
Lastpage :
518
Abstract :
This paper describes evaluation of a neural network based hybrid LVCSR system. The novelty of the evaluated hybrid system lies in speaker adaptation techniques that are employed to increase performance of neural networks for context-dependent phonetic units modeling. The performance comparison is done as follows. First, performances of different hybrid systems employing either a context-independent neural network or a context-dependent neural network are compared. Second, the influence of the recently published speaker adaptation technique called MELT is evaluated. Furthermore, several possible approaches to conversion of posterior probabilities into observation likelihoods, which are necessary for a hybrid LVSCR systems, are described and discussed in this paper.
Keywords :
neural nets; speech processing; speech recognition; MELT; context-dependent neural network; context-dependent neural networks; context-dependent phonetic units modeling; evaluated hybrid system; hybrid LVCSR system-based neural network; observation likelihoods; posterior probabilities; speaker adaptation technique; speaker adaptation techniques;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing (ICSP), 2012 IEEE 11th International Conference on
Conference_Location :
Beijing
ISSN :
2164-5221
Print_ISBN :
978-1-4673-2196-9
Type :
conf
DOI :
10.1109/ICoSP.2012.6491538
Filename :
6491538
Link To Document :
بازگشت