Title :
On context-dependent neural networks and speaker adaptation
Author :
Zelinka, J. ; Trmal, Jan ; Muller, Lukas
Author_Institution :
Dept. of Cybern., Univ. of West Bohemia, Plzeň, Czech Republic
Abstract :
This paper describes the evaluation of a neural-network-based hybrid LVCSR system. The novelty of the evaluated hybrid system lies in the speaker adaptation techniques employed to increase the performance of neural networks that model context-dependent phonetic units. The performance comparison proceeds as follows. First, hybrid systems employing either a context-independent or a context-dependent neural network are compared. Second, the influence of the recently published speaker adaptation technique called MELT is evaluated. Furthermore, several possible approaches to converting posterior probabilities into observation likelihoods, a conversion that hybrid LVCSR systems require, are described and discussed.
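The abstract does not say which posterior-to-likelihood conversions the paper compares. As an illustration only, the sketch below shows the common prior-division approach used in hybrid systems: by Bayes' rule p(x|s) is proportional to P(s|x)/P(s), so dividing each network posterior by the state prior gives a scaled likelihood that a decoder can use. The function name, the `floor` parameter, and the example numbers are assumptions for this sketch and are not taken from the paper.

```python
import numpy as np


def posteriors_to_scaled_log_likelihoods(posteriors, priors, floor=1e-10):
    """Convert state posteriors P(s|x) into scaled log-likelihoods.

    posteriors: array of shape (frames, states), each row sums to 1.
    priors:     array of shape (states,), e.g. relative state frequencies
                in the training alignment (an assumption of this sketch).
    floor:      hypothetical lower bound that avoids log(0).

    Returns log P(s|x) - log P(s), which equals log p(x|s) up to a
    per-frame constant that does not affect decoding.
    """
    posteriors = np.asarray(posteriors, dtype=float)
    priors = np.asarray(priors, dtype=float)
    log_post = np.log(np.maximum(posteriors, floor))
    log_prior = np.log(np.maximum(priors, floor))
    # Broadcasting subtracts the prior vector from every frame.
    return log_post - log_prior


# Made-up example: 2 frames, 3 states.
example_posteriors = np.array([[0.7, 0.2, 0.1],
                               [0.1, 0.3, 0.6]])
example_priors = np.array([0.5, 0.3, 0.2])
scaled = posteriors_to_scaled_log_likelihoods(example_posteriors, example_priors)
```

Dividing by the prior removes the bias toward frequent states that the network learns from the training data; other conversions discussed in the paper may differ from this one.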
Keywords :
neural nets; speech processing; speech recognition; MELT; context-dependent neural network; context-dependent neural networks; context-dependent phonetic units modeling; evaluated hybrid system; hybrid LVCSR system-based neural network; observation likelihoods; posterior probabilities; speaker adaptation technique; speaker adaptation techniques;
Conference_Title :
Signal Processing (ICSP), 2012 IEEE 11th International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4673-2196-9
DOI :
10.1109/ICoSP.2012.6491538