DocumentCode :
1689387
Title :
Investigation into the use of deep neural networks for LVCSR of Czech
Author :
Mateju, Lukas ; Cerva, Petr ; Zdansky, Jindrich
Author_Institution :
Fac. of Mechatron., Inf. & Interdiscipl. Studies, Tech. Univ. of Liberec, Liberec, Czech Republic
fYear :
2015
Firstpage :
1
Lastpage :
4
Abstract :
This paper deals with utilization of deep neural networks (DNNs) for speech recognition. The main goal is to find out the best strategy for training and utilization of these models within an acoustic modeling module of a large vocabulary continuous speech recognition (LVCSR) system of Czech language. For this purpose, various DNNs are trained a) using several training strategies, b) with different inner structure and c) using various kinds of features. Experimental evaluation is then performed on a large dataset including broadcast recordings, recordings of lectures, dictates of judgments and set of nonlinearly distorted utterances. The resulting recipe for training of DNNs for our LVCSR system employs a) ReLU activation function with hidden layer width of 1024 neurons and b) filter-bank based features.
Keywords :
channel bank filters; feature extraction; neural nets; speech recognition; transfer functions; vocabulary; Czech language; DNN; LVCSR system; ReLU activation function; acoustic modeling module; broadcast recordings; deep neural networks; filter-bank based features; large vocabulary continuous speech recognition system; Biological neural networks; Feature extraction; Hidden Markov models; Neurons; Speech; Speech recognition; Training; acoustic modeling; deep neural networks; feature extraction; speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electronics, Control, Measurement, Signals and their Application to Mechatronics (ECMSM), 2015 IEEE International Workshop of
Conference_Location :
Liberec
Print_ISBN :
978-1-4799-6970-8
Type :
conf
DOI :
10.1109/ECMSM.2015.7208708
Filename :
7208708
Link To Document :
بازگشت