DocumentCode
643340
Title
Pipelining Acoustic Model Training for Speech Recognition Using Storm
Author
Sitaram, Dinkar ; Srinivasaraghavan, Haripriya ; Agarwal, K. ; Agrawal, Ankit ; Joshi, Niranjan ; Ray, Debtanu
Author_Institution
Comput. Sci. Dept., PES Inst. of Technol., Bangalore, India
fYear
2013
fDate
24-25 Sept. 2013
Firstpage
219
Lastpage
224
Abstract
Speech recognition has been increasingly used on mobile devices, which has in turn increased the need for creation of new acoustic models for various languages, dialects, accents, speakers and environmental conditions. This involves training and adapting a huge number of acoustic models, some of them in real-time. Training acoustic models is thus essential for speech recognition because these models determine the accuracy and quality of the recognition process. This paper, discusses the use of Storm, a distributed real time computational system, to pipeline the creation of acoustic models by CMU Sphinx, an open-source software project for speech recognition and training. Software pipelining reduces the time required for training and optimizes system resource utilization, thus enabling huge amounts of data to be trained in considerably less amount of time than taken by the conventional sequential process. Pipelining is achieved by grouping the stages of the training process into a set of five stages, and running each stage on individual nodes in a Storm cluster. Thus acoustic models are created by training multiple streams of speech samples using the same SphinxTrain setup, also resulting in improvement of training time and throughput.
Keywords
mobile computing; pipeline processing; public domain software; real-time systems; resource allocation; speech recognition; CMU Sphinx; SphinxTrain setup; Storm cluster; acoustic models; distributed real time computational system; environmental conditions; mobile devices; open-source software project; pipelining acoustic model training; recognition process; resource utilization; sequential process; software pipelining; speaker conditions; speech recognition; speech sample streams; training acoustic models; Acoustics; Adaptation models; Hidden Markov models; Pipeline processing; Speech recognition; Storms; Training; CMU Sphinx; Parallel Computing; Pipelining; Speech; SphinxTrain; Storm;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence, Modelling and Simulation (CIMSim), 2013 Fifth International Conference on
Conference_Location
Seoul
Print_ISBN
978-1-4799-2308-3
Type
conf
DOI
10.1109/CIMSim.2013.42
Filename
6663188
Link To Document