DocumentCode
2123174
Title
Development of highly accurate real-time large scale speech recognition system
Author
Kim, I. ; Park, C. ; Lee, K. ; Kim, N. ; Lee, J. ; Kim, J. ; Lane, I.
Author_Institution
DMC R&D Center, Samsung Electron., Suwon, South Korea
fYear
2015
fDate
9-12 Jan. 2015
Firstpage
493
Lastpage
496
Abstract
This paper describes the development of the framework and the algorithm for large scale automatic speech recognition systems. Technical advances include the acceleration of decoding speed by leveraging the computational power of many-core graphic processing units (GPU), in order to solve the issue of training data sparseness, improvement in the accuracy by Subspace Gaussian Mixture Models (SGMM), and employing novel methods of language models such as the Instant Language Model Adaptation (ILMA) method. We present the effectiveness of each technique by evaluating it with actual usage data collected from television sets. It is shown that the proposed engine can recognize speech at real time with high accuracy.
Keywords
Gaussian processes; graphics processing units; mixture models; real-time systems; speech coding; speech recognition; GPU; ILMA method; SGMM; computational power; decoding speed; highly accurate real-time large scale speech recognition system; instant language model adaptation; many-core graphic processing units; subspace Gaussian mixture model; television sets; training data sparseness; Accuracy; Acoustics; Adaptation models; Data models; Decoding; Graphics processing units; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Consumer Electronics (ICCE), 2015 IEEE International Conference on
Conference_Location
Las Vegas, NV
Print_ISBN
978-1-4799-7542-6
Type
conf
DOI
10.1109/ICCE.2015.7066496
Filename
7066496
Link To Document