DocumentCode
1161211
Title
A high-speed, low-resource ASR back-end based on custom arithmetic
Author
Li, Xiao ; Malkin, Jonathan ; Bilmes, Jeff A.
Author_Institution
Dept. of Electr. Eng., Univ. of Washington, Seattle, WA
Volume
14
Issue
5
fYear
2006
Firstpage
1683
Lastpage
1693
Abstract
With the skyrocketing popularity of mobile devices, new processing methods tailored to a specific application have become necessary for low-resource systems. This work presents a high-speed, low-resource speech recognition system using custom arithmetic units, where all system variables are represented by integer indices and all arithmetic operations are replaced by hardware-based table lookups. To this end, several reordering and rescaling techniques, including two accumulation structures for Gaussian evaluation and a novel method for the normalization of Viterbi search scores, are proposed to ensure low entropy for all variables. Furthermore, a discriminatively inspired distortion measure is investigated for scalar quantization of forward probabilities to maximize the recognition rate. Finally, heuristic algorithms are explored to optimize system-wide resource allocation. Our best bit-width allocation scheme only requires 59 kB of ROMs to hold the lookup tables, and its recognition performance with various vocabulary sizes in both clean and noisy conditions is nearly as good as that of a system using a 32-bit floating-point unit. Simulations on various architectures show that, on most modern processor designs, we can expect a cycle-count speedup of at least three times over systems with floating-point units. Additionally, the memory bandwidth is reduced by over 70% and the offline storage for model parameters is reduced by 80%
Keywords
entropy; probability; resource allocation; speech recognition; table lookup; ASR; Gaussian evaluation; Viterbi search scores; accumulation structures; automatic speech recognition; bitwidth allocation; custom arithmetic; entropy; floating-point unit; hardware-based table lookups; heuristic algorithms; mobile devices; reordering techniques; rescaling techniques; resource allocation; scalar quantization; speech recognition system; Arithmetic; Automatic speech recognition; Distortion measurement; Entropy; Heuristic algorithms; Quantization; Resource management; Speech recognition; Table lookup; Viterbi algorithm; Alpha recursion; bit-width allocation; custom arithmetic; discriminative distortion measure; forward probability normalization and scaling; high speed; low resource; normalization; quantization; speech recognition;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TSA.2005.858556
Filename
1677988
Link To Document