DocumentCode
699768
Title
FPGA-based implementation of a real-time 5000-word continuous speech recognizer
Author
Young-kyu Choi ; Kisun You ; Wonyong Sung
Author_Institution
Sch. of Electr. Eng., Seoul Nat. Univ., Seoul, South Korea
fYear
2008
fDate
25-29 Aug. 2008
Firstpage
1
Lastpage
5
Abstract
We have developed a hidden Markov model based 5000-word speaker independent continuous speech recognizer using a Field-Programmable Gate Array (FPGA). The feature extraction is conducted in software on a soft-core based CPU, while the emission probability computation and the Viterbi beam search are implemented using parallel and pipelined hardware blocks. In order to reduce the bandwidth requirement to external DRAM, we employed bit-width reduction of the Gaussian parameters, multi-block computation of the emission probability, and two-stage language model pruning. These optimizations reduce the memory bandwidth requirement for emission probability computation and inter-word transition by 81% and 44%, respectively. The speech recognition hardware was synthesized for the Virtex-4 FPGA, and it operates at 100MHz. The experimental result on Wall Street Journal 5k vocabulary task shows that the developed system runs 1.52 times faster than real-time.
Keywords
DRAM chips; Gaussian processes; feature extraction; field programmable gate arrays; hidden Markov models; optimisation; probability; speech recognition; DRAM; Gaussian parameters; Virtex-4 FPGA; Viterbi beam search; Wall Street Journal vocabulary task; emission probability computation; feature extraction; field programmable gate array; hidden Markov model; interword transition; memory bandwidth; soft-core based CPU; speaker independent continuous speech recognizer; speech recognition hardware; two-stage language model; Bandwidth; Computational modeling; Computer architecture; Hidden Markov models; Random access memory; Speech; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2008 16th European
Conference_Location
Lausanne
ISSN
2219-5491
Type
conf
Filename
7080300
Link To Document