DocumentCode :
3422547
Title :
Polyphase speech recognition
Author :
Lin, Hui ; Bilmes, Jeff
Author_Institution :
Dept. of Electr. Eng., Univ. of Washington, Seattle, WA
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
4109
Lastpage :
4112
Abstract :
We propose a model for speech recognition that consists of multiple semi-synchronized recognizers operating on a polyphase decomposition of standard speech features. Specifically, we consider multiple out-of-phase downsampled speech features as separate streams which are modeled separately at the lowest level, and are then integrated at the higher level (words) during first-pass decoding. Our model lessens the severity of the oversampling problem in many speech recognition systems - i.e., that speech modulation energy is most important below 25 Hz but a 100 Hz frame rate gives a modulation bandwidth of 50 Hz. Our polyphase approach moreover captures wider and more diverse dynamics within the speech signal. Our integrative network is high-level, namely it couples together and decodes word strings from different recognizers simultaneously and asynchronously. We provide preliminary results on the 10-word vocabulary version of the switchboard (small-vocabulary switchboard) task and show that our polyphase recognition system significantly outperforms an optimized baseline (HMM) approach.
Keywords :
decoding; speech recognition; downsampled speech features; first-pass decoding; out-of-phase speech features; polyphase decomposition; polyphase recognition system; semisynchronized recognizers; speech modulation energy; speech recognition; switchboard; Acoustics; Automatic speech recognition; Bandwidth; Bayesian methods; Decoding; Frequency modulation; Hidden Markov models; Power system modeling; Speech recognition; Streaming media; dynamic Bayesian network; polyphase speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4518558
Filename :
4518558
Link To Document :
بازگشت