DocumentCode :
3346335
Title :
Fast two-level-dynamic-programming algorithm for speech recognition
Author :
Agbago, Akakpo ; Barriere, Caroline
Author_Institution :
Sch. of Inf. Technol. & Eng., Ottawa Univ., Ont., Canada
Volume :
5
fYear :
2004
fDate :
17-21 May 2004
Abstract :
A three-stage architecture for speech recognition is presented; it comprises preprocessing, phoneme recognition, and a natural language postprocessor. Within this context of phoneme-based utterance recognition, the paper focuses on the often problematic speed of the second stage and reengineers a standard two-level dynamic programming (TLDP) approach to achieve a 75% increase in speed. Our fast two-level dynamic programming (FTLDP) algorithm uses a phoneme clustering technique to reduce the reference search space and silence detection to shorten the utterance to be recognized. An overview of the FTLDP algorithm is presented, along with some results.
Keywords :
dynamic programming; natural languages; pattern clustering; speech enhancement; speech recognition; fast two-level dynamic programming algorithm; natural language postprocessor; phoneme clustering; phoneme recognition; preprocessing; reference search space; signal enhancement; silence detection; three-stage architecture; utterance recognition; Clustering algorithms; Natural languages; Speech recognition
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1327064
Filename :
1327064