Title :
The IBM Attila speech recognition toolkit
Author :
Soltau, Hagen ; Saon, George ; Kingsbury, Brian
Author_Institution :
T.J. Watson Res. Center, IBM, Yorktown Heights, NY, USA
Abstract :
We describe the design of IBM's Attila speech recognition toolkit. We show how the combination of a highly modular and efficient library of low-level C++ classes with simple interfaces, an interconnection layer implemented in a modern scripting language (Python), and a standardized collection of scripts for system-building produces a flexible and scalable toolkit that is useful both for basic research and for construction of large transcription systems for competitive evaluations.
Keywords :
C++ language; speech recognition; IBM Attila speech recognition toolkit; interconnection layer; low-level C++ classes; scripting language
Conference_Title :
Spoken Language Technology Workshop (SLT), 2010 IEEE
Conference_Location :
Berkeley, CA
Print_ISBN :
978-1-4244-7904-7
Electronic_ISBN :
978-1-4244-7902-3
DOI :
10.1109/SLT.2010.5700829