Title :
Longer-length acoustic units for continuous speech recognition
Author :
Hämäläinen, Annika ; de Veth, Johan ; Boves, Lou
Author_Institution :
Dept. of Language & Speech, Radboud Univ. Nijmegen, Nijmegen, Netherlands
Abstract :
Recent research on the TIMIT database suggests that longer-length acoustic units are better suited for modelling pronunciation variation and long-term temporal dependencies in speech than traditional phoneme-length units, yielding substantial improvements in recognition accuracy [9]. In this paper, we investigate whether similar improvements can be gained on another database, viz. excerpts from novels in a Dutch library for the blind. We use a hierarchical method that employs a mixture of word-, syllable- and phoneme-length units. Our results show that the approach does increase the word accuracy, but to a lesser extent than expected. The paper discusses possible explanations for the finding.
Keywords :
speech recognition; Dutch library; TIMIT database; continuous speech recognition; long-term temporal dependencies; longer-length acoustic units; phoneme-length units; pronunciation variation; syllable-length units; word-length units; Accuracy; Acoustics; Context modeling; Data models; Speech; Speech recognition; Training
Conference_Title :
Signal Processing Conference, 2005 13th European
Conference_Location :
Antalya
Print_ISBN :
978-160-4238-21-1