Title :
Learning pronunciation dictionary from speech data
Author :
Westendorf, Christian M. ; Jelitto, Jens
Author_Institution :
Inst. fur Tech. Acoust., Tech. Univ. Dresden, Germany
Abstract :
An algorithm and first results from investigations in automatically learning pronunciation variations from speech data are presented. Pronunciation dictionaries establish an important feature in state-of-the-art speech recognition systems. In most systems only simple dictionaries containing the canonical pronunciation forms are implemented. However, for a good recognition performance more sophisticated dictionaries including pronunciation variations are essential. The generation of such dictionaries by hand is an extremely time consuming task, and the introduction of errors and inconsistencies is probable. The authors show an approach for automatically generating suitable pronunciation dictionaries from the speech database itself, as they are desirable not only for speech recognition tasks but also for speech technology and phonologic research in general. The only knowledge sources besides the database are the (unlabeled) signals and their transliterations on word level. First experiments yielding promising results have been performed with the software system DataLab, which integrates the recognition system of the TU Dresden
Keywords :
glossaries; learning (artificial intelligence); speech processing; speech recognition; DataLab software system; algorithm; automatic pronunciation variation learning; canonical pronunciation forms; phonologic research; pronunciation dictionary learning; signals; speech data; speech database; speech recognition systems; speech technology; word level transliterations; Acceleration; Acoustics; Dictionaries; Error correction; Iterative algorithms; Software systems; Speech analysis; Speech processing; Speech recognition;
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.607784