• DocumentCode
    353712
  • Title

    Data-driven generation of pronunciation dictionaries in the German Verbmobil project: discussion of experimental results

  • Author

    Eichner, Matthias ; Wolff, Matthias

  • Author_Institution
    Lab. of Acoustics & Speech Commun., Tech. Univ. Dresden, Germany
  • Volume
    3
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    1687
  • Abstract
    In the framework of the German Verbmobil project we developed a procedure for the automatic, data-driven generation of pronunciation dictionaries for speech recognition systems. In most recognizers, only simple dictionaries containing the canonical pronunciation form are used. They represent the correct pronunciation, but in most cases the canonical pronunciation does not match the actual realization of the word. To solve this problem we chose an approach to derive pronunciation variants automatically from a speech database. The training algorithm is based on a canonical dictionary which is compiled into a graph representation in a first stage. Pronunciation variants are then learned from a training sample consisting of speech signal and its orthographic transcription. The authors focus on the experimental results obtained in the Verbmobil framework and introduce methods to evaluate pronunciation dictionaries generated by the training procedure
  • Keywords
    dictionaries; graph theory; multimedia databases; natural languages; speech recognition; German Verbmobil project; canonical dictionary; canonical pronunciation form; data-driven generation; graph representation; orthographic transcription; pronunciation dictionaries; pronunciation variants; speech database; speech recognition systems; speech signal; training algorithm; training procedure; training sample; Acoustics; Counting circuits; Databases; Dictionaries; Educational products; Graphics; Laboratories; Lattices; Oral communication; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-6293-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2000.862075
  • Filename
    862075