• DocumentCode
    3163627
  • Title
    A layered approach for Dutch large vocabulary continuous speech recognition
  • Author
    Pelemans, Joris ; Demuynck, Kris ; Wambacq, Patrick
  • Author_Institution
    Dept. ESAT, Katholieke Univ. Leuven, Leuven, Belgium
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    4421
  • Lastpage
    4424
  • Abstract
    In this paper we investigate whether a layered architecture that has already proven its value for small tasks also works for a system with large lexica (400k words) and language models (5-grams). The architecture was designed to decouple phone and word recognition, which allows for the integration of more complex linguistic components, especially at the sub-word level. It was tested on the Dutch language, which, with its large variety of accents and rich morphology, is ideally suited to benefit from this integration. The results reveal that the architecture is already competitive with an all-in-one approach in which acoustic models, language models and lexicon are all applied simultaneously. Candidates for further improvement to the system based on a conditional phone confusion model are suggested.
  • Keywords
    speech recognition; accents; acoustic models; decouple phone; Dutch large vocabulary continuous speech recognition; language models; lexicon; phone confusion model; rich morphology; word recognition; Acoustics; Context; Context modeling; Decoding; Hidden Markov models; Lattices; Speech; ASR architecture; LVCSR; accented speech; phone confusion matrix; phone lattice decoding
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type
    conf
  • DOI
    10.1109/ICASSP.2012.6288900
  • Filename
    6288900