• DocumentCode
    353700
  • Title
    A unified context-free grammar and n-gram model for spoken language processing
  • Author
    Wang, Ye-Yi; Mahajan, Milind; Huang, Xuedong
  • Author_Institution
    Speech Technol. Group, Microsoft Corp., Redmond, WA, USA
  • Volume
    3
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    1639
  • Abstract
    While context-free grammars (CFGs) remain one of the most important formalisms for interpreting natural language, word n-gram models are surprisingly powerful for domain-independent applications. We propose to unify these two formalisms for both speech recognition and spoken language understanding (SLU). With portability as the major problem, we incorporated domain-specific CFGs into a domain-independent n-gram model, which can improve the generalizability of the CFG and the specificity of the n-gram. In our experiments, the unified model significantly reduces the test set perplexity from 378 to 90 in comparison with a domain-independent word trigram. The unified model converges well when domain-specific data becomes available. The perplexity can be further reduced from 90 to 65 with a limited amount of domain-specific data. While we have demonstrated excellent portability, the full potential of our approach lies in its unified recognition and understanding, which we are investigating.
  • Keywords
    context-free grammars; natural languages; nomograms; speech processing; speech recognition; convergence; domain-independent applications; domain-independent n-gram model; domain-independent word trigram; domain-specific context-free grammars; generalizability; natural language interpretation; portability; specificity; speech recognition; spoken language processing; spoken language understanding; test set perplexity; unified model; word n-gram models; Context modeling; Decoding; Equations; Natural languages; Predictive models; Signal generators; Speech processing; Speech recognition; Testing; Unified modeling language
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-6293-4
  • Type
    conf
  • DOI
    10.1109/ICASSP.2000.862062
  • Filename
    862062