Title :
Large-scaleword representation features for improved spoken language understanding
Author :
Jun Zhang ; Yang, Terry Zhenrong ; Hazen, Timothy J.
Author_Institution :
New England R&D Center, Microsoft Corp., Cambridge, MA, USA
Abstract :
Recently there has been great interest in the application of word representation techniques to various natural language processing (NLP) scenarios. Word representation features from techniques such as Brown clustering or spectral clustering are generally computed from large corpora of unlabeled data in a completely unsupervised manner. These features can then be directly included as supplementary features to standard representations used for NLP processing tasks. In this paper, we apply these techniques to the tasks of domain classification and intent detection in a spoken language understanding (SLU) system. In experiments in a personal assistant domain, features derived from both Brown clustering and spectral clustering techniques improved the performance of all models in our experiments and the combination of both techniques yielded additional improvements.
Keywords :
speech; Brown clustering; NLP; improved spoken language understanding; large-scale word representation; personal assistant domain; spectral clustering; Clustering algorithms; Computational modeling; Correlation; Feature extraction; Semantics; Support vector machines; Training; hierarchical clustering; spectral clustering; spoken language understanding; word representation;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
DOI :
10.1109/ICASSP.2015.7178984