DocumentCode :
3485876
Title :
Exploiting distance based similarity in topic models for user intent detection
Author :
Celikyilmaz, Asli ; Hakkani-Tur, Dilek ; Tur, Gokhan ; Fidler, Ashley ; Hillard, Dustin
Author_Institution :
Microsoft Speech Labs., Mountain View, CA, USA
fYear :
2011
fDate :
11-15 Dec. 2011
Firstpage :
425
Lastpage :
430
Abstract :
One of the main components of spoken language understanding is intent detection, which allows user goals to be identified. A challenging sub-task of intent detection is the identification of intent bearing phrases from a limited amount of training data, while maintaining the ability to generalize well. We present a new probabilistic topic model for jointly identifying semantic intents and common phrases in spoken language utterances. Our model jointly learns a set of intent dependent phrases and captures semantic intent clusters as distributions over these phrases based on a distance dependent sampling method. This sampling method uses proximity of words utterances when assigning words to latent topics. We evaluate our method on labeled utterances and present several examples of discovered semantic units. We demonstrate that our model outperforms standard topic models based on bag-of-words assumption.
Keywords :
probability; speech recognition; SLU; bag-of-words assumption; distance dependent sampling method; jointly identifying semantic intents; probabilistic topic model; speech signals; spoken language understanding; spoken language utterances; user intent detection; Clustering algorithms; Lattices; Motion pictures; Sampling methods; Semantics; Testing; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
Conference_Location :
Waikoloa, HI
Print_ISBN :
978-1-4673-0365-1
Electronic_ISBN :
978-1-4673-0366-8
Type :
conf
DOI :
10.1109/ASRU.2011.6163969
Filename :
6163969
Link To Document :
بازگشت