DocumentCode
3485876
Title
Exploiting distance based similarity in topic models for user intent detection
Author
Celikyilmaz, Asli ; Hakkani-Tur, Dilek ; Tur, Gokhan ; Fidler, Ashley ; Hillard, Dustin
Author_Institution
Microsoft Speech Labs., Mountain View, CA, USA
fYear
2011
fDate
11-15 Dec. 2011
Firstpage
425
Lastpage
430
Abstract
One of the main components of spoken language understanding is intent detection, which allows user goals to be identified. A challenging sub-task of intent detection is the identification of intent bearing phrases from a limited amount of training data, while maintaining the ability to generalize well. We present a new probabilistic topic model for jointly identifying semantic intents and common phrases in spoken language utterances. Our model jointly learns a set of intent dependent phrases and captures semantic intent clusters as distributions over these phrases based on a distance dependent sampling method. This sampling method uses proximity of words utterances when assigning words to latent topics. We evaluate our method on labeled utterances and present several examples of discovered semantic units. We demonstrate that our model outperforms standard topic models based on bag-of-words assumption.
Keywords
probability; speech recognition; SLU; bag-of-words assumption; distance dependent sampling method; jointly identifying semantic intents; probabilistic topic model; speech signals; spoken language understanding; spoken language utterances; user intent detection; Clustering algorithms; Lattices; Motion pictures; Sampling methods; Semantics; Testing; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
Conference_Location
Waikoloa, HI
Print_ISBN
978-1-4673-0365-1
Electronic_ISBN
978-1-4673-0366-8
Type
conf
DOI
10.1109/ASRU.2011.6163969
Filename
6163969
Link To Document