DocumentCode :
1226772
Title :
Natural language spoken interface control using data-driven semantic inference
Author :
Bellegarda, Jerome R. ; Silverman, Kim E A
Author_Institution :
Spoken Language Group, Apple Comput. Inc., Cupertino, CA, USA
Volume :
11
Issue :
3
fYear :
2003
fDate :
5/1/2003 12:00:00 AM
Firstpage :
267
Lastpage :
277
Abstract :
Spoken interaction tasks are typically approached using a formal grammar as language model. While ensuring good system performance, this imposes a rigid framework on users, by implicitly forcing them to conform to a pre-defined interaction structure. This paper introduces the concept of data-driven semantic inference, which in principle allows for any word constructs in command/query formulation. Each unconstrained word string is automatically mapped onto the intended action through a semantic classification against the set of supported actions. As a result, it is no longer necessary for users to memorize the exact syntax of every command. The underlying (latent semantic analysis) framework relies on co-occurrences between words and commands, as observed in a training corpus. A suitable extension can also handle commands that are ambiguous at the word level. The behavior of semantic inference is characterized using a desktop user interface control task involving 113 different actions. Under realistic usage conditions, this approach exhibits a 2 to 5% classification error rate. Various training scenarios of increasing scope are considered to assess the influence of coverage on performance. Sufficient semantic knowledge about the task domain is found to be captured at a level of coverage as low as 70%. This illustrates the good generalization properties of semantic inference.
Keywords :
grammars; inference mechanisms; natural language interfaces; query processing; signal classification; speech recognition; automatic speech recognition; classification error rate; co-occurrences; command/query formulation; coverage; data-driven semantic inference; desktop user interface control; formal grammar; language model; latent semantic analysis; natural language spoken interface control; semantic classification; semantic inference; semantic knowledge; spoken interaction tasks; system performance; training; training corpus; unconstrained word string; Acoustic applications; Automatic speech recognition; Command and control systems; Communication system control; Error analysis; Natural languages; Speech analysis; Speech recognition; System performance; User interfaces;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/TSA.2003.811534
Filename :
1208295
Link To Document :
بازگشت