DocumentCode :
1439185
Title :
Flexible speech understanding based on combined key-phrase detection and verification
Author :
Kawahara, Tatsuya ; Lee, Chin-Hui ; Juang, Biing-hwang
Author_Institution :
Sch. of Inf., Kyoto Univ., Japan
Volume :
6
Issue :
6
fYear :
1998
fDate :
11/1/1998 12:00:00 AM
Firstpage :
558
Lastpage :
568
Abstract :
We propose a novel speech understanding strategy based on combined detection and verification of semantically tagged key-phrases in spontaneous spoken utterances. Key-phrases are defined in a top-down manner so as to constitute semantic slots. Their detection directly leads to robust understanding. A phrase network realizes both a wide coverage and a reasonable constraint for detection. A subword-based verifier is then incorporated to reduce false alarms in detection and attach confidence measures of the detected phrases. This set of phrase confidence measures, when incorporated in a spoken dialogue system, forms a basis for designing intelligent speech interfaces that accept only verified key-phrases and reprompt users to clarify unspecified or unrecognized portions. Several forms of confidence measures based on subword-level tests are investigated. The proposed approach was tested on field data collected from real-world trial applications. The combined detection and verification strategy drastically improves the accuracy in handling out-of-grammar utterances over the conventional decoding approaches while maintaining the performance for in-grammar utterances
Keywords :
grammars; natural language interfaces; speech recognition; continuous speech recognition; decoding; false alarm reduction; field data; in-grammar utterances; key-phrase detection; key-phrase verification; out-of-grammar utterances; phrase confidence measures; phrase network; real-world trial applications; robust understanding; semantic slots; semantically tagged key-phrases; speech understanding; spoken language systems; spontaneous spoken utterances; subword-based verifier; subword-level tests; Automata; Data mining; Decoding; Filling; Informatics; Information retrieval; Management information systems; Robustness; Speech recognition; Testing;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.725322
Filename :
725322
Link To Document :
بازگشت