DocumentCode
417190
Title
Speech-activated text retrieval system for multimodal cellular phones
Author
Ishikawa, Shin-ya ; Ikeda, Takahiro ; Miki, Kiyokazu ; Adachi, Fumihiro ; Isotani, Ryosuke ; Iso, Ken-ichi ; Okumura, Akitoshi
Author_Institution
Multimedia Res. Labs., NEC Corp., Japan
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
The paper describes an on-line manual page retrieval system activated by spoken queries for multimodal cellular phones. The system recognizes a user´s naturally spoken queries by telephone LVCSR and searches an on-line manual with a retrieval module on a server. The user can view the retrieved data on the screen of the phone via Web access. The LVCSR module consists of a telephone acoustic model and an n-gram language model derived from a task query corpus. The adaptation method using the target manual is also presented. The retrieval module utilizes pairs of words with dependency relations and also distinguishes affirmative and negative expressions to improve precision. The proposed system gives 82.6% keyword recognition accuracy and 77.5% task achievement rate. The field trial of the system is now underway.
Keywords
Internet; cellular radio; natural language interfaces; query processing; speech recognition; speech-based user interfaces; Web access; large vocabulary continuous speech recognition; multimodal cellular phones; n-gram language model; naturally spoken queries; on-line manual; on-line manual page retrieval; retrieval module; speech-activated text retrieval; task query corpus; telephone LVCSR; telephone acoustic model; Cellular phones; Home appliances; ISO; Information retrieval; Laboratories; Manuals; Multimedia systems; Speech recognition; Speech synthesis; Telephony;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1326020
Filename
1326020
Link To Document