DocumentCode
2770351
Title
A system for speech driven information retrieval
Author
González-Ferreras, César ; Cardeñoso-Payo, Valentín
Author_Institution
Univ. de Valladolid, Valladolid
fYear
2007
fDate
9-13 Dec. 2007
Firstpage
624
Lastpage
628
Abstract
In this paper we present a system that allows users to search information in a document collection using a spoken query. The system is based on a speech recognizer and on an information retrieval engine. The system works for Spanish language. We evaluated the system using CLEF´01 test set, extended to include spoken queries. We proposed an adaptation of vocabulary and language model, to reduce the out of vocabulary word problem. In order to reduce errors caused by words in a foreign language, we expanded our pronunciation lexicon to include the pronunciation of English words. Experiments showed a relative gain in retrieval precision of 6.34%, a relative reduction in OOV word rate of 24.71% and a relative reduction in WER of 10.87%.
Keywords
information retrieval systems; natural languages; speech recognition; Spanish language; information retrieval engine; language model; speech driven information retrieval; speech recognizer; spoken query; vocabulary model; Adaptation model; Audio recording; Information retrieval; Internet; Microcomputers; Natural languages; Search engines; Speech recognition; System testing; Vocabulary; foreign words modeling; information retrieval; language model adaptation; speech driven information retrieval; speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
Conference_Location
Kyoto
Print_ISBN
978-1-4244-1746-9
Electronic_ISBN
978-1-4244-1746-9
Type
conf
DOI
10.1109/ASRU.2007.4430184
Filename
4430184
Link To Document