Title :
A system for speech driven information retrieval
Author :
González-Ferreras, César ; Cardeñoso-Payo, Valentín
Author_Institution :
Univ. de Valladolid, Valladolid
Abstract :
In this paper we present a system that allows users to search for information in a document collection using a spoken query. The system combines a speech recognizer with an information retrieval engine and works for the Spanish language. We evaluated the system on the CLEF'01 test set, extended to include spoken queries. To reduce the out-of-vocabulary (OOV) word problem, we propose an adaptation of the vocabulary and the language model. To reduce errors caused by words in a foreign language, we expanded our pronunciation lexicon to include the pronunciation of English words. Experiments showed a relative gain in retrieval precision of 6.34%, a relative reduction in OOV word rate of 24.71%, and a relative reduction in word error rate (WER) of 10.87%.
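The three reported figures are relative changes in standard metrics. As a minimal sketch of how such metrics are usually computed (not the authors' evaluation code; the function names and the toy inputs are assumptions made for illustration, and no values from the paper are reproduced):

```python
def word_error_rate(reference, hypothesis):
    """Word-level edit distance divided by the number of reference words."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1] / len(ref)


def oov_rate(query_words, vocabulary):
    """Fraction of query word tokens absent from the recognizer vocabulary."""
    if not query_words:
        raise ValueError("query_words must not be empty")
    missing = sum(1 for w in query_words if w not in vocabulary)
    return missing / len(query_words)


def relative_reduction(baseline, improved):
    """Relative reduction of an error-type metric (lower is better)."""
    return (baseline - improved) / baseline


def relative_gain(baseline, improved):
    """Relative gain of a quality metric such as precision (higher is better)."""
    return (improved - baseline) / baseline


# Toy inputs, invented for illustration only.
wer = word_error_rate("elecciones en estados unidos",
                      "elecciones en estado unidos")
oov = oov_rate(["elecciones", "en", "internet", "hoy"],
               {"elecciones", "en", "hoy"})
```

In the toy example one of four reference words is substituted and one of four query words is missing from the vocabulary, so both `wer` and `oov` equal 0.25. A relative reduction compares a baseline and an adapted system on the same metric, so it is independent of the absolute level of the baseline.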
Keywords :
information retrieval systems; natural languages; speech recognition; Spanish language; information retrieval engine; language model; speech driven information retrieval; speech recognizer; spoken query; vocabulary model; Adaptation model; Audio recording; Information retrieval; Internet; Microcomputers; Search engines; System testing; Vocabulary; foreign words modeling; language model adaptation
Conference_Title :
IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU), 2007
Conference_Location :
Kyoto
Print_ISBN :
978-1-4244-1746-9
Electronic_ISBN :
978-1-4244-1746-9
DOI :
10.1109/ASRU.2007.4430184