DocumentCode :
2324001
Title :
Content-Based Search in Multilingual Audiovisual Documents Using the International Phonetic Alphabet
Author :
Quénot, Georges ; Tan, Tien Ping ; Le Viet Bac ; Ayache, Stéphane ; Besacier, Laurent ; Mulhem, Philippe
Author_Institution :
Lab. d´´Inf. de Grenoble, Grenoble
fYear :
2009
fDate :
3-5 June 2009
Firstpage :
150
Lastpage :
155
Abstract :
We present in this paper an approach based on the use of the International Phonetic Alphabet (IPA) for content-based indexing and retrieval of multilingual audiovisual documents. The approach works even if the languages of the document are unknown. It has been validated in the context of the "Star Challenge" search engine competition organized by the Agency for Science, Technology and Research (A*STAR) of Singapore. Our approach includes the building of an IPA-based multilingual acoustic model and a dynamic programming based method for searching document segments by "IPA string spotting". Dynamic programming allows for retrieving the query string in the document string even with a significant transcription error rate at the phone level. The methods that we developed ranked us as first and third on the monolingual (English) search task, as fifth on the multilingual search task and as first on the multimodal (audio and image) search task.
Keywords :
audio databases; content-based retrieval; database indexing; document handling; dynamic programming; natural language processing; visual databases; English search task; IPA string spotting; content-based indexing; content-based retrieval; content-based search; document segments; document string; dynamic programming based method; international phonetic alphabet; multilingual acoustic model; multilingual audiovisual documents; multilingual search task; multimodal search task; query string; transcription error rate; Acoustic testing; Content based retrieval; Context modeling; Dynamic programming; Error analysis; Indexing; NIST; Natural languages; Search engines; Timing; Audiovisual; Content-Based Search; International Phonetic Alphabet; Multilingual;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Content-Based Multimedia Indexing, 2009. CBMI '09. Seventh International Workshop on
Conference_Location :
Chania
Print_ISBN :
978-1-4244-4265-2
Electronic_ISBN :
978-0-7695-3662-0
Type :
conf
DOI :
10.1109/CBMI.2009.42
Filename :
5137833
Link To Document :
بازگشت