DocumentCode :
2770393
Title :
Fast audio search using vector space modelling
Author :
Matthews, Brett ; Chaudhari, Upendra ; Ramabhadran, Bhuvana
Author_Institution :
IBM TJ Watson Res. Center, Yorktown Heights
fYear :
2007
fDate :
9-13 Dec. 2007
Firstpage :
641
Lastpage :
646
Abstract :
Many techniques for retrieving arbitrary content from audio have been developed to leverage the important challenge of providing fast access to very large volumes of multimedia data. We present a two-stage method for fast audio search, where a vector-space modelling approach is first used to retrieve a short list of candidate audio segments for a query. The list of candidate segments is then searched using a word-based index for known words and a phone-based index for out-of-vocabulary words. We explore various system configurations and examine trade-offs between speed and accuracy. We evaluate our audio search system according to the NIST 2006 Spoken Term Detection evaluation initiative. We find that we can obtain a 30-times speedup for the search phase of our system with a 10% relative loss in accuracy.
Keywords :
audio signal processing; content management; indexing; multimedia computing; query processing; speech recognition; audio search; audio segment retrieval; multimedia data access; phone-based index; spoken term detection; vector space modelling; word-based index; Audio databases; Automatic speech recognition; Content based retrieval; Data mining; Indexing; Information retrieval; Lattices; NIST; Search methods; Statistics; Spoken-term detection; audio search; latent semantic indexing; vector-space modelling;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4244-1746-9
Electronic_ISBN :
978-1-4244-1746-9
Type :
conf
DOI :
10.1109/ASRU.2007.4430187
Filename :
4430187
Link To Document :
بازگشت