Title :
Feature fluctuation absorption for a quick audio retrieval from long recordings
Author :
Kashino, Kunio ; Kurozumi, Takayuki ; Murase, Hiroshi
Author_Institution :
NTT Commun. Sci. Labs., Kanagawa, Japan
Abstract :
Kashino et al. (1999) proposed a histogram-based quick signal search method called time-series active search (TAS). TAS has been effective only in the exact-matching case, where the segments to be detected are assumed to be exactly the same as the reference signal. Here, we extend the method so that it remains applicable even when the features fluctuate. In addition to feature modification, feature dithering is discussed as a means of absorbing feature fluctuations. An efficient time-scaled search is also investigated to cope with variations in the reference signal duration. Tests using broadcast recordings show that the extended method improves accuracy in nonexact-matching tasks such as hand-clap detection and word spotting in a single speaker's narration. The tests also show the speed-ups obtained from the pruning introduced in the time-scaled search.
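The abstract does not spell out how the histogram-based search works, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes that the features have already been quantized into integer symbols, that similarity is measured by histogram intersection, and that a window can be skipped ahead because shifting it by one frame raises the overlap count by at most one. The function name `active_search` and its parameters are hypothetical.

```python
from collections import Counter
import math


def active_search(ref, stream, theta):
    """Return start indices where a window of `stream` has a histogram
    intersection similarity of at least `theta` with `ref`.

    Assumptions (not taken from the abstract): `ref` and `stream` are
    sequences of quantized feature symbols, and similarity is the overlap
    count divided by the window length.
    """
    d = len(ref)                      # window length in frames
    ref_hist = Counter(ref)           # histogram of the reference signal
    need = math.ceil(theta * d)       # minimum overlap count for a match
    hits = []
    t = 0
    while t + d <= len(stream):
        win_hist = Counter(stream[t:t + d])
        # Histogram intersection: sum of bin-wise minima.
        overlap = sum(min(n, win_hist[s]) for s, n in ref_hist.items())
        if overlap >= need:
            hits.append(t)
            t += 1
        else:
            # A one-frame shift swaps one frame out and one in, so the
            # overlap can grow by at most 1; the next (need - overlap - 1)
            # positions cannot reach the threshold and are skipped.
            t += need - overlap
    return hits


if __name__ == "__main__":
    reference = [1, 2, 3, 4]
    recording = [0, 0, 0, 0, 0, 0, 1, 2, 3, 4, 0, 0, 0, 0]
    print(active_search(reference, recording, 1.0))
```

Because the comparison uses histograms, the order of frames inside a window is ignored, and lowering `theta` admits partially overlapping windows. The skip rule never passes over a window that would meet the threshold under these assumptions.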
Keywords :
audio signal processing; pattern recognition; time series; TAS; efficient time-scaled search; exact matching case; feature dithering; feature fluctuation absorption; feature modification; hand-clap detection; histogram-based quick signal search method; long recordings; nonexact-matching tasks; quick audio retrieval; single-speaker narration; time-scaled search; time-series active search; word spotting; Absorption; Audio recording; Broadcasting; Fluctuations; Hafnium; Histograms; Laboratories; Search methods; Signal detection; Testing;
Conference_Title :
Pattern Recognition, 2000. Proceedings. 15th International Conference on
Conference_Location :
Barcelona
Print_ISBN :
0-7695-0750-6
DOI :
10.1109/ICPR.2000.903494