DocumentCode
384233
Title
A robust audio searching method for cellular-phone-based music information retrieval
Author
Kurozumi, Takayuki ; Kashino, Kunio ; Murase, Hiroshi
Author_Institution
NTT Commun. Sci. Labs., NTT Corp., Atsugi, Japan
Volume
3
fYear
2002
fDate
2002
Firstpage
991
Abstract
We propose a search method for detecting a query audio signal fragment in long audio recordings. The query signal is assumed to be captured in the real world by a portable terminal, such as a cellular phone. A major problem in this kind of search is that the features of the query sound may be distorted by terminal characteristics or environmental noise. The proposed method comprises local time-frequency-region normalization and robust subspace spanning. The former makes features invariant to additive noise and frequency characteristics, and the latter chooses frequency bands that minimize the effect of feature distortions. Experiments using cellular phones in the real world show that the proposed method is effective.
Keywords
audio databases; cellular radio; feature extraction; music; query processing; signal detection; cellular-phone-based music information retrieval; environment noise; feature distortions; local time-frequency-region normalization; long audio recordings; portable terminal; query audio signal fragment detection; robust audio searching method; robust subspace spanning; terminal characteristics; Additive noise; Cellular phones; Fluctuations; Frequency; Multiple signal classification; Music information retrieval; Noise robustness; Search methods; Spatial databases; Working environment noise;
fLanguage
English
Publisher
IEEE
Conference_Titel
Pattern Recognition, 2002. Proceedings. 16th International Conference on
ISSN
1051-4651
Print_ISBN
0-7695-1695-X
Type
conf
DOI
10.1109/ICPR.2002.1048204
Filename
1048204