DocumentCode
1371815
Title
Content-based audio classification and retrieval using the nearest feature line method
Author
Li, Stan Z.
Author_Institution
Microsoft Res., Beijing, China
Volume
8
Issue
5
fYear
2000
fDate
9/1/2000 12:00:00 AM
Firstpage
619
Lastpage
625
Abstract
A method is presented for content-based audio classification and retrieval. It is based on a new pattern classification method called the nearest feature line (NFL). In the NFL, information provided by multiple prototypes per class is explored. This contrasts to the nearest neighbor (NN) classification in which the query is compared to each prototype individually. Regarding audio representation, perceptual and cepstral features and their combinations are considered. Extensive experiments are performed to compare various classification methods and feature sets. The results show that the NFL-based method produces consistently better results than the NN-based and other methods. A system resulting from this work has achieved the error rate of 9.78%, as compared to that of 18.34% of a compelling existing system, as tested on a common audio database
Keywords
audio signal processing; content-based retrieval; database management systems; music; pattern classification; signal classification; NFL; audio representation; cepstral features; content-based audio classification; multiple prototypes; nearest feature line method; perceptual feature; retrieval; Application software; Audio databases; Cepstral analysis; Content based retrieval; Multimedia databases; Nearest neighbor searches; Neural networks; Pattern recognition; Prototypes; Roentgenium;
fLanguage
English
Journal_Title
Speech and Audio Processing, IEEE Transactions on
Publisher
ieee
ISSN
1063-6676
Type
jour
DOI
10.1109/89.861383
Filename
861383
Link To Document