• DocumentCode
    1371815
  • Title

    Content-based audio classification and retrieval using the nearest feature line method

  • Author

    Li, Stan Z.

  • Author_Institution
    Microsoft Res., Beijing, China
  • Volume
    8
  • Issue
    5
  • fYear
    2000
  • fDate
    9/1/2000 12:00:00 AM
  • Firstpage
    619
  • Lastpage
    625
  • Abstract
    A method is presented for content-based audio classification and retrieval. It is based on a new pattern classification method called the nearest feature line (NFL). In the NFL, information provided by multiple prototypes per class is explored. This contrasts to the nearest neighbor (NN) classification in which the query is compared to each prototype individually. Regarding audio representation, perceptual and cepstral features and their combinations are considered. Extensive experiments are performed to compare various classification methods and feature sets. The results show that the NFL-based method produces consistently better results than the NN-based and other methods. A system resulting from this work has achieved the error rate of 9.78%, as compared to that of 18.34% of a compelling existing system, as tested on a common audio database
  • Keywords
    audio signal processing; content-based retrieval; database management systems; music; pattern classification; signal classification; NFL; audio representation; cepstral features; content-based audio classification; multiple prototypes; nearest feature line method; perceptual feature; retrieval; Application software; Audio databases; Cepstral analysis; Content based retrieval; Multimedia databases; Nearest neighbor searches; Neural networks; Pattern recognition; Prototypes; Roentgenium;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/89.861383
  • Filename
    861383