• DocumentCode
    2806637
  • Title

    Automatic audio tagging using covariate shift adaptation

  • Author

    Wichern, Gordon ; Yamada, Makoto ; Thornburg, Harvey ; Sugiyama, Masashi ; Spanias, Andreas

  • Author_Institution
    SenSIP Center, Arizona State Univ., Tempe, AZ, USA
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    253
  • Lastpage
    256
  • Abstract
    Automatically annotating or tagging unlabeled audio files has several applications, such as database organization and recommender systems. We are interested in the case where the system is trained using clean high-quality audio files, but most of the files that need to be automatically tagged during the test phase are heavily compressed and noisy, for instance if they were captured on a mobile device. In this situation we assume the audio files follow a covariate shift model in the acoustic feature space, i.e., the feature distributions are different in the training and test phases, but the conditional distribution of labels given features remains unchanged. Our method uses a specially designed audio similarity measure as input to a set of weighted logistic regressors, which attempt to alleviate the influence of covariate shift. Results on a freely available database of sound files contributed and labeled by non-expert users, demonstrate effective automatic tagging performance.
  • Keywords
    audio signal processing; covariance analysis; file organisation; identification technology; regression analysis; set theory; acoustic feature space; audio file; covariate shift adaptation; database organization; feature distribution; file annotation; file tagging; logistic regressor; recommender system; Acoustic devices; Acoustic measurements; Acoustic noise; Acoustic testing; Audio databases; Automatic testing; Phase noise; Recommender systems; System testing; Tagging; Acoustic signal analysis; Database query processing; Importance; KLIEP;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495973
  • Filename
    5495973