• DocumentCode
    3697417
  • Title

    Chime-home: A dataset for sound source recognition in a domestic environment

  • Author

    Peter Foster;Siddharth Sigtia;Sacha Krstulovic;Jon Barker;Mark D. Plumbley

  • Author_Institution
    School of Electronic Engineering and Computer Science, Queen Mary University of London, UK
  • fYear
    2015
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    For the task of sound source recognition, we introduce a novel data set based on 6.8 hours of domestic environment audio recordings. We describe our approach of obtaining annotations for the recordings. Further, we quantify agreement between obtained annotations. Finally, we report baseline results for sound source recognition using the obtained dataset. Our annotation approach associates each 4-second excerpt from the audio recordings with multiple labels, based on a set of 7 labels associated with sound sources in the acoustic environment. With the aid of 3 human annotators, we obtain 3 sets of multi-label annotations, for 4378 4-second audio excerpts. We evaluate agreement between annotators by computing Jaccard indices between sets of label assignments. Observing varying levels of agreement across labels, with a view to obtaining a representation of ‘ground truth’ in annotations, we refine our dataset to obtain a set of multi-label annotations for 1946 audio excerpts. For the set of 1946 annotated audio excerpts, we predict binary label assignments using Gaussian mixture models estimated on MFCCs. Evaluated using the area under receiver operating characteristic curves, across considered labels we observe performance scores in the range 0.76 to 0.98. Dataset URL: http://archive.org/details/chime-home
  • Keywords
    "Acoustics","Speech","Speech processing","Audio recording","Conferences","Speech recognition"
  • Publisher
    ieee
  • Conference_Titel
    Applications of Signal Processing to Audio and Acoustics (WASPAA), 2015 IEEE Workshop on
  • Type

    conf

  • DOI
    10.1109/WASPAA.2015.7336899
  • Filename
    7336899