• DocumentCode
    730368
  • Title

    Visual and acoustic identification of bird species

  • Author

    Marini, A. ; Turatti, A.J. ; Britto, A.S. ; Koerich, A.L.

  • Author_Institution
    Postgrad. Program in Inf., Pontifical Catholic Univ. of Parana, Curitiba, Brazil
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    2309
  • Lastpage
    2313
  • Abstract
    This paper presents a novel approach for bird species identification that relies on both visual features extracted from unconstrained bird images and acoustic features extracted from bird vocalizations. The Scale Invariant Feature Transform (SIFT) detects local features in bird images, which are then used to train a support vector machine classifier. The instances that are not classified with a certain degree of certainty are then rejected and reclassified using Mel-frequency cepstral coefficients (MFCCs) extracted from the bird songs if available. Experiments conducted on a dataset of 50 bird species that comprise images from the CUB200-2011 and audio samples from Xeno-Canto have shown that improvements between 1.2 and 15.7 percentage points are achieved when using an acoustic classifier to re-process the instances rejected by the visual classifier, depending on the rejection level.
  • Keywords
    audio acoustics; scaling phenomena; support vector machines; CUB200-2011; MFCC; Mel-frequency cepstral coefficients; SIFT; Xeno-Canto; acoustic features; acoustic identification; audio samples; bird species; scale invariant feature transform; support vector machine classifier; visual classifier; visual identification; Birds; Feature extraction; Mel frequency cepstral coefficient; Monitoring; Support vector machines; Visualization; MFCC; SIFT; combination of classifiers; fine-grained classification; fusion of information;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178383
  • Filename
    7178383