DocumentCode
730368
Title
Visual and acoustic identification of bird species
Author
Marini, A. ; Turatti, A.J. ; Britto, A.S. ; Koerich, A.L.
Author_Institution
Postgrad. Program in Inf., Pontifical Catholic Univ. of Parana, Curitiba, Brazil
fYear
2015
fDate
19-24 April 2015
Firstpage
2309
Lastpage
2313
Abstract
This paper presents a novel approach for bird species identification that relies on both visual features extracted from unconstrained bird images and acoustic features extracted from bird vocalizations. The Scale Invariant Feature Transform (SIFT) detects local features in bird images, which are then used to train a support vector machine classifier. The instances that are not classified with a certain degree of certainty are then rejected and reclassified using Mel-frequency cepstral coefficients (MFCCs) extracted from the bird songs if available. Experiments conducted on a dataset of 50 bird species that comprise images from the CUB200-2011 and audio samples from Xeno-Canto have shown that improvements between 1.2 and 15.7 percentage points are achieved when using an acoustic classifier to re-process the instances rejected by the visual classifier, depending on the rejection level.
Keywords
audio acoustics; scaling phenomena; support vector machines; CUB200-2011; MFCC; Mel-frequency cepstral coefficients; SIFT; Xeno-Canto; acoustic features; acoustic identification; audio samples; bird species; scale invariant feature transform; support vector machine classifier; visual classifier; visual identification; Birds; Feature extraction; Mel frequency cepstral coefficient; Monitoring; Support vector machines; Visualization; MFCC; SIFT; combination of classifiers; fine-grained classification; fusion of information;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location
South Brisbane, QLD
Type
conf
DOI
10.1109/ICASSP.2015.7178383
Filename
7178383
Link To Document