DocumentCode
3675959
Title
Acoustic Feature Extraction Using Perceptual Wavelet Packet Decomposition for Frog Call Classification
Author
Jie Xie;Michael Towsey;Philip Eichinski;Jinglan Zhang;Paul Roe
Author_Institution
Fac. of Sci. &
fYear
2015
Firstpage
237
Lastpage
242
Abstract
Frog protection has become increasingly essential due to the rapid decline of its biodiversity. Therefore, it is valuable to develop new methods for studying this biodiversity. In this paper, a novel feature extraction method is proposed based on perceptual wavelet packet decomposition for classifying frog calls in noisy environments. Pre-processing and syllable segmentation are first applied to the frog call. Then, a spectral peak track is extracted from each syllable if possible. Track duration, dominant frequency and oscillation rate are directly extracted from the track. With k-means clustering algorithm, the calculated dominant frequency of all frog species is clustered into k parts, which produce a frequency scale for wavelet packet decomposition. Based on the adaptive frequency scale, wavelet packet decomposition is applied to the frog calls. Using the wavelet packet decomposition coefficients, a new feature set named perceptual wavelet packet decomposition sub-band cepstral coefficients is extracted. Finally, a k-nearest neighbour (k-NN) classifier is used for the classification. The experiment results show that the proposed features can achieve an average classification accuracy of 97.45% which outperforms syllable features (86.87%) and Mel-frequency cepstral coefficients (MFCCs) feature (90.80%).
Keywords
"Feature extraction","Wavelet packets","Accuracy","Spectrogram","Time-frequency analysis","Oscillators","Biodiversity"
Publisher
ieee
Conference_Titel
e-Science (e-Science), 2015 IEEE 11th International Conference on
Type
conf
DOI
10.1109/eScience.2015.47
Filename
7304296
Link To Document