• DocumentCode
    3675959
  • Title

    Acoustic Feature Extraction Using Perceptual Wavelet Packet Decomposition for Frog Call Classification

  • Author

    Jie Xie;Michael Towsey;Philip Eichinski;Jinglan Zhang;Paul Roe

  • Author_Institution
    Fac. of Sci. &
  • fYear
    2015
  • Firstpage
    237
  • Lastpage
    242
  • Abstract
    Frog protection has become increasingly essential due to the rapid decline of its biodiversity. Therefore, it is valuable to develop new methods for studying this biodiversity. In this paper, a novel feature extraction method is proposed based on perceptual wavelet packet decomposition for classifying frog calls in noisy environments. Pre-processing and syllable segmentation are first applied to the frog call. Then, a spectral peak track is extracted from each syllable if possible. Track duration, dominant frequency and oscillation rate are directly extracted from the track. With k-means clustering algorithm, the calculated dominant frequency of all frog species is clustered into k parts, which produce a frequency scale for wavelet packet decomposition. Based on the adaptive frequency scale, wavelet packet decomposition is applied to the frog calls. Using the wavelet packet decomposition coefficients, a new feature set named perceptual wavelet packet decomposition sub-band cepstral coefficients is extracted. Finally, a k-nearest neighbour (k-NN) classifier is used for the classification. The experiment results show that the proposed features can achieve an average classification accuracy of 97.45% which outperforms syllable features (86.87%) and Mel-frequency cepstral coefficients (MFCCs) feature (90.80%).
  • Keywords
    "Feature extraction","Wavelet packets","Accuracy","Spectrogram","Time-frequency analysis","Oscillators","Biodiversity"
  • Publisher
    ieee
  • Conference_Titel
    e-Science (e-Science), 2015 IEEE 11th International Conference on
  • Type

    conf

  • DOI
    10.1109/eScience.2015.47
  • Filename
    7304296