Title :
Application of speech recognition to African elephant (Loxodonta africana) vocalizations
Author :
Clemins, PairickJ ; Johnson, Michael T.
Author_Institution :
Speech & Signal Process. Lab., Marquette Univ., Milwaukee, WI, USA
Abstract :
This paper presents a novel application of speech processing research, classification of African elephant vocalizations. Speaker identification and call classification experiments are performed on data collected from captive African elephants in a naturalistic environment. The features used for classification are 12 mel-frequency cepstral coefficients plus log energy computed using a shifted filter bank to emphasize the infrasound range of the frequency spectrum used by African elephants. Initial classification accuracies of 83.8% for call classification and 88.1% for speaker identification were obtained. The long-term goal of this research is to develop a universal analysis framework and robust feature set for animal vocalizations that can be applied to many species.
Keywords :
cepstral analysis; feature extraction; signal classification; speaker recognition; speech processing; African elephant vocalizations; Loxodonta africana; call classification; features; frequency spectrum; infrasound range; log energy; mel-frequency cepstral coefficients; shifted filter bank; speaker identification; speech processing; speech recognition; Acoustic noise; Animals; Biomedical acoustics; Cepstral analysis; Feature extraction; Robustness; Signal processing algorithms; Speech processing; Speech recognition; Working environment noise;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
Print_ISBN :
0-7803-7663-3
DOI :
10.1109/ICASSP.2003.1198823