مرکز منطقه ای اطلاع رساني علوم و فناوري - Hierarchical classification of audio data for archiving and retrieving

DocumentCode :

2704921

Title :

Hierarchical classification of audio data for archiving and retrieving

Author :

Zhang, Tong ; Kuo, C. C Jay

Author_Institution :

Integrated Media Syst. Center, Univ. of Southern California, Los Angeles, CA, USA

Volume :

fYear :

1999

fDate :

15-19 Mar 1999

Firstpage :

3001

Abstract :

A hierarchical system for audio classification and retrieval based on audio content analysis is presented in this paper. The system consists of three stages. The first stage is called the coarse-level audio classification and segmentation, where audio recordings are classified and segmented into speech, music, several types of environmental sounds, and silence, based on morphological and statistical analysis of temporal curves of short-time features of audio signals. In the second stage, environmental sounds are further classified into finer classes such as applause, rain, bird sound, etc. This fine-level classification is based on time-frequency analysis of audio signals and use of the hidden Markov model (HMM) for classification. In the third stage, the query-by-example audio retrieval is implemented where similar sounds can be found according to an input sample audio. It is shown that the proposed system has achieved an accuracy higher than 90% for coarse-level audio classification. Examples of audio fine classification and audio retrieval are also provided

Keywords :

audio signal processing; content-based retrieval; database management systems; feature extraction; hidden Markov models; information retrieval; mathematical morphology; signal classification; statistical analysis; time-frequency analysis; archiving; audio classification; audio content analysis; audio data; audio recordings; audio signal; coarse-level audio classification; environmental sounds; hidden Markov model; hierarchical classification; morphological analysis; music; query-by-example audio retrieval; retrieval; segmentation; silence; speech; statistical analysis; temporal curve; time-frequency analysis; Audio recording; Birds; Content based retrieval; Hidden Markov models; Hierarchical systems; Multiple signal classification; Music information retrieval; Rain; Speech analysis; Statistical analysis;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on

Conference_Location :

Phoenix, AZ

ISSN :

1520-6149

Print_ISBN :

0-7803-5041-3

Type :

conf

DOI :

10.1109/ICASSP.1999.757472

Filename :

757472

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2704921