DocumentCode :
3412749
Title :
Real-time speech/music classification with a hierarchical oblique decision tree
Author :
Wang, Jun ; Wu, Qiong ; Deng, Haojiang ; Yan, Qin
Author_Institution :
Inst. of Acoust., Chinese Acad. of Sci., Beijing
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
2033
Lastpage :
2036
Abstract :
In the problem of classification of audio signals, the requirements of low-complexity, high-accuracy and short delay are crucial for some practical scenarios. This paper proposes a method of real-time speech/music classification with a hierarchical oblique decision tree. A set of discrimination features in frequency domain are selected together with a proposed simple harmonic structure stability feature, which is based on a rough estimation of the harmonic structure. A feature subset selection tool is used to select a subset of short and long term features to feed into a hierarchical oblique decision tree classifier. The method is evaluated and compared with the open loop selection mode in AMR-WB+. Experiments show the proposed approach gives a better performance (98.3%) compared to other prevailing approaches. In particular, it comes with promising short delay of 10 ms and low complexity of 1 wmops.
Keywords :
audio signal processing; decision trees; frequency-domain analysis; music; signal classification; speech processing; audio signal classification; discrimination feature; frequency domain; harmonic structure stability feature; hierarchical oblique decision tree; real-time speech/music classification; rough estimation; Classification algorithms; Classification tree analysis; Decision trees; Delay; Frequency domain analysis; Frequency selective surfaces; Multiple signal classification; Music; Speech; Stability; FSS; harmonic structure; hierarchical oblique decision tree; signal classification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4518039
Filename :
4518039
Link To Document :
بازگشت