Title :
Two-Stage Speech/Non-Speech Classification of Telephone Signals
Author :
Jian-bin, Li ; Ji-kun, Yan ; Hui, Zheng ; Zhong-Xia, Niu
Author_Institution :
Southwest Electron. & Telecommun. Technol. Res. Inst., Chengdu
Abstract :
Robust speech/non-speech classification is a useful pre-processing step for speech recognition and for content-based audio retrieval, where the database is composed of various audio files. This paper presents a two-stage speech/non-speech classification algorithm for telephone signals that combines three simple methods. The first stage uses short-time energy plus pitch period, and its output is then classified further in the second stage, which applies the AdaBoost algorithm to MFCC features. Experiments show the effectiveness and efficiency of the algorithm.
Keywords :
audio databases; audio signal processing; content-based retrieval; signal classification; speech recognition; AdaBoost algorithm; MFCC features; content-based audio retrieval; database; pitch period; short-time energy; telephone signal; two-stage speech classification; Classification algorithms; Data engineering; Feature extraction; Frequency measurement; Mel frequency cepstral coefficient; Robustness; Telephony
Conference_Title :
Communications, Circuits and Systems Proceedings, 2006 International Conference on
Conference_Location :
Guilin
Print_ISBN :
0-7803-9584-0
Electronic_ISBN :
0-7803-9585-9
DOI :
10.1109/ICCCAS.2006.284683