Title :
Two-level approach for detecting non-lexical audio events in spontaneous speech
Author :
Li, Yan-Xiong ; He, Qian-Hua ; Li, Wei ; Wang, Zhi-Feng
Author_Institution :
Sch. of Electron. & Inf. Eng., South China Univ. of Technol., Guangzhou, China
Abstract :
Based on analyses of characteristic differences between various audio events, a two-level approach is proposed for detecting three non-lexical audio events (filled pause, laugh, and applause) in spontaneous odel-based decision. The experiments give average precision of 87.3%, recall of 93.77%, and F-measure of 90.42%. Compared with the sliding window based approach, average F-measure is improved by 7.52%. Moreover, it can more accurately determine the boundaries of non-lexical audio events in spontaneous speech.
Keywords :
audio signal processing; speech processing; F-measure; characteristic differences; non-lexical audio events detection; spontaneous speech; Argon; Feature extraction; Hidden Markov models; Semantics; Silicon; Speech; Speech processing;
Conference_Titel :
Audio Language and Image Processing (ICALIP), 2010 International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-5856-1
DOI :
10.1109/ICALIP.2010.5685083