DocumentCode :
699341
Title :
Audio source segmentation using spectral correlation features for automatic indexing of broadcast news
Author :
Matsunaga, Shoichi ; Mizuno, Osamu ; Ohtsuki, Katsutoshi ; Hayashi, Yoshihiko
Author_Institution :
NTT Cyber Space Labs., NTT Corporationing, Yokosuka, Japan
fYear :
2004
fDate :
6-10 Sept. 2004
Firstpage :
2103
Lastpage :
2106
Abstract :
This paper proposes a new segmentation procedure to detect audio source intervals for automatic indexing of broadcast news. The procedure is composed of an audio source detection part and a part that smoothes the detected sequences. The detection part uses three new acoustic feature parameters that are based on spectral cross-correlation: spectral stability, white noise similarity, and sound spectral shape. These parameters make it possible to capture the audio sources more accurately than can be done with conventional parameters. The smoothing part has a new merging method that drops erroneous detection results of short duration. Audio source classification experiments are conducted on broadcast news segments. Performance is increased by 6.6% when the proposed parameters are used and by 3.1% when the proposed merging method is used, showing the usefulness of our approach. Experiments confirm the impact of this proposal on broadcast news indexing.
Keywords :
audio signal processing; signal classification; signal detection; smoothing methods; acoustic feature parameters; audio source classification; audio source detection part; audio source segmentation procedure; automatic indexing; broadcast news; merging method; sound spectral shape; spectral correlation features; spectral stability; white noise similarity; Abstracts; Correlation; Noise; Smoothing methods; Speech;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference, 2004 12th European
Conference_Location :
Vienna
Print_ISBN :
978-320-0001-65-7
Type :
conf
Filename :
7079871
Link To Document :
بازگشت