مرکز منطقه ای اطلاع رساني علوم و فناوري - Sound source detection using multiple noise models

DocumentCode :

3412719

Title :

Sound source detection using multiple noise models

Author :

Matsunaga, Shoichi ; Yamaguchi, Masahide ; Yamauchi, Katsuya ; Yamashita, Masaru

Author_Institution :

Dept. of Comput. & Inf. Sci., Nagasaki Univ., Nagasaki

fYear :

2008

fDate :

March 31 2008-April 4 2008

Firstpage :

2025

Lastpage :

2028

Abstract :

This paper describes a sound source detection approach based on elaborate noise-modeling techniques for audio indexing. For accurate detection, we devised two methods to generate multiple-noise models through clustering techniques. One method is based on frame-wise data similarity, and the other is based on noise source similarity. The former method employs K-means clustering and a smoothing technique to avoid inaccurate segmentation. The latter method involves noise modeling based on a tree data structure generated by the progressive merging of noise clusters. The classification experiments show that by using these proposed methods, audio sources can be detected with better accuracy than that achieved by a conventional method. When four noise models generated by the latter method were used, the noise detection performance increased by 3.9% for the periods in which the sound sources did not overlap. With regard to the experiments for an audio stream that included overlapped segments, the noise detection performance increased by 1.2% without a decrease in the speech detection performance.

Keywords :

acoustic signal detection; audio signal processing; noise; pattern clustering; signal classification; smoothing methods; tree data structures; K-means clustering; acoustic segmentation; audio indexing; classification experiments; frame-wise data similarity; multiple noise models; noise detection; noise source similarity; noise-modeling techniques; smoothing technique; sound source detection; speech detection; tree data structure; Acoustic noise; Acoustic signal detection; Indexing; Multimedia communication; Noise generators; Smoothing methods; Speech enhancement; Speech recognition; Streaming media; Tree data structures; Acoustic segmentation; Clustering;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on

Conference_Location :

Las Vegas, NV

ISSN :

1520-6149

Print_ISBN :

978-1-4244-1483-3

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2008.4518037

Filename :

4518037

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3412719