Title :
Segmentation on Time-Frequency Domain for Speech Segregation
Author :
Lim, Sung-Kil ; Lee, Hyon-Soo
Author_Institution :
Dept. of Comput. Eng., Kyung Hee Univ., Seoul
Abstract :
In this paper, we propose an algorithm for frequency channel segmentation using a neural oscillatory network. Frequency channel segments are local groups of channels in the frequency domain that could have arisen from the same sound source. The proposed algorithm is based on the smoothed spectrum of the input sound. Valleys in the smoothed spectrum are used to determine vertical weights, and the continuity of segment boundaries is used to determine vertical weights in the oscillatory network. To evaluate the suitability of the proposed segmentation algorithm before the grouping stage is applied, we compare the synthesis results obtained with the ideal mask against those obtained with the proposed algorithm.
Keywords :
smoothing methods; speech processing; time-frequency analysis; frequency channel segmentation; neural oscillatory network; oscillatory network; speech segregation; time-frequency domain segmentation; Biological system modeling; Computer networks; Filter bank; Image segmentation; Neurons; Oscillators; Psychoacoustic models; Signal processing algorithms; Speech; Time frequency analysis;
Conference_Title :
Intelligent Signal Processing and Communications, 2006. ISPACS '06. International Symposium on
Conference_Location :
Yonago
Print_ISBN :
0-7803-9732-0
Electronic_ISBN :
0-7803-9733-9
DOI :
10.1109/ISPACS.2006.364699