Title :
Single Channel Speech and Background Segregation Through Harmonic-Temporal Clustering
Author :
Le Roux, Jonathan ; Kameoka, Hirokazu ; Ono, Nobutaka ; de Cheveigne, Alain ; Sagayama, Shigeki
Author_Institution :
Graduate School of Information Science and Technology, The University of Tokyo, Tokyo, Japan; CNRS, Université Paris 5, and Ecole Normale Supérieure, Paris, France. leroux@hil.t.u-tokyo.ac.jp
Abstract :
The design of effective algorithms for single-channel analysis of complex and varied acoustical scenes is a very important and challenging problem. We present here the application of the recently introduced Harmonic-Temporal Clustering (HTC) framework to single channel speech enhancement, background retrieval and speaker separation. HTC processing relies on a precise parametric description of the voiced parts of speech derived from the power spectrum. We explain the positioning of the algorithm inside the Computational Acoustic Scene Analysis (CASA) area, describe the theoretical background of the method, show through preliminary experiments its basic feasibility, and discuss potential improvements.
Keywords :
Acoustic applications; Algorithm design and analysis; Auditory system; Clustering algorithms; Layout; Loudspeakers; Signal processing algorithms; Speech analysis; Speech enhancement; Speech processing;
Conference_Titel :
Applications of Signal Processing to Audio and Acoustics, 2007 IEEE Workshop on
Conference_Location :
New Paltz, NY, USA
Print_ISBN :
978-1-4244-1620-2
Electronic_ISBN :
978-1-4244-1619-6
DOI :
10.1109/ASPAA.2007.4393003