مرکز منطقه ای اطلاع رساني علوم و فناوري - Unvoiced Speech Segregation From Nonspeech Interference via CASA and Spectral Subtraction

DocumentCode :

1383958

Title :

Unvoiced Speech Segregation From Nonspeech Interference via CASA and Spectral Subtraction

Author :

Hu, Ke ; Wang, DeLiang

Author_Institution :

Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA

Volume :

Issue :

fYear :

2011

Firstpage :

1600

Lastpage :

1609

Abstract :

While a lot of effort has been made in computational auditory scene analysis to segregate voiced speech from monaural mixtures, unvoiced speech segregation has not received much attention. Unvoiced speech is highly susceptible to interference due to its relatively weak energy and lack of harmonic structure, and hence makes its segregation extremely difficult. This paper proposes a new approach to segregation of unvoiced speech from nonspeech interference. The proposed system first removes estimated voiced speech, and the periodic part of interference based on cross-channel correlation. The resultant interference becomes more stationary and we estimate the noise energy in unvoiced intervals using segregated speech in neighboring voiced intervals. Then unvoiced speech segregation occurs in two stages: segmentation and grouping. In segmentation, we apply spectral subtraction to generate time-frequency segments in unvoiced intervals. Unvoiced speech segments are subsequently grouped based on frequency characteristics of unvoiced speech using simple thresholding as well as Bayesian classification. The proposed algorithm is computationally efficient, and systematic evaluation and comparison show that our approach considerably improves the performance of unvoiced speech segregation.

Keywords :

Bayes methods; harmonics; interference (signal); speech processing; Bayesian classification; CASA; computational auditory scene analysis; cross-channel correlation; harmonic structure; monaural mixtures; nonspeech interference; spectral subtraction; unvoiced speech segregation; Feature extraction; Harmonic analysis; Interference; Noise; Power harmonic filters; Speech; Time frequency analysis; Bayesian classification; computational auditory scene analysis (CASA); nonspeech interference; spectral subtraction; unvoiced speech segregation;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2010.2093893

Filename :

5640654

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1383958