DocumentCode
3143783
Title
Integrating multiple observations for model-based single-microphone speech separation with conditional random fields
Author
Yeung, Yu Ting ; Lee, Tan ; Leung, Cheung-Chi
Author_Institution
Dept. of Electron. Eng., Chinese Univ. of Hong Kong, Hong Kong, China
fYear
2012
fDate
25-30 March 2012
Firstpage
257
Lastpage
260
Abstract
A single-microphone speech separation framework based on conditional random fields (CRFs) is proposed in this paper. Unlike factorial HMM, CRF does not have the conditional independence assumption on observations, thus different types of observations from the speech mixture can be integrated into the models through feature functions. Similar to factorial HMM, there is the statistical independence assumption on sources. Under this assumption, the two-source single-microphone speech separation problem can be expressed by two independent linear-chain CRFs. The separation problem becomes two pattern recognition problems, with respect to CRF models of the two sources. Experimental results show that by integrating initial separation outputs from factorial HMM with log power spectrum, fundamental frequency and speaker likelihoods of the mixture, CRF separation framework consistently improves the results from factorial HMM in terms of SNR, segmental SNR and PESQ.
Keywords
random processes; source separation; speech processing; conditional random fields; feature function; model based single microphone speech separation; multiple observation; speaker likelihood; speech mixture; Gaussian distribution; Hidden Markov models; Signal to noise ratio; Spectral analysis; Speech; Speech recognition; Training; conditional random fields; speech separation;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location
Kyoto
ISSN
1520-6149
Print_ISBN
978-1-4673-0045-2
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2012.6287866
Filename
6287866
Link To Document