DocumentCode
3013462
Title
A MAP criterion for detecting the number of speakers at frame level in model-based single-channel speech separation
Author
Mowlaee, P. ; Christensen, M.G. ; Tan, Z.-H. ; Jensen, S.H.
Author_Institution
Dept. of Electron. Syst., Aalborg Univ., Aalborg, Denmark
fYear
2010
fDate
7-10 Nov. 2010
Firstpage
538
Lastpage
541
Abstract
The problem of detecting the number of speakers for a particular segment occurs in many different speech applications. In single channel speech separation, for example, this information is often used to simplify the separation process, as the signal has to be treated differently depending on the number of speakers. Inspired by the asymptotic maximum a posteriori rule proposed for model selection, we pose the problem as a model selection problem. More specifically, we derive a multiple hypotheses test for determining the number of speakers at a frame level in an observed signal based on underlying parametric speaker models, trained a priori. The experimental results indicate that the suggested method improves the quality of the separated signals in a single-channel speech separation scenario at different signal-to-signal ratio levels.
Keywords
maximum likelihood estimation; object detection; speech processing; asymptotic maximum a posteriori; frame level; map criterion; model selection problem; model-based single-channel speech separation; parametric speaker models; signal to signal ratio levels; Computational modeling; Detectors; Noise; Speech; Speech processing; Speech recognition; Training; Double-talk detection; multiple-hypothesis test; single-channel speech separation;
fLanguage
English
Publisher
ieee
Conference_Titel
Signals, Systems and Computers (ASILOMAR), 2010 Conference Record of the Forty Fourth Asilomar Conference on
Conference_Location
Pacific Grove, CA
ISSN
1058-6393
Print_ISBN
978-1-4244-9722-5
Type
conf
DOI
10.1109/ACSSC.2010.5757617
Filename
5757617
Link To Document