• DocumentCode
    3115874
  • Title

    A Joint Probabilistic-Deterministic Approach using Source-Filter Modeling of Speech Signal for Single Channel Speech Separation

  • Author

    Radfar, M.H. ; Dansereau, R.M. ; Sayadiyan, A.

  • Author_Institution
    Dept. of Syst. & Comput. Eng., Carleton Univ., Ottawa, ON
  • fYear
    2006
  • fDate
    6-8 Sept. 2006
  • Firstpage
    47
  • Lastpage
    52
  • Abstract
    In this paper, we present a new technique for separating two speech signals from a single recording. For this purpose, we decompose the speech signal into the excitation signal and the vocal tract function and then estimate the components from the mixed speech using a hybrid model. We first express the probability density function (PDF) of the mixed speech´s log spectral vectors in terms of the PDFs of the underlying speech signal´s vocal tract functions. Then, the mean vectors of PDFs of the vocal tract functions are obtained using a Maximum Likelihood estimator given the mixed signal. Finally, the estimated vocal tract function along with the extracted pitch values are used to reconstruct estimates of the individual speech signals. We compare our model with both an underdetermined blind source separation and a CASA method. The experimental results show our model outperforms both techniques in terms of SNR improvement and the percentage of crosstalk suppression.
  • Keywords
    filtering theory; maximum likelihood estimation; probability; signal reconstruction; source separation; spectral analysis; speech processing; excitation signal; joint probabilistic-deterministic approach; maximum likelihood estimation; mixed speech log spectral vector; probability density function; single channel speech separation; speech signal reconstruction; speech signal source-filter modeling; speech single recording; vocal tract function estimation; Blind source separation; Crosstalk; Fourier transforms; Frequency domain analysis; Maximum likelihood estimation; Probability density function; Psychoacoustic models; Source separation; Speech; Systems engineering and theory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning for Signal Processing, 2006. Proceedings of the 2006 16th IEEE Signal Processing Society Workshop on
  • Conference_Location
    Arlington, VA
  • ISSN
    1551-2541
  • Print_ISBN
    1-4244-0656-0
  • Electronic_ISBN
    1551-2541
  • Type

    conf

  • DOI
    10.1109/MLSP.2006.275520
  • Filename
    4053619