• DocumentCode
    1379482
  • Title

    Joint Speech Enhancement and Speaker Identification Using Approximate Bayesian Inference

  • Author

    Maina, Ciira Wa ; Walsh, John MacLaren

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Drexel Univ., Philadelphia, PA, USA
  • Volume
    19
  • Issue
    6
  • fYear
    2011
  • Firstpage
    1517
  • Lastpage
    1529
  • Abstract
    We present a variational Bayesian algorithm for joint speech enhancement and speaker identification that makes use of speaker dependent speech priors. Our work is built on the intuition that speaker dependent priors would work better than priors that attempt to capture global speech properties. We derive an iterative algorithm that exchanges information between the speech enhancement and speaker identification tasks. With cleaner speech we are able to make better identification decisions and with the speaker dependent priors we are able to improve speech enhancement performance. We present experimental results using the TIMIT data set which confirm the speech enhancement performance of the algorithm by measuring signal-to-noise (SNR) ratio improvement and perceptual quality improvement via the Perceptual Evaluation of Speech Quality (PESQ) score. We also demonstrate the ability of the algorithm to perform voice activity detection (VAD). The experimental results also demonstrate that speaker identification accuracy is improved.
  • Keywords
    Bayes methods; approximation theory; iterative methods; speaker recognition; speech enhancement; Bayesian algorithm; Bayesian inference; TIMIT data set; global speech property; iterative algorithm; joint speech enhancement-speaker identification; perceptual evaluation of speech quality; perceptual quality improvement; signal-to-noise ratio; voice activity detection; Approximation algorithms; Bayesian methods; Joints; Noise; Signal processing algorithms; Speech; Speech enhancement; Speech enhancement; speaker identification; variational Bayesian inference;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2010.2092767
  • Filename
    5638126