• DocumentCode
    1395770
  • Title
    A Tandem Algorithm for Pitch Estimation and Voiced Speech Segregation
  • Author
    Hu, Guoning ; Wang, DeLiang
  • Author_Institution
    Biophys. Program, Ohio State Univ., Columbus, OH, USA
  • Volume
    18
  • Issue
    8
  • fYear
    2010
  • Firstpage
    2067
  • Lastpage
    2079
  • Abstract
    Considerable effort has been devoted in computational auditory scene analysis (CASA) to segregating speech from monaural mixtures. The performance of current CASA systems on voiced speech segregation is limited by the lack of a robust algorithm for pitch estimation. We propose a tandem algorithm that performs pitch estimation of a target utterance and segregation of voiced portions of target speech jointly and iteratively. This algorithm first obtains a rough estimate of target pitch, and then uses this estimate to segregate target speech using harmonicity and temporal continuity. It then improves both pitch estimation and voiced speech segregation iteratively. Novel methods are proposed for performing segregation with a given pitch estimate and pitch determination with given segregation. Systematic evaluation shows that the tandem algorithm extracts a majority of target speech without including much interference, and it performs substantially better than previous systems for either pitch extraction or voiced speech segregation.
  • Keywords
    iterative methods; speech processing; computational auditory scene analysis (CASA); iterative procedure; monaural mixtures; pitch determination; pitch estimation; pitch extraction; tandem algorithm; target utterance; temporal continuity; voiced speech segregation; speech segregation; Auditory system; Automatic speech recognition; Background noise; Image analysis; Interference; Iterative algorithms; Noise robustness; Performance evaluation; Speech analysis; Speech enhancement
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1558-7916
  • Type
    jour
  • DOI
    10.1109/TASL.2010.2041110
  • Filename
    5398889