• DocumentCode
    29311
  • Title

    Usable speech detection based on empirical mode decomposition

  • Author

    Ghezaiel, W. ; Ben Slimane, A. ; Ben Braiek, Ezzedine

  • Author_Institution
    CEREP-ESSTT, Univ. of Tunis, Tunis, Tunisia
  • Volume
    49
  • Issue
    7
  • fYear
    2013
  • fDate
    March 28 2013
  • Firstpage
    503
  • Lastpage
    504
  • Abstract
    Recently, usable speech criteria have been proposed to extract minimally corrupted speech for speaker identification in co-channel speech. Proposed is a new usable speech extraction method based on the pitch information obtained from a multi-resolution analysis by empirical mode decomposition. The idea is to retain the speech segments in which only one pitch is detected and to remove the others. The method is evaluated on the TIMIT database, using the target-to-interferer ratio (TIR) measure as the reference. Co-channel speech is constructed by mixing speakers in all possible gender combinations. The results do not differ much between the different mixtures. Over all mixtures, 94.97% of usable speech is correctly detected, with a false alarm rate of 16.52%.
  • Keywords
    feature extraction; signal detection; signal resolution; speaker recognition; statistical analysis; TIMIT database; cochannel speech; empirical mode decomposition; false alarm; gender speaker; interferer ratio measure; minimally corrupted speech extraction; multiresolution analysis; pitch detection; pitch information; speaker identification; speech segment; usable speech criteria; usable speech detection; usable speech extraction method;
  • fLanguage
    English
  • Journal_Title
    Electronics Letters
  • Publisher
    IET
  • ISSN
    0013-5194
  • Type

    jour

  • DOI
    10.1049/el.2012.3639
  • Filename
    6504989
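
The abstract describes scoring detected usable segments against a reference derived from the target-to-interferer ratio (TIR). The following is a minimal sketch of that kind of scoring, not the authors' implementation. The function names, the non-overlapping framing, the 20 dB threshold, the use of the TIR magnitude, and the exact definition of the false alarm rate are all assumptions made for illustration; the record itself specifies none of them. The pitch-based detector built on empirical mode decomposition is not reproduced here; its per-frame decisions are simply taken as input.

```python
import math


def frame_energies(signal, frame_len):
    """Sum of squared samples for each full, non-overlapping frame."""
    n_frames = len(signal) // frame_len
    return [
        sum(x * x for x in signal[i * frame_len:(i + 1) * frame_len])
        for i in range(n_frames)
    ]


def tir_db(target, interferer, frame_len):
    """Frame-wise target-to-interferer energy ratio in dB."""
    eps = 1e-12  # keeps the ratio finite on silent frames
    e_target = frame_energies(target, frame_len)
    e_interf = frame_energies(interferer, frame_len)
    return [
        10.0 * math.log10((t + eps) / (i + eps))
        for t, i in zip(e_target, e_interf)
    ]


def usable_reference(tir, threshold_db=20.0):
    """Reference labels: a frame counts as usable when one speaker
    dominates, i.e. |TIR| >= threshold_db (assumed threshold and rule)."""
    return [abs(t) >= threshold_db for t in tir]


def score(detected, reference):
    """Return (detection rate, false alarm rate).

    Assumed definitions: detection rate is the fraction of reference-usable
    frames that were detected; false alarm rate is the fraction of
    reference-unusable frames that were wrongly detected as usable.
    """
    n_usable = sum(reference)
    n_unusable = len(reference) - n_usable
    hits = sum(d and r for d, r in zip(detected, reference))
    false_alarms = sum(d and not r for d, r in zip(detected, reference))
    hit_rate = hits / n_usable if n_usable else 0.0
    fa_rate = false_alarms / n_unusable if n_unusable else 0.0
    return hit_rate, fa_rate
```

Given the separate target and interferer signals used to build a mixture, `tir_db` followed by `usable_reference` yields the reference labels, and `score` compares any detector's per-frame decisions against them.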