• DocumentCode
    150231
  • Title

    Generalization of supervised learning for binary mask estimation

  • Author

    May, Torsten ; Gerkmann, Timo

  • Author_Institution
    Centre for Appl. Hearing Res., Tech. Univ. of Denmark, Lyngby, Denmark
  • fYear
    2014
  • fDate
    8-11 Sept. 2014
  • Firstpage
    154
  • Lastpage
    158
  • Abstract
    This paper addresses the problem of speech segregation by estimating the ideal binary mask (IBM) from noisy speech. Two methods will be compared, one supervised learning approach that incorporates a priori knowledge about the feature distribution observed during training. The second method solely relies on a frame-based speech presence probability (SPP) es-timation, and therefore, does not depend on the acoustic condition seen during training. We investigate the influence of mismatches between the acoustic conditions used for training and testing on the IBM estimation performance and discuss the advantages of both approaches.
  • Keywords
    learning (artificial intelligence); probability; speech processing; IBM; SPP estimation; feature distribution; ideal binary mask estimation; noisy speech; speech presence probability estimation; speech segregation; supervised learning; Acoustics; Estimation; Noise; Noise measurement; Speech; Testing; Training; generalization; ideal binary mask; speech presence probability; speech segregation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustic Signal Enhancement (IWAENC), 2014 14th International Workshop on
  • Conference_Location
    Juan-les-Pins
  • Type

    conf

  • DOI
    10.1109/IWAENC.2014.6953357
  • Filename
    6953357