• DocumentCode
    149040
  • Title

    Missing feature reconstruction methods for robust speaker identification

  • Author

    Xueliang Zhang ; Hui Zhang ; Guanglai Gao

  • Author_Institution
    Comput. Sci. Dept., Inner Mongolia Univ., Hohhot, China
  • fYear
    2014
  • fDate
    1-5 Sept. 2014
  • Firstpage
    1482
  • Lastpage
    1486
  • Abstract
    In this study, we propose a reconstruction method to restore the degraded features for robust speaker identification. The proposed method is based on a hybrid generative model which consists of deep belief network (DBN) and restricted Boltzmann machine (RBM). Specifically, the noisy speech is firstly decomposed into time-frequency (T-F) representations. Then ideal binary mask (IBM) is computed to indicate each T-F point as reliable or unreliable. We reconstruct the unreliable ones by the proposed model iteratively. Finally, reconstructed feature is utilized to conventional speaker identification system. Experiments demonstrate that the proposed method achieves significant performance improvements over previous missing feature techniques under a wide range of signal-to-noise ratios.
  • Keywords
    signal reconstruction; signal representation; signal restoration; speaker recognition; time-frequency analysis; DBN; IBM; RBM; T-F point; T-F representations; deep belief network; hybrid generative model; ideal binary mask; missing feature reconstruction methods; noisy speech; restricted Boltzmann machine; robust speaker identification system; signal-to-noise ratios; time-frequency representations; Abstracts; Adaptation models; Computational modeling; Data models; Production facilities; Robustness; Smoothing methods; Deep belief network; Missing feature techniques; Restricted Boltzmann machine; Robust speaker identification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference (EUSIPCO), 2014 Proceedings of the 22nd European
  • Conference_Location
    Lisbon
  • Type

    conf

  • Filename
    6952536