• DocumentCode
    1687156
  • Title

    Discriminative feature extraction for language identification

  • Author

    Shuai Huang ; Coppersmith, Glen A.

  • Author_Institution
    Center for Language & Speech Process., Johns Hopkins Univ., Baltimore, MD, USA
  • fYear
    2013
  • Firstpage
    6862
  • Lastpage
    6865
  • Abstract
    In this paper we propose a discriminative feature extraction method, DFE, to address the increasing number of features in language identification (LID) tasks. Similar to linear discriminant analysis (LDA), it extracts the most discriminative features through the maximization of an “approximated” mutual information I(C; Y ) between the class labels C and the projected data Y. Compared with other feature extraction methods, experiments done on the CallFriend corpus shows DFE could handle high-dimensional dataset with ease. Furthermore, this feature extraction shows improvements on the LID task over standard feature extraction methods (LDA and principal components analysis).
  • Keywords
    feature extraction; natural language processing; principal component analysis; CallFriend corpus; class labels; discriminative feature extraction; high-dimensional dataset; language identification; linear discriminant analysis; mutual information; principal components analysis; Eigenvalues and eigenfunctions; Estimation; Feature extraction; Linear programming; Mutual information; Principal component analysis; Vectors; Feature extraction; language identification; mutual information;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6638991
  • Filename
    6638991