• DocumentCode
    2916369
  • Title

    A new approach for spoken language identification based on sequence kernel SVMs

  • Author

    Ziaei, Ali ; Ahadi, Seyed Mohammad ; Yeganeh, Hojatollah ; Mirrezaie, Seyed Masoud

  • Author_Institution
    Electr. Eng. Dept., Amirkabir Univ. of Technol., Tehran, Iran
  • fYear
    2009
  • fDate
    5-7 July 2009
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    A new back-end classifier for GMM-LM based language identification systems is proposed in this paper. The proposed system consists of a mapping matrix and a back-end classifier of SVMs as its main parts, located in series after the GMM-LM system. While the mapping matrix maps the language model´s output vectors to a new space in which the languages are more separable than before, each SVM in the SVM bank-end classifier separates one language from the others. A new sequence kernel is used for each SVM in the bank-end classifier. As a final stage, a fusion block carries out the task of fusing the SVM bank-end scores with those of the GMM-based LID to achieve higher accuracies. We show that not only our new sequence kernel-based SVMs separate languages more efficiently than common Gaussian mixture and GLDS SVM back-end classifiers, but also our new mapping matrix outperforms common linear discriminant matrix in separating classes from each other and finally the introduction of fusion block leads to even superior performance. The overall accuracy of the LID is noticeably increased in comparison with the other LDA-GMM and LDAGLDS SVM back-end classifiers. Our experiments on 5 languages from OGI-TS multilanguage task prove our claim.
  • Keywords
    Gaussian processes; natural language processing; support vector machines; GMM-LM; Gaussian mixture models; OGI-TS multilanguage task; back-end classifier; linear discriminant matrix; mapping matrix; sequence kernel SVM; spoken language identification; Cepstral analysis; Entropy; Feature extraction; Kernel; Laboratories; Mel frequency cepstral coefficient; Natural languages; Speech processing; Support vector machine classification; Support vector machines; Gaussian Mixture Models; Language Identification; Support Vector Machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Digital Signal Processing, 2009 16th International Conference on
  • Conference_Location
    Santorini-Hellas
  • Print_ISBN
    978-1-4244-3297-4
  • Electronic_ISBN
    978-1-4244-3298-1
  • Type

    conf

  • DOI
    10.1109/ICDSP.2009.5201071
  • Filename
    5201071