• DocumentCode
    2152892
  • Title

    Speaker identification in a multimodal interface

  • Author

    Kacur, Juraj ; Varga, M. ; Rozinaj, Gregor

  • Author_Institution
    Inst. of Telecommun., Slovak Univ. of Technol., Bratislava, Slovakia
  • fYear
    2013
  • fDate
    25-27 Sept. 2013
  • Firstpage
    191
  • Lastpage
    194
  • Abstract
    The article presents the development of a speaker identification system as one part of the multimodal interface for the HBB-NEXT project. A short introduction to a speaker identification problem in the context of HBB-NEXT project is given. Then we focus on the design, optimization and method selection process in order to realize a real time, text independent speaker identification application, namely: selection and optimization of acoustical features, and selection and optimization of cooperating classification methods. All adjustments and evaluations were executed on a speaker database containing testing and training sections. The highest accuracies reaching 95% were observed for frequency ranges 300-9000Hz, MFCC and KNN methods.
  • Keywords
    cepstral analysis; feature extraction; pattern classification; speaker recognition; HBB-NEXT project; KNN methods; MFCC methods; acoustical features; cooperating classification methods; frequency 300 Hz to 9000 Hz; multimodal interface; real time text independent speaker identification application; speaker database; Accuracy; Feature extraction; Mel frequency cepstral coefficient; Speaker recognition; Speech; Speech recognition; Training; GMM; KNN; MFCC; PLP; Speaker Identification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    ELMAR, 2013 55th International Symposium
  • Conference_Location
    Zadar
  • ISSN
    1334-2630
  • Print_ISBN
    978-953-7044-14-5
  • Type

    conf

  • Filename
    6658349