• DocumentCode
    3695826
  • Title

    Acoustic and auxiliary speech features for speaker identification system

  • Author

    Juraj Kacur;Peter Truchly

  • Author_Institution
    Department of telecommunications FEI STU, Ilkovicova 3, Bratislava, Slovakia
  • fYear
    2015
  • Firstpage
    109
  • Lastpage
    112
  • Abstract
    The focus of the article is on the selection, adjustment and overall performance of speech features at acoustical and prosodic level for speaker recognition task. Namely: perceptual linear prediction, Mel frequency cepstra, cepstral linear prediction, formant frequencies, and different auxiliary features. Both brief theoretical backgrounds and possible computational methods are outlined in regard to the speaker recognition task. In the series of experiments using 114 speakers database, it was observed that a model based method slightly outperformed the perceptual ones. Furthermore, it was found that auxiliary and prosodic features may not always improve scores when processed together with acoustic ones. On average the success rate was about 90% whereas the best recorded score was 99.1% for cepstral linear prediction coefficients in connection with k-nearest neighbor classifier.
  • Keywords
    "Mel frequency cepstral coefficient","Speech","Speech recognition","Speaker recognition","Statistics"
  • Publisher
    ieee
  • Conference_Titel
    ELMAR (ELMAR), 2015 57th International Symposium
  • Print_ISBN
    978-953-184-209-9
  • Type

    conf

  • DOI
    10.1109/ELMAR.2015.7334508
  • Filename
    7334508