DocumentCode
3695826
Title
Acoustic and auxiliary speech features for speaker identification system
Author
Juraj Kacur;Peter Truchly
Author_Institution
Department of telecommunications FEI STU, Ilkovicova 3, Bratislava, Slovakia
fYear
2015
Firstpage
109
Lastpage
112
Abstract
The focus of the article is on the selection, adjustment and overall performance of speech features at acoustical and prosodic level for speaker recognition task. Namely: perceptual linear prediction, Mel frequency cepstra, cepstral linear prediction, formant frequencies, and different auxiliary features. Both brief theoretical backgrounds and possible computational methods are outlined in regard to the speaker recognition task. In the series of experiments using 114 speakers database, it was observed that a model based method slightly outperformed the perceptual ones. Furthermore, it was found that auxiliary and prosodic features may not always improve scores when processed together with acoustic ones. On average the success rate was about 90% whereas the best recorded score was 99.1% for cepstral linear prediction coefficients in connection with k-nearest neighbor classifier.
Keywords
"Mel frequency cepstral coefficient","Speech","Speech recognition","Speaker recognition","Statistics"
Publisher
ieee
Conference_Titel
ELMAR (ELMAR), 2015 57th International Symposium
Print_ISBN
978-953-184-209-9
Type
conf
DOI
10.1109/ELMAR.2015.7334508
Filename
7334508
Link To Document