DocumentCode :
3695826
Title :
Acoustic and auxiliary speech features for speaker identification system
Author :
Juraj Kacur;Peter Truchly
Author_Institution :
Department of telecommunications FEI STU, Ilkovicova 3, Bratislava, Slovakia
fYear :
2015
Firstpage :
109
Lastpage :
112
Abstract :
The focus of the article is on the selection, adjustment and overall performance of speech features at acoustical and prosodic level for speaker recognition task. Namely: perceptual linear prediction, Mel frequency cepstra, cepstral linear prediction, formant frequencies, and different auxiliary features. Both brief theoretical backgrounds and possible computational methods are outlined in regard to the speaker recognition task. In the series of experiments using 114 speakers database, it was observed that a model based method slightly outperformed the perceptual ones. Furthermore, it was found that auxiliary and prosodic features may not always improve scores when processed together with acoustic ones. On average the success rate was about 90% whereas the best recorded score was 99.1% for cepstral linear prediction coefficients in connection with k-nearest neighbor classifier.
Keywords :
"Mel frequency cepstral coefficient","Speech","Speech recognition","Speaker recognition","Statistics"
Publisher :
ieee
Conference_Titel :
ELMAR (ELMAR), 2015 57th International Symposium
Print_ISBN :
978-953-184-209-9
Type :
conf
DOI :
10.1109/ELMAR.2015.7334508
Filename :
7334508
Link To Document :
بازگشت