Title :
Low-complexity automatic speaker recognition in the compressed GSM AMR domain
Author :
Petracca, M. ; Servetti, A. ; De Martin, J.C.
Author_Institution :
Dipt. di Autom. e Inf., Torino Univ., Italy
Abstract :
This paper presents an experimental implementation of a low-complexity speaker recognition algorithm working in the compressed speech domain. The goal is to perform speaker modeling and identification without decoding the speech bitstream to extract speaker dependent features, thus saving important system resources, for instance, in mobile devices. The compressed bitstream values of the widely used GSM AMR speech coding standard are studied to identify statistics enabling fair recognition after a few seconds of speech. Using Euclidean distance measures on elementary statistical values such as coefficient of variation and skewness of nine standard GSM AMR parameters delivers recognition accuracies close to 100% after about 20 seconds of active speech for a database of 14 speakers recorded in a normal room environment.
Keywords :
adaptive codes; cellular radio; code standards; data compression; feature extraction; speaker recognition; speech coding; statistical analysis; Euclidean distance measure; GSM AMR; adaptive multirate code; compressed domain; elementary statistical value; low-complexity automatic speaker recognition; speaker dependent feature extraction; speaker modeling; speech bitstream; speech coding standard; Databases; Decoding; Euclidean distance; Feature extraction; GSM; Measurement standards; Speaker recognition; Speech coding; Speech recognition; Statistics;
Conference_Titel :
Multimedia and Expo, 2005. ICME 2005. IEEE International Conference on
Print_ISBN :
0-7803-9331-7
DOI :
10.1109/ICME.2005.1521510