Title :
Optimal selection of bitstream features for compressed-domain automatic speaker recognition
Author :
Petracca, Matteo ; Servetti, Antonio ; De Martin, Juan Carlos
Author_Institution :
Dipt. di Autom. e Inf., Politec. di Torino, Turin, Italy
Abstract :
Low-complexity compressed-domain automatic speaker recognition algorithms are directly applied to the coded speech bitstream to avoid the computational burden of decoding the parameters and resynthesizing the speech waveform. The objective of this paper is to further reduce the complexity of this approach by determining the smallest set of bitstream features that has the maximum effectiveness on recognition accuracy. For this purpose, recognition accuracy is evaluated with various sets of medium-term statistical features extracted from GSM AMR compressed speech coded at 12.2 kb/s. Over a database of 14 speakers the results show that, using 20 seconds of active speech, a recognition ratio of 100% can be achieved with only nine of the 18 statistical features under analysis. This is a complexity reduction by a factor of two with respect to previous works. Moreover, the robustness of the proposed system has been assessed using test samples of different length and varying levels of frame losses, and proved to be the same of previous approaches.
Keywords :
data compression; decoding; feature extraction; speaker recognition; speech coding; speech synthesis; statistical analysis; GSM AMR compressed speech coding; adaptive multirate standard; bit rate 12.2 kbit/s; coded speech bitstream feature selection; low-complexity compressed-domain automatic speaker recognition algorithm; medium-term statistical feature extraction; speech waveform resynthesis; Complexity theory; Europe; Robustness; Speech; Speech coding; Speech recognition; Vectors;
Conference_Titel :
Signal Processing Conference, 2006 14th European
Conference_Location :
Florence