Title :
Speaker identification employing waveform based speech CODEC
Author :
Mikhael, Wasfy B. ; Premakanthan, Pravinkumar
Author_Institution :
Sch. of Electr. Eng. & Comput. Sci., Univ. of Central Florida, Orlando, FL, USA
Abstract :
A novel approach for Automatic Speaker Identification (ASI) employing Waveform based signal representation in multiple domains is presented. The proposed approach involves two stages, namely, the encoding stage, and the decoding stage. During the encoding stage (training mode), mixed transform coding, in conjunction with split vector Quantization (MTSVQ) is employed to form representative codebooks for each speaker. During the decoding stage (running mode), the vectors that best represent the unknown input vector are selected to represent the speech vectors. A normalised matching accuracy measure is developed to evaluate the proposed algorithm´s performance. The resulting technique is consistently found to obtain enhanced ASI accuracy in comparison with the earlier approaches as vector quantization employing single transform domains.
Keywords :
codecs; speaker recognition; transform coding; vector quantisation; decoding stage; encoding stage; matching accuracy measure; mixed transform coding; representative codebooks; running mode; speaker identification; speech vectors; split vector quantization; training mode; unknown input vector; waveform based speech CODEC; Automatic speech recognition; Compaction; Computer science; Decoding; Encoding; Signal representations; Speech codecs; Speech coding; Transform coding; Vector quantization;
Conference_Titel :
Circuits and Systems, 2002. MWSCAS-2002. The 2002 45th Midwest Symposium on
Print_ISBN :
0-7803-7523-8
DOI :
10.1109/MWSCAS.2002.1187042