Title :
Closed set and open set Speaker Identification using amplitude distribution of different Transforms
Author :
Kekre, H.B. ; Kulkarni, Vaishali
Author_Institution :
Mukesh Patel Sch. of Technol. Manage. & Eng., SVKM´s NMIMS Univ., Mumbai, India
Abstract :
In this paper, closed set and open set Speaker Identification has been performed on two different databases. Feature extraction for the Identification has been done by using the amplitude distribution of four different Transforms i.e. DFT, DHT, DCT and DST. Two similarity measures i.e. Euclidean Distance (ED) and Manhattan Distance (MD) have been used for matching. The performance has been compared with respect to the best value of each Transform for following parameters: length of speech sample, similarity measure score, size of feature vector, FAR/FRR performance and data acquisition system. Amongst the transforms the best result is given by DFT at 99.06% for feature vector of size 32. Amongst similarity measures Manhattan distance outnumbers the Euclidean distance by 54 to 15, considering the results of all three lengths of speech. The best GAR is 90.65% with a threshold of 94.11% for DFT with MD as a similarity measure.
Keywords :
discrete Fourier transforms; discrete Hartley transforms; discrete cosine transforms; feature extraction; speaker recognition; DCT; DFT; DHT; DST; Euclidean distance; FAR-FRR performance; GAR; Manhattan distance; amplitude distribution; discrete Fourier transform; discrete Hartley transform; discrete cosine transform; feature extraction; speaker identification; transforms; Databases; Discrete Fourier transforms; Discrete cosine transforms; Feature extraction; Speech; Vectors; Euclidean Distance (ED); False Acceptance Rate (FAR); False Rejection Rate (FRR); Genuine Acceptance Rate (GAR); Manhattan Distance (MD);
Conference_Titel :
Advances in Technology and Engineering (ICATE), 2013 International Conference on
Conference_Location :
Mumbai
Print_ISBN :
978-1-4673-5618-3
DOI :
10.1109/ICAdTE.2013.6524764