Title :
Efficient pre-quantization techniques based on probability density for speaker recognition system
Author :
Sarkar, Gourav ; Saha, Goutam
Author_Institution :
Dept. of Electron. & Electr. Commun. Eng., Indian Inst. of Technol., Kharagpur, India
Abstract :
The amount of speaker specific information in speech signal varies from frame to frame depending on spoken text and environmental conditions. A frame selection at the preprocessing stage can be an added advantage in this context. In pre-quantization (PQ) we select a new sequence of frames Y from the original frames X such that length of Y is less than X. In this paper, we first analyze a number of distance measure techniques for frame selection to exploit the redundancies of consecutive frames. Then we propose efficient techniques based on probability density function (PDF) that not only reduces the number of frames before feature extraction but also increases the recognition accuracy. The proposed methods are evaluated on two different databases, POLYCOST (telephone speech) and YOHO (microphone speech), and is shown to provide significant improvement in performance for speaker recognition.
Keywords :
probability; speaker recognition; POLYCOST; YOHO; distance measure technique; frame selection; microphone speech; prequantization technique; probability density function; speaker recognition; speech signal; telephone speech; Automatic speech recognition; Cepstral analysis; Clustering algorithms; Context; Feature extraction; Probability density function; Spatial databases; Speaker recognition; Speech analysis; Testing; Distance measure; moments; pre-quantization; probability density function; speaker recognition;
Conference_Titel :
TENCON 2009 - 2009 IEEE Region 10 Conference
Conference_Location :
Singapore
Print_ISBN :
978-1-4244-4546-2
Electronic_ISBN :
978-1-4244-4547-9
DOI :
10.1109/TENCON.2009.5396081