DocumentCode
2362781
Title
Automatic enlargement of speech corpus for speaker recognition
Author
Alsulaiman, Mansour M.
Author_Institution
Coll. of Comput. & Inf. Sci., King Saud Univ., Riyadh, Saudi Arabia
fYear
2011
fDate
20-23 March 2011
Firstpage
302
Lastpage
306
Abstract
This research deals with the problem of recognition when only a few samples are available for training of the system. To avoid the low recognition rate caused by such type of speech corpus, automatic techniques for the enlargement of speech corpus are proposed in this paper. These techniques are: lengthening of sample by automatic segmentation, automatic noise addition at different sound-to-noise ratios (SNRs), and lengthening of reversed sample. Different combinations of samples, generated by the proposed techniques, are used to obtain the high recognition rate. These techniques have shown promising result.
Keywords
Gaussian processes; cepstral analysis; hidden Markov models; speaker recognition; Gaussian mixture model; automatic noise addition; automatic segmentation; hidden Markov model; mel-frequency cepstral coefficients; sound-to-noise ratios; speaker recognition; speech corpus automatic enlargement; Databases; Hidden Markov models; Mel frequency cepstral coefficient; Noise; Speaker recognition; Speech; Training; Automatic segmentation; Database enlargement; HMM; MFCC; Samples generation; Speaker Recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Computers & Informatics (ISCI), 2011 IEEE Symposium on
Conference_Location
Kuala Lumpur
Print_ISBN
978-1-61284-689-7
Type
conf
DOI
10.1109/ISCI.2011.5958931
Filename
5958931
Link To Document