• DocumentCode
    2362781
  • Title

    Automatic enlargement of speech corpus for speaker recognition

  • Author

    Alsulaiman, Mansour M.

  • Author_Institution
    Coll. of Comput. & Inf. Sci., King Saud Univ., Riyadh, Saudi Arabia
  • fYear
    2011
  • fDate
    20-23 March 2011
  • Firstpage
    302
  • Lastpage
    306
  • Abstract
    This research deals with the problem of recognition when only a few samples are available for training of the system. To avoid the low recognition rate caused by such type of speech corpus, automatic techniques for the enlargement of speech corpus are proposed in this paper. These techniques are: lengthening of sample by automatic segmentation, automatic noise addition at different sound-to-noise ratios (SNRs), and lengthening of reversed sample. Different combinations of samples, generated by the proposed techniques, are used to obtain the high recognition rate. These techniques have shown promising result.
  • Keywords
    Gaussian processes; cepstral analysis; hidden Markov models; speaker recognition; Gaussian mixture model; automatic noise addition; automatic segmentation; hidden Markov model; mel-frequency cepstral coefficients; sound-to-noise ratios; speaker recognition; speech corpus automatic enlargement; Databases; Hidden Markov models; Mel frequency cepstral coefficient; Noise; Speaker recognition; Speech; Training; Automatic segmentation; Database enlargement; HMM; MFCC; Samples generation; Speaker Recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computers & Informatics (ISCI), 2011 IEEE Symposium on
  • Conference_Location
    Kuala Lumpur
  • Print_ISBN
    978-1-61284-689-7
  • Type

    conf

  • DOI
    10.1109/ISCI.2011.5958931
  • Filename
    5958931