• DocumentCode
    600222
  • Title

    Person Recognition Using Humming, Singing and Speech

  • Author

    Patil, Hemant A. ; Madhavi, Maulik C. ; Chhayani, Nirav H.

  • Author_Institution
    Dhirubhai Ambani Inst. of Inf. & Commun. Technol. (DA-IICT), Gandhinagar, India
  • fYear
    2012
  • fDate
    13-15 Nov. 2012
  • Firstpage
    149
  • Lastpage
    152
  • Abstract
    Speaker recognition deals with designing the system which recognizes the person by speech with the help of computers. In this paper, the various biometric signals produced by humans, viz., speech, singing and humming are considered for person recognition task. Corpus has been developed from 28 subjects in real-life settings. For person recognition task, state-of-the-art feature set, viz., Mel Frequency Cepstral Coefficients (MFCC) and a discriminatively-trained polynomial classifier of 2nd order approximation are used as spectral feature and classification techniques, respectively. Our experimental results indicate that the performance of person recognition system obtained using humming outperforms other biometric patterns (i.e., speech and singing) by 9 % in EER and 9 % in Identification Rate. We believe that this may be due to the person-specific characteristics are better captured in humming sounds, (which are nasalized sounds) than speech and singing.
  • Keywords
    approximation theory; biometrics (access control); cepstral analysis; feature extraction; pattern classification; speaker recognition; EER; MFCC; biometric patterns; biometric signals; classification techniques; discriminatively-trained polynomial classifier; humming feature; identification rate; mel frequency cepstral coefficients; person recognition system; person recognition task; person-specific characteristics; second order approximation; singing feature; speaker recognition; spectral feature; speech feature; state-of-the-art feature set; Mel frequency cepstral coefficient; Polynomials; Speaker recognition; Speech; Speech recognition; Testing; Training; Biometric; Corpus development; Humming; Singer recognition; Speaker recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Asian Language Processing (IALP), 2012 International Conference on
  • Conference_Location
    Hanoi
  • Print_ISBN
    978-1-4673-6113-2
  • Electronic_ISBN
    978-0-7695-4886-9
  • Type

    conf

  • DOI
    10.1109/IALP.2012.58
  • Filename
    6473718