• DocumentCode
    2183262
  • Title

    Improved gender/age recognition system using arousal-selection and feature-selection schemes

  • Author

    Chen, Oscal T.-C. ; Gu, Jhen Jhan

  • Author_Institution
    Department of Electrical Engineering, and Advanced Institute of Manufacturing with High-Tech Innovations, National Chung Cheng University, Chiayi, 62102 Taiwan
  • fYear
    2015
  • fDate
    21-24 July 2015
  • Firstpage
    148
  • Lastpage
    152
  • Abstract
    This work proposes the arousal-selection and feature-selection schemes to improve speaker´s gender and age identification performance. Our previous results showed that gender and age recognition rates would increase as affective stimulation degrees were lower and higher, respectively. Considering a practical scenario, the speaker´s mood does not alter frequently, so speech frames are partitioned into two groups with low and high arousal levels. Here, two Gaussian Mixture Model (GMM) probability density functions are employed to characterize the distributions of the degrees of speech stimuli in terms of tone and energy variations. Such approach can appropriately classify speech frames and easily adapt to different speakers. As well as speech frames are fairly filtered and partitioned, the feature-selection scheme is effectively used to determine adequate low-level features. To do fair comparison, the experiment database adopts Lwazi corpus from South Africa. The proposed system using the arousal-selection and feature-selection schemes exhibits that accuracy rates of gender and age estimations reach 98.9% and 71.6% with 1.7% and 10.8% increases, respectively, as compared to the ones without using arousal-selection and feature-selection schemes. Therefore, the recognition system proposed herein successfully enhances accuracy rates of age and gender estimations for various human-machine interaction and multimedia applications.
  • Keywords
    Accuracy; Estimation; Feature extraction; Jitter; Probability density function; Speech; Speech recognition; Gaussian mixture model; age recognition; arousal; feature selection; gender identification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Digital Signal Processing (DSP), 2015 IEEE International Conference on
  • Conference_Location
    Singapore, Singapore
  • Type

    conf

  • DOI
    10.1109/ICDSP.2015.7251848
  • Filename
    7251848