• DocumentCode
    3728397
  • Title

    Two-Stage Gender Identification Using Pitch Frequencies, MFCCs and HMMs

  • Author

    Rong Phoophuangpairoj;Sukanya Phongsuphap

  • Author_Institution
    Dept. of Comput. Eng., Rangsit Univ., Pathumthani, Thailand
  • fYear
    2015
  • Firstpage
    2879
  • Lastpage
    2884
  • Abstract
    This paper proposes a two-stage method, which can identify the gender of a speaker from spoken syllables that have different tones, using: an average pitch frequency, MFCC-based features (Mel-Frequency Cepstral Coefficients), and Hidden Markov Models (HMM). The method can be divided into 2 stages. At the first stage, an average pitch frequency of each speaker was used to classify the gender. Still, a number of ambiguous speakers who were not clearly classified at the first stage were then forwarded to the second stage. At the second stage, gender identification using: MFCC features, phoneme acoustic models for females and males, and grammar for gender recognition was applied. The experimental results show that the proposed method achieved a high correct gender identification rate of 98.92%, which was higher than the conventional method using an average pitch frequency and a threshold. The proposed method is also more accurate than the Artificial Neural Network (ANN) with pitch frequencies. The results indicate that the proposed method is a practical and efficient way to identify gender.
  • Keywords
    "Acoustics","Hidden Markov models","Speech","Speech recognition","Grammar","Feature extraction","Dictionaries"
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man, and Cybernetics (SMC), 2015 IEEE International Conference on
  • Type

    conf

  • DOI
    10.1109/SMC.2015.501
  • Filename
    7379633