Two-Stage Gender Identification Using Pitch Frequencies, MFCCs and HMMs

Author

Rong Phoophuangpairoj;Sukanya Phongsuphap

Author_Institution

Dept. of Comput. Eng., Rangsit Univ., Pathumthani, Thailand

fYear

2015

Firstpage

2879

Lastpage

2884

Abstract

This paper proposes a two-stage method, which can identify the gender of a speaker from spoken syllables that have different tones, using: an average pitch frequency, MFCC-based features (Mel-Frequency Cepstral Coefficients), and Hidden Markov Models (HMM). The method can be divided into 2 stages. At the first stage, an average pitch frequency of each speaker was used to classify the gender. Still, a number of ambiguous speakers who were not clearly classified at the first stage were then forwarded to the second stage. At the second stage, gender identification using: MFCC features, phoneme acoustic models for females and males, and grammar for gender recognition was applied. The experimental results show that the proposed method achieved a high correct gender identification rate of 98.92%, which was higher than the conventional method using an average pitch frequency and a threshold. The proposed method is also more accurate than the Artificial Neural Network (ANN) with pitch frequencies. The results indicate that the proposed method is a practical and efficient way to identify gender.

Keywords

"Acoustics","Hidden Markov models","Speech","Speech recognition","Grammar","Feature extraction","Dictionaries"

Publisher

ieee

Conference_Titel

Systems, Man, and Cybernetics (SMC), 2015 IEEE International Conference on

Type

conf

DOI

10.1109/SMC.2015.501

Filename

7379633