مرکز منطقه ای اطلاع رساني علوم و فناوري - I-vector based speaker gender recognition

DocumentCode :

3758786

Title :

I-vector based speaker gender recognition

Author :

Minghe Wang;Ying Chen;Zhenmin Tang;Erhua Zhang

Author_Institution :

School of Computer Science and Engineering, Nanjing University of Science and Technology, NUST, Nanjing, China

fYear :

2015

Firstpage :

729

Lastpage :

732

Abstract :

Automatic gender recognition has been becoming very important in potential applications. Many state-of-the-art gender recognition approaches based on a variety of biometrics, such as face, body shape, voice, are proposed recently. Among them, relying on voice is suboptimal due to significant variations in pitch, emotion, and noise in real-world speech. Inspired from the speaker recognition approaches relying on i-vector presentation in NIST SRE, it´s believed that i-vector contains information about gender as a part of speaker´s characters, and works for speaker recognition as well as for gender recognition in complex environments. So, we apply the total variability space analysis to gender classification and propose i-vector based discrimination for speaker gender recognition. The results of experiments on TIMIT corpus and NUST603_2014 database show that the proposed i-vector based speaker gender recognition improves the performance up to 99.9%, and surpasses the pitch method and UBM-SVM baseline subsystems in term of accuracy comparatively.

Keywords :

"Speech","Speech recognition","Principal component analysis","Feature extraction","Covariance matrices","Eigenvalues and eigenfunctions","Databases"

Publisher :

ieee

Conference_Titel :

Advanced Information Technology, Electronic and Automation Control Conference (IAEAC), 2015 IEEE

Print_ISBN :

978-1-4799-1979-6

Type :

conf

DOI :

10.1109/IAEAC.2015.7428651

Filename :

7428651

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3758786