Title :
Speaker identification using pseudo pitch synchronized phase information in noisy environments
Author :
Kawakami, Y. ; Longbiao Wang ; Nakagawa, Sachiko
Author_Institution :
Dept. of Electr. Eng., Nagaoka Univ. of Technol., Nagaoka, Japan
Date :
Oct. 29 2013-Nov. 1 2013
Abstract :
Conventional speaker identification methods based on mel-frequency cepstral coefficients (MFCCs) ignore phase information. Recent studies have shown that phase information contains speaker-dependent characteristics and that pitch-synchronous phase information is more suitable for speaker identification. In this paper, we verify the effectiveness of pitch-synchronous phase information for speaker identification in noisy environments. Experiments were conducted using the JNAS (Japanese Newspaper Article Sentence) database. The method based on pseudo pitch synchronized phase information achieved a relative speaker identification error reduction rate of 15.5% compared with conventional phase information (that is, phase that is not pitch-synchronized). Cutting frames with low power and combining the phase information with MFCCs yielded a further improvement.
Keywords :
cepstral analysis; speaker recognition; JNAS database; Japanese Newspaper Article Sentence; MFCC; mel-frequency cepstral coefficients; noisy environments; pseudo pitch synchronized phase information; speaker identification; Databases; Mel frequency cepstral coefficient; Noise; Noise measurement; Speaker recognition; Speech; Synchronization;
Conference_Title :
Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2013 Asia-Pacific
Conference_Location :
Kaohsiung
DOI :
10.1109/APSIPA.2013.6694385