DocumentCode
1863153
Title
Adaptive order of fractional Fourier transform for whispered speaker identification
Author
Qian Xiaohong ; Zhao Heming
Author_Institution
School of Electronic and Information Engineering, Soochow University, Suzhou 215006, China
fYear
2012
fDate
3-5 March 2012
Firstpage
363
Lastpage
366
Abstract
Speech signal analysis commonly relies on the short-time Fourier transform (STFT), but the STFT provides only the “average” characteristics of a signal and cannot depict the fine structure of speech. This paper therefore introduces the fractional Fourier transform (FRFT) as a speech analysis tool. The FRFT transform orders are set adaptively by piecewise linear fitting of the instantaneous frequency (IF), following the AM-FM model of speech production. On this basis, a new feature is proposed: adaptive fractional Fourier transform cepstral coefficients (A-FRCC). The proposed features are applied to whispered speaker identification. Experimental results show that, under mismatched-channel test conditions, the new features capture finer speech structure and more speaker-specific characteristics than MFCC, and effectively improve both recognition rate and robustness.
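The abstract does not give the order-selection rule, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes that an instantaneous-frequency track in Hz is already available, that fixed-length segments stand in for the paper's piecewise fitting, and that the widely used dimensionless normalisation applies, in which a chirp rate k (Hz/s) over N samples at sampling rate fs becomes k·N/fs². The function names `fit_line`, `frft_order_from_chirp_rate` and `adaptive_orders` are hypothetical.

```python
import math


def fit_line(ts, ys):
    """Least-squares slope and intercept of ys against ts."""
    n = len(ts)
    t_mean = sum(ts) / n
    y_mean = sum(ys) / n
    sxx = sum((t - t_mean) ** 2 for t in ts)
    sxy = sum((t - t_mean) * (y - y_mean) for t, y in zip(ts, ys))
    slope = sxy / sxx
    return slope, y_mean - slope * t_mean


def frft_order_from_chirp_rate(chirp_rate_hz_per_s, n_samples, fs):
    """FRFT order p in which a linear chirp of this rate is most compact.

    Assumption: dimensionless chirp rate k * N / fs**2. With this
    convention p = 1 is the ordinary Fourier transform (zero chirp rate).
    Orders p and p - 2 give the same magnitude spectrum, so other
    references may quote the result shifted by 2.
    """
    k_norm = chirp_rate_hz_per_s * n_samples / fs ** 2
    return 1.0 + (2.0 / math.pi) * math.atan(k_norm)


def adaptive_orders(inst_freq_hz, fs, seg_len):
    """One FRFT order per fixed-length segment of an IF track.

    Each segment is fitted with a straight line; the slope (Hz/s) is
    treated as the local chirp rate. Trailing samples that do not fill
    a whole segment are ignored.
    """
    ts = [i / fs for i in range(seg_len)]
    orders = []
    for start in range(0, len(inst_freq_hz) - seg_len + 1, seg_len):
        seg = inst_freq_hz[start:start + seg_len]
        slope, _ = fit_line(ts, seg)
        orders.append(frft_order_from_chirp_rate(slope, seg_len, fs))
    return orders


# Synthetic IF track: a flat 500 Hz segment, then one rising at 4000 Hz/s.
fs = 8000
seg_len = 200
flat = [500.0] * seg_len
rising = [500.0 + 4000.0 * (i / fs) for i in range(seg_len)]
orders = adaptive_orders(flat + rising, fs, seg_len)
```

In this sketch a segment with constant IF maps to order 1, so the analysis falls back to the ordinary Fourier transform, while a rising or falling IF moves the order above or below 1.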
Keywords
AM-FM model; adaptive; fractional Fourier transform; instantaneous frequency
fLanguage
English
Publisher
iet
Conference_Titel
Automatic Control and Artificial Intelligence (ACAI 2012), International Conference on
Conference_Location
Xiamen
Electronic_ISBN
978-1-84919-537-9
Type
conf
DOI
10.1049/cp.2012.0992
Filename
6492599