Title :
Adaptive order of fractional Fourier transform for whispered speaker identification
Author :
Qian Xiaohong ; Zhao Heming
Author_Institution :
School of Electronic and Information Engineering, Soochow University, Suzhou215006, China
Abstract :
A method widely used in speech signal analysis is based on short-time Fourier transform (STFT), but STFT only provides “average” characteristics of a signal, which can´t depict the refined structure of speech. Therefore, a new speech analysis tool called fractional Fourier transform (FRFT) is introduced into this article. The transform orders for FRFT are adaptively set according to piecewise linear fitting of instantaneous frequency (IF), which is based on AM-FM model theory of speech production. Then we present a kind of feature, namely, adaptive fractional Fourier transform cepstral coefficients (A-FRCC). The proposed speech parameters have been applied for whispered speaker identification. Experimental results show that in the test condition of mismatched channels, the new features can observe more sophisticated structure of speech and more personalized of speakers, at the same time, effectively improve the recognition rate and robustness, comparing with MFCC.
Keywords :
AM-FM model; adaptive; fractional Fourier transform; instantaneous frequency;
Conference_Titel :
Automatic Control and Artificial Intelligence (ACAI 2012), International Conference on
Conference_Location :
Xiamen
Electronic_ISBN :
978-1-84919-537-9
DOI :
10.1049/cp.2012.0992