Adaptive order of fractional Fourier transform for whispered speaker identification

Author

Qian Xiaohong ; Zhao Heming

Author_Institution

School of Electronic and Information Engineering, Soochow University, Suzhou 215006, China

fYear

2012

fDate

3-5 March 2012

Firstpage

363

Lastpage

366

Abstract

A method widely used in speech signal analysis is the short-time Fourier transform (STFT), but the STFT provides only "average" characteristics of a signal and cannot depict the fine structure of speech. Therefore, a speech analysis tool, the fractional Fourier transform (FRFT), is introduced in this paper. The FRFT transform orders are set adaptively according to a piecewise linear fit of the instantaneous frequency (IF), which is based on the AM-FM model of speech production. We then present a feature, namely adaptive fractional Fourier transform cepstral coefficients (A-FRCC). The proposed speech parameters are applied to whispered speaker identification. Experimental results show that, under mismatched-channel test conditions, the new features capture finer speech structure and more speaker-specific characteristics than MFCC, and they improve both recognition rate and robustness.
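
The order-selection idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no formulas, so the function names, the Hilbert-style IF estimate, the single line fitted per frame (rather than a full piecewise fit), and the mapping from chirp rate to order (cot α = -μ with μ = rate · N / fs², p = 2α/π, whose sign depends on the FRFT kernel convention) are all assumptions made for illustration.

```python
import numpy as np

def analytic_signal(x):
    """Analytic signal built with the FFT (one-sided spectrum)."""
    n = len(x)
    spectrum = np.fft.fft(x)
    weights = np.zeros(n)
    weights[0] = 1.0
    if n % 2 == 0:
        weights[n // 2] = 1.0
        weights[1:n // 2] = 2.0
    else:
        weights[1:(n + 1) // 2] = 2.0
    return np.fft.ifft(spectrum * weights)

def instantaneous_frequency(x, fs):
    """Instantaneous frequency in Hz from the unwrapped analytic phase."""
    phase = np.unwrap(np.angle(analytic_signal(x)))
    return np.diff(phase) * fs / (2.0 * np.pi)

def chirp_rate(x, fs, trim=0.1):
    """Slope (Hz/s) of a straight line fitted to the IF of one frame.

    The outer `trim` fraction of samples is dropped at each end because
    the FFT-based analytic signal is unreliable near frame edges.
    """
    inst_f = instantaneous_frequency(x, fs)
    k = int(trim * len(inst_f))
    inst_f = inst_f[k:len(inst_f) - k]
    t = (np.arange(len(inst_f)) + k) / fs
    slope, _intercept = np.polyfit(t, inst_f, 1)
    return slope

def frft_order(rate, n, fs):
    """Assumed mapping from chirp rate to FRFT order p = 2*alpha/pi.

    Uses cot(alpha) = -mu with the dimensionless rate mu = rate*n/fs**2.
    A zero rate gives p = 1, the ordinary Fourier transform.
    """
    mu = rate * n / fs**2
    alpha = np.arctan2(1.0, -mu)  # arccot(-mu), taken in (0, pi)
    return 2.0 * alpha / np.pi

# Hypothetical frame: a linear chirp starting at 500 Hz, rising at 2000 Hz/s.
fs, n = 8000, 1024
t = np.arange(n) / fs
frame = np.cos(2.0 * np.pi * (500.0 * t + 0.5 * 2000.0 * t**2))
rate = chirp_rate(frame, fs)
order = frft_order(rate, n, fs)
```

Under these assumptions, a frame whose IF is flat maps to order 1, and frames with rising or falling IF move the order away from 1. A piecewise version would repeat the fit on each segment of the IF track.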

Keywords

AM-FM model; adaptive; fractional Fourier transform; instantaneous frequency;

fLanguage

English

Publisher

IET

Conference_Titel

Automatic Control and Artificial Intelligence (ACAI 2012), International Conference on

Conference_Location

Xiamen

Electronic_ISBN

978-1-84919-537-9

Type

conf

DOI

10.1049/cp.2012.0992

Filename

6492599