DocumentCode
294546
Title
Pole-filtered cepstral mean subtraction
Author
Naik, Devang
Author_Institution
CAIP Center, Rutgers Univ., Piscataway, NJ, USA
Volume
1
fYear
1995
fDate
9-12 May 1995
Firstpage
157
Abstract
The paper introduces a new methodology to remove the residual effects of speech from the cepstral mean used for channel normalization. The approach is based on filtering the eigenmodes of speech that are more susceptible to convolutional distortions caused by transmission channels. The filtering of linear prediction (LP) poles and their corresponding eigenmodes for a speech segment are investigated when there is a channel mismatch for speaker identification systems. An algorithm based on pole-filtering has been developed to improve the commonly employed cepstral mean subtraction. Experiments are presented in speaker identification using speech in the TIMIT database and on the San Diego portion of the KING database. The new technique is shown to offer improved recognition accuracy under cross channel scenarios when compared to conventional methods
Keywords
cepstral analysis; eigenvalues and eigenfunctions; filtering theory; poles and zeros; prediction theory; speaker recognition; telecommunication channels; KING database; TIMIT database; channel mismatch; channel normalization; convolutional distortions; cross channel scenarios; eigenmodes; linear prediction poles; pole-filtered cepstral mean subtraction; recognition accuracy; speaker identification systems; speech segment; transmission channel; Cepstral analysis; Cepstrum; Collision mitigation; Databases; Filtering; Linear systems; Microphones; Nonlinear filters; Speaker recognition; Speech analysis; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location
Detroit, MI
ISSN
1520-6149
Print_ISBN
0-7803-2431-5
Type
conf
DOI
10.1109/ICASSP.1995.479388
Filename
479388
Link To Document