DocumentCode :
3163899
Title :
Robust block-based clustering and identification of autoregressive speech parameters based on dynamic state tracking
Author :
Chen, Ruofei ; Chan, Cheung-Fat
Author_Institution :
Dept. of Electron. Eng., City Univ. of Hong Kong, Kowloon, China
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
4469
Lastpage :
4472
Abstract :
In this paper, we propose two block-based clustering and identification algorithms that contribute to robust estimation of autoregressive (AR) speech parameters in noisy environments. Motivated by the fact that the evolution pattern of speech dynamics could be an observable feature that are retained in a series of noisy observations, a dynamic state tracking scheme based on Kalman filter is incorporated to utilize this additional trajectory information in block-based AR codebook design. The proposed algorithm is devised in a sense that AR blocks with similar clean line spectrum frequency trajectories as well as noisy-to-clean mappings are clustered offline and identified online. It is compared with conventional vector quantization based approaches that directly minimize a distortion between AR parameters. Through objective assessments based on mean square error and log-spectral distance, it is demonstrated that the proposed algorithm achieves significant improvement over conventional methods in various conditions.
Keywords :
Kalman filters; autoregressive processes; estimation theory; speech processing; vector quantisation; AR blocks; AR parameters; Kalman filter; autoregressive speech parameters; block-based AR codebook design; clean line spectrum frequency trajectories; dynamic state tracking; evolution pattern; identification algorithms; log-spectral distance; mean square error; noisy environments; noisy observations; noisy-to-clean mappings; robust block-based clustering; robust estimation; speech dynamics; trajectory information; vector quantization; Algorithm design and analysis; Clustering algorithms; Distortion measurement; Kalman filters; Noise; Noise measurement; Speech; Kalman filter; autoregressive model; clustering; vector quantization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6288912
Filename :
6288912
Link To Document :
بازگشت