DocumentCode :
3151999
Title :
Noise-robust dynamic time warping using PLCA features
Author :
King, Brian ; Smaragdis, Paris ; Mysore, Gautham J.
Author_Institution :
Dept. of Electr. Eng., Univ. of Washington, Seattle, WA, USA
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
1973
Lastpage :
1976
Abstract :
Conventional speech features, such as mel-frequency cepstral coefficients, tend to perform well in template matching systems, such as dynamic time warping, in low noise conditions. However, they tend to degrade in noisy environments. We propose a method of calculating features using the probabilistic latent component analysis (PLCA) framework. This framework models the speech and noise separately, leading to higher performance in noisy conditions than conventional methods. In this work, we compare our PLCA-based features with conventional features on the task of aligning a high-fidelity speech recording to a noisy speech recording, a scenario common in automatic dialogue replacement.
Keywords :
feature extraction; pattern matching; principal component analysis; speech processing; PLCA features; automatic dialogue replacement; high-fidelity speech recording; low noise conditions; noise-robust dynamic time warping; noisy environment degradation; noisy speech recording; probabilistic latent component analysis framework; speech features; template matching systems; Mel frequency cepstral coefficient; Noise; Noise measurement; Probabilistic logic; Spectrogram; Speech; Vectors; Automatic Dialogue Replacement; Dynamic Time Warping; Probabilistic Latent Component Analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6288293
Filename :
6288293
Link To Document :
بازگشت