Title :
A new independent component analysis for speech recognition and separation
Author :
Chien, Jen-Tzung ; Chen, Bo-Cheng
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan
fDate :
7/1/2006 12:00:00 AM
Abstract :
This paper presents a novel nonparametric likelihood ratio (NLR) objective function for independent component analysis (ICA). This function is derived through the statistical hypothesis test of independence of random observations. A likelihood ratio function is developed to measure the confidence toward independence. We accordingly estimate the demixing matrix by maximizing the likelihood ratio function and apply it to transform data into independent component space. Conventionally, the test of independence was established assuming data distributions being Gaussian, which is improper to realize ICA. To avoid assuming Gaussianity in hypothesis testing, we propose a nonparametric approach where the distributions of random variables are calculated using kernel density functions. A new ICA is then fulfilled through the NLR objective function. Interestingly, we apply the proposed NLR-ICA algorithm for unsupervised learning of unknown pronunciation variations. The clusters of speech hidden Markov models are estimated to characterize multiple pronunciations of subword units for robust speech recognition. Also, the NLR-ICA is applied to separate the linear mixture of speech and audio signals. In the experiments, NLR-ICA achieves better speech recognition performance compared to parametric and nonparametric minimum mutual information ICA
Keywords :
Gaussian distribution; hidden Markov models; independent component analysis; maximum likelihood estimation; speech recognition; statistical testing; unsupervised learning; Gaussian data distribution; ICA; audio signals; demixing matrix; hidden Markov models; independent component analysis; kernel density functions; multiple pronunciations; nonparametric likelihood ratio; speech recognition; speech separation; speech signals; statistical hypothesis test; unsupervised learning; Clustering algorithms; Density functional theory; Gaussian distribution; Hidden Markov models; Independent component analysis; Kernel; Random variables; Speech recognition; Testing; Unsupervised learning; Acoustic modeling; blind source separation (BSS); independent component analysis (ICA); nonparametric likelihood ratio (NLR); pronunciation variation; speech recognition; unsupervised learning;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TSA.2005.858061