DocumentCode :
980863
Title :
A new independent component analysis for speech recognition and separation
Author :
Chien, Jen-Tzung ; Chen, Bo-Cheng
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan
Volume :
14
Issue :
4
fYear :
2006
fDate :
7/1/2006 12:00:00 AM
Firstpage :
1245
Lastpage :
1254
Abstract :
This paper presents a novel nonparametric likelihood ratio (NLR) objective function for independent component analysis (ICA). This function is derived through the statistical hypothesis test of independence of random observations. A likelihood ratio function is developed to measure the confidence toward independence. We accordingly estimate the demixing matrix by maximizing the likelihood ratio function and apply it to transform data into independent component space. Conventionally, the test of independence was established assuming data distributions being Gaussian, which is improper to realize ICA. To avoid assuming Gaussianity in hypothesis testing, we propose a nonparametric approach where the distributions of random variables are calculated using kernel density functions. A new ICA is then fulfilled through the NLR objective function. Interestingly, we apply the proposed NLR-ICA algorithm for unsupervised learning of unknown pronunciation variations. The clusters of speech hidden Markov models are estimated to characterize multiple pronunciations of subword units for robust speech recognition. Also, the NLR-ICA is applied to separate the linear mixture of speech and audio signals. In the experiments, NLR-ICA achieves better speech recognition performance compared to parametric and nonparametric minimum mutual information ICA
Keywords :
Gaussian distribution; hidden Markov models; independent component analysis; maximum likelihood estimation; speech recognition; statistical testing; unsupervised learning; Gaussian data distribution; ICA; audio signals; demixing matrix; hidden Markov models; independent component analysis; kernel density functions; multiple pronunciations; nonparametric likelihood ratio; speech recognition; speech separation; speech signals; statistical hypothesis test; unsupervised learning; Clustering algorithms; Density functional theory; Gaussian distribution; Hidden Markov models; Independent component analysis; Kernel; Random variables; Speech recognition; Testing; Unsupervised learning; Acoustic modeling; blind source separation (BSS); independent component analysis (ICA); nonparametric likelihood ratio (NLR); pronunciation variation; speech recognition; unsupervised learning;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TSA.2005.858061
Filename :
1643652
Link To Document :
بازگشت