DocumentCode
980863
Title
A new independent component analysis for speech recognition and separation
Author
Chien, Jen-Tzung ; Chen, Bo-Cheng
Author_Institution
Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan
Volume
14
Issue
4
fYear
2006
fDate
7/1/2006 12:00:00 AM
Firstpage
1245
Lastpage
1254
Abstract
This paper presents a novel nonparametric likelihood ratio (NLR) objective function for independent component analysis (ICA). This function is derived through the statistical hypothesis test of independence of random observations. A likelihood ratio function is developed to measure the confidence toward independence. We accordingly estimate the demixing matrix by maximizing the likelihood ratio function and apply it to transform data into independent component space. Conventionally, the test of independence was established assuming data distributions being Gaussian, which is improper to realize ICA. To avoid assuming Gaussianity in hypothesis testing, we propose a nonparametric approach where the distributions of random variables are calculated using kernel density functions. A new ICA is then fulfilled through the NLR objective function. Interestingly, we apply the proposed NLR-ICA algorithm for unsupervised learning of unknown pronunciation variations. The clusters of speech hidden Markov models are estimated to characterize multiple pronunciations of subword units for robust speech recognition. Also, the NLR-ICA is applied to separate the linear mixture of speech and audio signals. In the experiments, NLR-ICA achieves better speech recognition performance compared to parametric and nonparametric minimum mutual information ICA
Keywords
Gaussian distribution; hidden Markov models; independent component analysis; maximum likelihood estimation; speech recognition; statistical testing; unsupervised learning; Gaussian data distribution; ICA; audio signals; demixing matrix; hidden Markov models; independent component analysis; kernel density functions; multiple pronunciations; nonparametric likelihood ratio; speech recognition; speech separation; speech signals; statistical hypothesis test; unsupervised learning; Clustering algorithms; Density functional theory; Gaussian distribution; Hidden Markov models; Independent component analysis; Kernel; Random variables; Speech recognition; Testing; Unsupervised learning; Acoustic modeling; blind source separation (BSS); independent component analysis (ICA); nonparametric likelihood ratio (NLR); pronunciation variation; speech recognition; unsupervised learning;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TSA.2005.858061
Filename
1643652
Link To Document