Title :
Efficient manifold learning for speech recognition using locality sensitive hashing
Author :
Tomar, Vikrant Singh ; Rose, Richard C.
Author_Institution :
Dept. of Electr. & Comput. Eng., McGill Univ., Montreal, QC, Canada
Abstract :
This paper considers the application of a random projections based hashing scheme, known as locality sensitive hashing (LSH), for fast computation of neighborhood graphs in manifold learning based feature space transformations in automatic speech recognition (ASR). Discriminative manifold learning based feature transformations have already been found to provide significant improvements in ASR performance. The motivation of this work is the fact that the high computational complexity of these techniques has prevented their application to very large speech corpora. The performance of this integrated system is evaluated both in terms of computational complexity and ASR word recognition accuracy. Further comparisons of ASR performance with the well-known linear discriminant analysis are provided. These results demonstrate that LSH provides the much needed speed boost to manifold learning techniques with minimal impact on their ASR performance, thus enabling the application of these algorithms to large speech databases.
Keywords :
graph theory; learning (artificial intelligence); random processes; speech recognition; ASR word recognition; LSH; automatic speech recognition; computational complexity; discriminative manifold learning; feature space transformation; large speech corpora; linear discriminant analysis; locality sensitive hashing; neighborhood graph; random projection; Approximation algorithms; Hidden Markov models; Manifolds; Noise; Speech; Training; Vectors; Locality sensitive hashing; dimensionality reduction; locality preserving discriminant analysis; manifold learning; speech recognition;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639018