Title :
Exemplar-based Sparse Representation phone identification features
Author :
Sainath, Tara N. ; Nahamoo, David ; Ramabhadran, Bhuvana ; Kanevsky, Dimitri ; Goel, Vaibhava ; Shah, Parikshit M.
Author_Institution :
IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
Exemplar-based techniques, such as k-nearest neighbors (kNNs) and Sparse Representations (SRs), can be used to model a test sample from a few training points in a dictionary set. In past work, we have shown that using a SR approach for phonetic classification allows for a higher accuracy than other classification techniques. These phones are the basic units of speech to be recognized. Motivated by this result, we create a new dictionary which is a function of the phonetic labels of the original dictionary. The SR method now selects relevant samples from this new dictionary to create a new feature representation of the test sample, where the new feature is better linked to the actual units to be recognized. We will refer to these new features as Spif. We present results using these new Spif features in a Hidden Markov Model (HMM) framework for speech recognition. We find that the Spif features allow for a 2.9% relative reduction in Phonetic Error Rate (PER) on the TIMIT phonetic recognition task. Furthermore, we find that the Spif features allow for a 4.8% relative improvement in Word Error Rate (WER) on a large vocabulary 50 hour Broadcast News task.
Keywords :
hidden Markov models; speech recognition; HMM framework; PER; SR approach; TIMIT phonetic recognition task; WER; exemplar-based sparse representation phone identification features; hidden Markov model framework; k-nearest neighbors; kNN; phonetic classification; phonetic error rate; sparse representation approach; speech recognition; word error rate; Accuracy; Dictionaries; Entropy; Hidden Markov models; Speech recognition; Strontium; Training; Sparse representations; speech recognition;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947352