Title :
Vocabulary selection for high performance speech recognition
Author_Institution :
Texas Instruments Inc.
Abstract :
The performance of a speech recognizer can be enhanced by carefully selecting the words that form the vocabulary, for example, by maximizing the dissimilarity of words. This paper presents a method to select a set of words from a given large vocabulary, such that the minimum of the distances between all pairs of words (minimum interset distance) is maximum. To achieve speaker independence, and to keep the task manageable, the phonemic content of words is compared by a dynamic programming method. The selection of words, based on a word-distance matrix generated from the dynamic programming, consists of two steps: a vocabulary buildup step and an optimization step. As an example the method is tested on the vocabulary of letters of the English alphabet.
Keywords :
Computer science; Content management; Dynamic programming; Hamming distance; Instruments; Laboratories; Speech enhancement; Speech recognition; Testing; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '83.
DOI :
10.1109/ICASSP.1983.1172083