Title :
Recognition of isolated-word sentences from a 5000-word vocabulary office correspondence task
Author :
Bahl, L.A. ; Cole, A.G. ; Jelinek, F. ; Mercer, R.L. ; Nadas, A. ; Nahamoo, D. ; Picheny, M.A.
Author_Institution :
IBM T. J. Watson Research Centre, Yorktown Height, NY
Abstract :
Recognition results on sentences from a 5000-word vocabulary drawn from office correspondence are presented. The sentences were read with pauses between the words. The vocabulary comprises the 5000 most frequently occurring words in a data-base of 14,000 office memoranda and letters, and has a perplexity of 90, measured from a trigram language model. Experiments were carried out with 6 speakers (4 male, 2 female) in an office environment using a close-talking microphone. The recognition system was automatically trained to each speaker by having the speaker read 100 typical sentences from the office correspondence data-base. Recognition was carried out for each speaker on 20 test sentences, consisting of 299 words. The recognition rate (% words correct) averaged across the 6 speakers was 94.5%.
Keywords :
Databases; Decoding; Electronic mail; Humans; Loudspeakers; Microphones; Speech recognition; Statistics; Testing; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '83.
DOI :
10.1109/ICASSP.1983.1172161