Title :
Speaker-independent phonetic classification in continuous English letters
Author :
Janssen, Rik D T ; Fanty, Mark ; Cole, Ronald A.
Author_Institution :
Dept. of Comput. Sci. & Eng., Oregon Graduate Inst., Beaverton, OR, USA
Abstract :
A phonetic front-end for speaker-independent recognition of continuous letter strings is described. A feedforward neutral network is trained to classify 3 msec speech frames as one of the 30 phonemes in the English alphabet. Phonetic context is used in two ways: first, by providing spectral and waveform information before and after the frame to be classified, and second, by a second-pass network that uses both acoustic features and the phonetic outputs of the first-pass network. This use of context reduced the error rate by 50%. The effectiveness of the DFT and the more compact PLP (perceptual linear predictive) analysis is compared, and several other features, such as zero crossing rate, are investigated. A frame-based phonetic classification performance of 75.7% was achieved
Keywords :
neural nets; speech recognition; acoustic features; continuous English letters; feedforward neutral network; perceptual linear predictive; phonetic front-end; phonetic outputs; speaker independent phonetic classification; spectral information; waveform information; zero crossing rate; Acoustic waves; Computer science; Databases; Error analysis; Feedforward neural networks; Information retrieval; Laboratories; Linear predictive coding; Neural networks; Speech recognition;
Conference_Titel :
Neural Networks, 1991., IJCNN-91-Seattle International Joint Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7803-0164-1
DOI :
10.1109/IJCNN.1991.155437