Title :
A neural network speech interface with the DOS commander
Author_Institution :
Dept. of Electron. & Commun., Helwan Univ., Cairo, Egypt
Abstract :
A user-friendly interface with the operating system requires interfacing speech data entry with the DOS commander. This intelligent interface relieves the user of the tedious typing of DOS commands. The task requires user-independent recognition of the spoken commands and interfacing them with the operating system shell. Traditional speech recognition techniques are not capable of extracting significant features and are not able to generalize. These problems are conveniently solved with neural network techniques. The paper introduces the application of a three-layer neural network to the classification of DOS commands. A supervised learning method based on the back propagation technique is described. A training set, including different patterns for each command recorded under different conditions, is first preprocessed and then spotted to guarantee invariance under translation in time. Since the learning phase of a neural network based classifier is a time-consuming task, it was necessary to use a data reduction technique that preserves the command information. A linear prediction technique is used to achieve a high degree of data reduction. Only nine linear prediction coefficients are shown to be sufficient for discrimination of DOS commands. The experimental results for the neural classifier indicated a high percentage of correct classification for a training set including twelve DOS commands. The most effective number of units in the hidden layer and the value of the learning rate were determined through extensive experimental work.
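The pipeline outlined in the abstract (nine linear prediction coefficients fed to a three-layer network trained by back propagation, with twelve output classes) might be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the autocorrelation method with the Levinson-Durbin recursion, the sigmoid activations, the squared-error loss, the hidden-layer size of 8, the learning rate of 0.5, and the names `lpc_coefficients` and `ThreeLayerNet` are all hypothetical choices, since the abstract does not state them. The preprocessing and spotting steps are not shown.

```python
import numpy as np


def lpc_coefficients(frame, order=9):
    """Return `order` LPC coefficients a[1..order] for one frame.

    Assumed method: autocorrelation analysis solved with the
    Levinson-Durbin recursion, using the convention
    e[n] = x[n] + sum_j a[j] * x[n - j].
    """
    frame = np.asarray(frame, dtype=float)
    n = len(frame)
    # Autocorrelation at lags 0..order.
    r = np.array([np.dot(frame[: n - k], frame[k:]) for k in range(order + 1)])
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0:  # degenerate frame (e.g. silence): stop early
            break
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err  # reflection coefficient
        prev = a.copy()
        for j in range(1, i):
            a[j] = prev[j] + k * prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
    return a[1:]


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


class ThreeLayerNet:
    """Input -> hidden -> output network trained by back propagation.

    n_hidden and lr are placeholder values (assumptions); the abstract
    only says they were chosen experimentally.
    """

    def __init__(self, n_in=9, n_hidden=8, n_out=12, lr=0.5, seed=0):
        rng = np.random.default_rng(seed)
        self.w1 = rng.normal(0.0, 0.5, (n_in, n_hidden))
        self.b1 = np.zeros(n_hidden)
        self.w2 = rng.normal(0.0, 0.5, (n_hidden, n_out))
        self.b2 = np.zeros(n_out)
        self.lr = lr

    def forward(self, x):
        self.h = sigmoid(x @ self.w1 + self.b1)
        self.y = sigmoid(self.h @ self.w2 + self.b2)
        return self.y

    def train_step(self, x, t):
        """One full-batch gradient step; returns the loss before the update."""
        y = self.forward(x)
        # Error terms for squared error with sigmoid units.
        d_out = (y - t) * y * (1.0 - y)
        d_hid = (d_out @ self.w2.T) * self.h * (1.0 - self.h)
        m = len(x)
        self.w2 -= self.lr * (self.h.T @ d_out) / m
        self.b2 -= self.lr * d_out.mean(axis=0)
        self.w1 -= self.lr * (x.T @ d_hid) / m
        self.b1 -= self.lr * d_hid.mean(axis=0)
        return float(np.mean((y - t) ** 2))
```

In use, each spotted command recording would be reduced to a nine-element feature vector with `lpc_coefficients`, and the resulting vectors, paired with one-hot labels for the twelve commands, would be passed repeatedly to `train_step`. The recognized command is then the index of the largest output of `forward`.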
Keywords :
backpropagation; feedforward neural nets; interactive systems; linear predictive coding; natural language interfaces; operating systems (computers); speech recognition; DOS commander; back propagation technique; classification; data reduction technique; intelligent interface; linear prediction technique; neural network based classifier; neural network speech interface; neural network techniques; operating system; speech data entry; supervised learning method; three layer neural network; training set; user friendly interface; Artificial neural networks; Data mining; Multilayer perceptrons; Nearest neighbor searches; Neural networks; Neurofeedback; Operating systems; Speech recognition; Supervised learning; Vocabulary;
Conference_Title :
Electrical and Computer Engineering, 1993. Canadian Conference on
Conference_Location :
Vancouver, BC
Print_ISBN :
0-7803-2416-1
DOI :
10.1109/CCECE.1993.332226