DocumentCode :
1463160
Title :
Enhancement of Speech Recognitions for Control Automation Using an Intelligent Particle Swarm Optimization
Author :
Chan, Kit Yan ; Yiu, Cedric K F ; Dillon, Tharam S. ; Nordholm, Sven ; Ling, Sai Ho
Author_Institution :
Dept. of Electr. & Comput. Eng., Curtin Univ., Perth, WA, Australia
Volume :
8
Issue :
4
fYear :
2012
Firstpage :
869
Lastpage :
879
Abstract :
For over two decades, speech control mechanisms have been widely applied in manufacturing systems such as factory automation, warehouse automation, and industrial robotic control for over two decades. To implement speech controls, a commercial speech recognizer is used as the interface between users and the automation system. However, users´ commands are often contaminated by environmental noise which degrades the performance of speech recognition for controlling automation systems. This paper presents a multichannel signal enhancement methodology to improve the performance of commercial speech recognizers. The proposed methodology aims to optimize speech recognition accuracy of a commercial speech recognizer in a noisy environment based on a beamformer, which is developed by an intelligent particle swarm optimization. It overcomes the limitation of the existing signal enhancement approaches whereby the parameters inside commercial speech recognizers are required to be tuned, which is impossible in a real-world situation. Also, it overcomes the limitation of the existing optimization algorithm including gradient descent methods, genetic algorithms and classical particle swarm optimization that are unlikely to develop optimal beamformers for maximizing speech recognition accuracy. The performance of the proposed methodology was evaluated by developing beamformers for a commercial speech recognizer, which was implemented on warehouse automation. Results indicate a significant improvement regarding speech recognition accuracy.
Keywords :
array signal processing; factory automation; genetic algorithms; gradient methods; particle swarm optimisation; speech enhancement; speech recognition; beamformer; commercial speech recognizer; control automation; environmental noise; genetic algorithms; gradient descent methods; intelligent particle swarm optimization; manufacturing systems; multichannel signal enhancement methodology; speech control mechanisms; speech recognition enhancement; Automation; Fuzzy systems; Noise measurement; Optimization; Particle swarm optimization; Speech recognition; Beamformer; intelligent fuzzy systems; particle swarm optimization; speech control; speech recognition;
fLanguage :
English
Journal_Title :
Industrial Informatics, IEEE Transactions on
Publisher :
ieee
ISSN :
1551-3203
Type :
jour
DOI :
10.1109/TII.2012.2187910
Filename :
6164262
Link To Document :
بازگشت