Title :
Reinforcement learning to drive a car by pattern matching
Author :
Krodel, Michael ; Kuhnert, Klaus-Dieter
Author_Institution :
Inst. for Real-Time-Syst., Siegen Univ., Germany
Abstract :
In the paper the actual state of a system is presented that is aimed to learn driving autonomously different vehicles on different courses exclusively by visual input. The subsystem intelligent image processing allows, as required, to locate the road mark edges of each single image. A trained search algorithm allows optimal search speed and high recognition rate and consequently efficiently converts road mark edges into abstract complete situation descriptions (ACSDs) capturing in a storage limited way the current situation the vehicle is in. The subsystem pattern matching successfully retrieves similar situations to the current one based on a pattern matching algorithm. The pattern matching algorithm used in this subsystem is optimised for search speed on one hand and usage for road situations on the other hand. The subsystem reinforcement learning is still under implementation. However, a simple approach implemented so far allows already autonomous driving on a learning-by-knowledge-transfer basis promising further positive results in the area of autonomous driving based on pattern matching.
Keywords :
image matching; learning (artificial intelligence); mobile robots; road vehicles; abstract complete situation descriptions; autonomous driving; driving learning; high recognition rate; intelligent image processing subsystem; learning-by-knowledge-transfer basis; optimal search speed; pattern matching; reinforcement learning; road mark edges location; search speed optimisation; trained search algorithm; visual input; Image processing; Layout; Learning; Neural networks; Object oriented modeling; Pattern matching; Remotely operated vehicles; Roads; Streaming media; Vehicle driving;
Conference_Titel :
IECON 02 [Industrial Electronics Society, IEEE 2002 28th Annual Conference of the]
Print_ISBN :
0-7803-7474-6
DOI :
10.1109/IECON.2002.1185231