Title :
Training with heterogeneous data
Author :
Drakopoulos, J.A.
Author_Institution :
Microsoft Corp., Redmond, WA, USA
fDate :
31 July-4 Aug. 2005
Abstract :
Data pruning and ordered training are two methods used to train a learner with heterogeneous data. The former is a typical procedure that attempts to factor out noise from the data; the latter is a novel method that partitions the data into a number of categories and assigns training times to those assuming that data size and training time have a polynomial relation. In its current form, ordered training is an approximate and a priori data-emphasizing method. Both methods have been applied to a time-delay neural network - which is one of the main learners in Microsoft´s Tablet PC handwriting recognition system. Their effect on the learner is presented in this paper. The handwriting data and the chosen language are Italian.
Keywords :
data handling; handwriting recognition; learning (artificial intelligence); neural nets; data emphasizing; data pruning; heterogeneous data; ordered training; time-delay neural network; training schedule; Boosting; Dictionaries; Electronic mail; Handwriting recognition; Ink; Neural networks; Noise level; Polynomials; Scheduling; Telephony;
Conference_Titel :
Neural Networks, 2005. IJCNN '05. Proceedings. 2005 IEEE International Joint Conference on
Print_ISBN :
0-7803-9048-2
DOI :
10.1109/IJCNN.2005.1555811