Title : 
Training with heterogeneous data
         
        
            Author : 
Drakopoulos, J.A.
         
        
            Author_Institution : 
Microsoft Corp., Redmond, WA, USA
         
        
        
        
            fDate : 
31 July-4 Aug. 2005
         
        
        
            Abstract : 
Data pruning and ordered training are two methods used to train a learner with heterogeneous data. The former is a typical procedure that attempts to factor out noise from the data; the latter is a novel method that partitions the data into a number of categories and assigns training times to those assuming that data size and training time have a polynomial relation. In its current form, ordered training is an approximate and a priori data-emphasizing method. Both methods have been applied to a time-delay neural network - which is one of the main learners in Microsoft´s Tablet PC handwriting recognition system. Their effect on the learner is presented in this paper. The handwriting data and the chosen language are Italian.
         
        
            Keywords : 
data handling; handwriting recognition; learning (artificial intelligence); neural nets; data emphasizing; data pruning; heterogeneous data; ordered training; time-delay neural network; training schedule; Boosting; Dictionaries; Electronic mail; Handwriting recognition; Ink; Neural networks; Noise level; Polynomials; Scheduling; Telephony;
         
        
        
        
            Conference_Titel : 
Neural Networks, 2005. IJCNN '05. Proceedings. 2005 IEEE International Joint Conference on
         
        
            Print_ISBN : 
0-7803-9048-2
         
        
        
            DOI : 
10.1109/IJCNN.2005.1555811