Title : 
Using background knowledge to improve inductive learning of DNA sequences
         
        
            Author : 
Hirsh, Haym ; Noordewier, Michiel
         
        
            Author_Institution : 
Dept. of Comput. Sci., Rutgers Univ., New Brunswick, NJ, USA
         
        
        
        
        
        
            Abstract : 
Successful inductive learning requires that training data be expressed in a form where underlying regularities can be recognized by the learning system. Unfortunately, many applications of inductive learning-especially in the domain of molecular biology-have assumed that data are provided in a form already suitable for learning, whether or not such an assumption is actually justified. This paper describes the use of background knowledge of molecular biology to re-express data into a form more appropriate for learning. Our results show dramatic improvements in classification accuracy for two very different classes of DNA sequences using traditional “off-the-sheIf” decision-tree and neural-network inductive-learning methods
         
        
            Keywords : 
DNA; biology computing; inference mechanisms; learning (artificial intelligence); neural nets; pattern recognition; trees (mathematics); DNA sequences; background knowledge; classification accuracy; decision tree; inductive learning; molecular biology; neural network; training data; underlying regularities; Biological information theory; DNA; Data mining; Encoding; Polymers; Sampling methods; Sequences; Speech; Training data; US Department of Energy;
         
        
        
        
            Conference_Titel : 
Artificial Intelligence for Applications, 1994., Proceedings of the Tenth Conference on
         
        
            Conference_Location : 
San Antonia, TX
         
        
            Print_ISBN : 
0-8186-5550-X
         
        
        
            DOI : 
10.1109/CAIA.1994.323654