DocumentCode :
1612644
Title :
Finding alphabet indexing for decision trees over regular patterns: an approach to bioinformatical knowledge acquisition
Author :
Shimozono, Shinichi ; Shinohara, Ayumi ; Shinohara, Takeshi ; Miyano, Satoru ; Kuhara, Satoru ; Arikawa, Setsuo
Author_Institution :
Dept. of Control Eng. & Sci., Kyushu Inst. of Technol., Iizuka, Japan
fYear :
1993
Firstpage :
763
Abstract :
Considers a transformation from an alphabet to a smaller alphabet which does not lose any positive and negative information of the original examples. Such a transformation is called indexing. A method which exploits indexing by a local search technique for learning decision trees over regular patterns is proposed. From positive and negative examples, the system produces, as a hypothesis, an indexing-decision tree pair. The authors also report some experimental results obtained by this machine learning system on the following identification problems: transmembrane domains, and signal peptides. For transmembrane domains, the system discovered an indexing by two symbols and a decision tree with just three nodes that achieves 92% accuracy. The indexing was almost the same as that biased on the hydropathy index of Kyte and Doolittle (1982). For signal peptides, the system also found sufficiently good hypotheses.
Keywords :
biocybernetics; biology computing; biomembranes; decision theory; indexing; knowledge acquisition; molecular biophysics; pattern recognition; search problems; trees (mathematics); alphabet indexing; amino acid residues; bioinformatical knowledge acquisition; decision trees; hydropathy index; hypothesis; local search technique; machine learning system; regular patterns; signal peptides; transmembrane domains; Amino acids; Bioinformatics; Control engineering; Decision trees; Indexing; Information science; Knowledge acquisition; Learning systems; Peptides; Sequences; Signal processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
System Sciences, 1993, Proceeding of the Twenty-Sixth Hawaii International Conference on
Print_ISBN :
0-8186-3230-5
Type :
conf
DOI :
10.1109/HICSS.1993.270664
Filename :
270664
Link To Document :
بازگشت