DocumentCode
3395335
Title
Generating Patterns for Knowledge Discovery using First Principles Modeling of Activity
Author
Fournelle, Connie ; Tierno, Jose ; Stephenson, Thomas
Author_Institution
Intelligent Syst. Div., BAE Syst. Adv. Inf. Technol., Burlington, MA
fYear
2006
fDate
10-13 July 2006
Firstpage
1
Lastpage
5
Abstract
Knowledge discovery algorithms generate alerts to interesting activity by finding partial matches to user-specified patterns of interest. Several statistical models can be used to generate patterns based on training data. However, when training sets are not readily available, it is still possible to construct patterns from domain knowledge. To test the feasibility of such an approach, we examine a database of publications in biological sciences, BioBase, and attempt to predict whether or not two researchers who did not coauthor a paper in 1998-2002 will coauthor a paper in 2003. We constructed a set of four distinct patterns depicting scenarios in which a new coauthoring relationship might emerge. In each of the scenarios, we needed to identify individuals playing different roles within research groups, and we use the number of publications of individuals to predict those roles. We also use the 1998-2002 coauthor relationships to provide evidence of the associated research groups and collaborations. Our results on a test database show that this approach is feasible and competitive when compared to others that rely on more extensive statistical modeling.
Keywords
data mining; statistical analysis; first principles model; knowledge discovery algorithm; statistical models; Biological system modeling; Biology; Data mining; Databases; Information technology; Intelligent systems; Pattern analysis; Pattern matching; Testing; Training data; modeling; pattern based fusion;
fLanguage
English
Publisher
IEEE
Conference_Titel
Information Fusion, 2006 9th International Conference on
Conference_Location
Florence
Print_ISBN
1-4244-0953-5
Electronic_ISBN
0-9721844-6-5
Type
conf
DOI
10.1109/ICIF.2006.301657
Filename
4085943