DocumentCode :
1172971
Title :
The Babylon project: toward an extensible text-mining platform
Author :
Merrill, Gary H.
Volume :
5
Issue :
2
fYear :
2003
Firstpage :
23
Lastpage :
30
Abstract :
GlaxoSmithKline committed to adopting Verity, an indexing and search engine, to retrieve knowledge within the company intranet. Verity was attractive because it supported taxonomy-oriented browsing and concept-enhanced search. To tap the power in Verity´s features, GlaxoSmithKline´s Data Exploration Sciences began the Babylon project in late 2002 with two primary goals: develop a text-mining platform that would be the foundation for a variety of text-mining applications and would be extensible to a variety of domains; and develop one or more significant prototype applications on that platform. The first prototype application considered was to mine reports on adverse drug events to identify trends, drug and reaction events, or drug-drug interactions that might not be apparent in nontextual structured databases. The paper discusses the Babylon project.
Keywords :
biology computing; data mining; hypermedia markup languages; information retrieval; intranets; search engines; very large databases; Babylon project; Data Exploration Sciences; GlaxoSmithKline; Verity; XML topic maps; adverse drug events; concept-enhanced search; drug interactions; extensible text-mining platform; indexing; information retrieval; intranet; prototype application; search engine; taxonomy-oriented browsing; Genetics; Information retrieval; Knowledge management; Laboratories; Ontologies; Prototypes; Safety; Search engines; Taxonomy; Text mining;
fLanguage :
English
Journal_Title :
IT Professional
Publisher :
ieee
ISSN :
1520-9202
Type :
jour
DOI :
10.1109/MITP.2003.1191789
Filename :
1191789
Link To Document :
بازگشت