DocumentCode :
3717291
Title :
Agile text mining with Sherlok
Author :
Renaud Richardet;Jean-C?dric Chappelier;Shreejoy Tripathy;Sean Hill
Author_Institution :
Blue Brain Project, Brain Mind Institute, Ecole Polytechnique F?d?rale de Lausanne, Switzerland
fYear :
2015
Firstpage :
1479
Lastpage :
1484
Abstract :
The successful development of an intelligent text mining application requires the collaboration of two main stakeholders: subject matter experts and text miners. In this paper, we describe a new methodology, agile text mining to improve that collaboration. Agile text mining is characterized by short development cycles, frequent tasks redefinition and continuous performance monitoring through integration tests. We introduce Sherlok, a system supporting the development of agile text mining applications and present an application to extract mention of neurons from a very large corpus of scientific articles. The resulting code and models are publicly available.
Keywords :
"Text mining","Pipelines","Ontologies","Engines","Proteins","Collaboration"
Publisher :
ieee
Conference_Titel :
Big Data (Big Data), 2015 IEEE International Conference on
Type :
conf
DOI :
10.1109/BigData.2015.7363910
Filename :
7363910
Link To Document :
بازگشت