DocumentCode :
1694391
Title :
Revealing Trends Based on Defined Queries in Biological Publications Using Cosine Similarity
Author :
Mohammadzadeh, Hadi ; Paknia, Omid ; Schweiggert, Franz ; Gottron, Thomas
Author_Institution :
Inst. of Appl. Inf. Process., Univ. of Ulm, Ulm, Germany
fYear :
2012
Firstpage :
218
Lastpage :
222
Abstract :
Extracting valuable information in terms of number and content of published papers in any field of research will simplify decision making for future researches and investments. A novel and simple text mining approach, called TrendFinder, has been developed in this paper to reveal the content-based trends of expert-defined queries in selected biological published papers during the last five decades. So, in order to evaluate the results, three different data sets were collected and four vectors of selected keywords were considered as the four queries. "Title", "Published date" and the "Abstract" were downloaded for three series of journals namely, "Conservation Biology", "Ecology", and "The American Naturalist" as data sets, including total number of 19,010 papers. In order to show the trend between each query and the Abstract of each paper, Cosine similarity method was used by TrendFinder. Afterwards, three diagrams were demonstrated content-based trends of the four defined queries on the three provided data sets.
Keywords :
biology computing; content-based retrieval; data mining; decision making; electronic publishing; pattern matching; query processing; text analysis; vectors; Conservation Biology; Ecology; The American Naturalist; TrendFinder; abstract downloading; biological publications; content-based trends; cosine similarity; data sets; decision making; defined queries; expert-defined queries; keywords; published date downloading; published papers; text mining; title downloading; valuable information extraction; vector space model; Abstracts; Biodiversity; Environmental factors; Evolution (biology); Market research; Vectors; Cosine Similarity; Information Retrieval; Queries; Trends in Biological Publications; Vector Space Model;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Database and Expert Systems Applications (DEXA), 2012 23rd International Workshop on
Conference_Location :
Vienna
ISSN :
1529-4188
Print_ISBN :
978-1-4673-2621-6
Type :
conf
DOI :
10.1109/DEXA.2012.16
Filename :
6327429
Link To Document :
بازگشت