Title :
Exploration on Personal Database of CNKI Literature Based on C # Regular Expression
Author :
Zhang Lina ; Yang Bo
Author_Institution :
Jilin Agric. Univ., Changchun, China
Abstract :
This article describes the method that the CNKI literature in PDF format be extracted data. Obtain basic data according to downloaded to the local machine CNKI papers, builds a relational database of literature information, and establish the personal literature management system according to our requirements. Although CNKI archive does not provide access interfaces to other database, but most of the literature is PDF format, we can use C # language and the regular expression to extract data. In this paper, the literature to extract basic data, and import it into personal relational database approach to a certain amount of research and practice.
Keywords :
C listings; document image processing; literature; relational databases; C# language; C# regular expression; CNKI literature; PDF format; data extraction; literature information; personal database; personal literature management system; relational database; Abstracts; Computers; Data mining; Educational institutions; Portable document format; Relational databases; CNKI; basic data; personal literature management system; regular expressions;
Conference_Titel :
Instrumentation, Measurement, Computer, Communication and Control (IMCCC), 2013 Third International Conference on
Conference_Location :
Shenyang
DOI :
10.1109/IMCCC.2013.103