DocumentCode :
480163
Title :
Extracting Structured House Information from BBS Based on Prolog
Author :
Chen, Shengshuang ; Wen, Lijuan ; Xiao, Lihua ; Li, Shijun
Author_Institution :
Sch. of Sci., Wuhan Univ. of Technol., Wuhan
Volume :
4
fYear :
2008
fDate :
12-14 Dec. 2008
Firstpage :
500
Lastpage :
503
Abstract :
BBS provides users with a space of free communication and plentiful information resources. However, to gain manually useful information from constantly updated, huge and unstructured data is very difficult for users. This paper applies Prolog to BBS data mining, and builds a housing information mining system based on Prolog, which extracts structured house leasing information from the large number of unstructured text in BBS. Firstly, we develop a BBS crawler, which obtains relevant documents from specific BBS sites and stores them to local directory. Then, by analyzing all crawled unstructured documents, we extract useful information (e.g. building name, type and price etc.) and store them to a relational database, on which an interface provided for users to query. We experiment on a real-world BBS, the result shows that this Prolog programs can be applied to BBS data mining effectively.
Keywords :
PROLOG; data mining; information retrieval; BBS crawler; BBS data mining; Prolog; relational database; structured house information extraction; Computer science; Crawlers; Data mining; Data structures; Information analysis; Information resources; Logic programming; Relational databases; Space technology; Text categorization; Keywords-Bulletin Board System; Programming in Logic; crawler; data mining; house information;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science and Software Engineering, 2008 International Conference on
Conference_Location :
Wuhan, Hubei
Print_ISBN :
978-0-7695-3336-0
Type :
conf
DOI :
10.1109/CSSE.2008.1309
Filename :
4722667
Link To Document :
بازگشت