DocumentCode
480163
Title
Extracting Structured House Information from BBS Based on Prolog
Author
Chen, Shengshuang ; Wen, Lijuan ; Xiao, Lihua ; Li, Shijun
Author_Institution
Sch. of Sci., Wuhan Univ. of Technol., Wuhan
Volume
4
fYear
2008
fDate
12-14 Dec. 2008
Firstpage
500
Lastpage
503
Abstract
BBS provides users with a space of free communication and plentiful information resources. However, to gain manually useful information from constantly updated, huge and unstructured data is very difficult for users. This paper applies Prolog to BBS data mining, and builds a housing information mining system based on Prolog, which extracts structured house leasing information from the large number of unstructured text in BBS. Firstly, we develop a BBS crawler, which obtains relevant documents from specific BBS sites and stores them to local directory. Then, by analyzing all crawled unstructured documents, we extract useful information (e.g. building name, type and price etc.) and store them to a relational database, on which an interface provided for users to query. We experiment on a real-world BBS, the result shows that this Prolog programs can be applied to BBS data mining effectively.
Keywords
PROLOG; data mining; information retrieval; BBS crawler; BBS data mining; Prolog; relational database; structured house information extraction; Computer science; Crawlers; Data mining; Data structures; Information analysis; Information resources; Logic programming; Relational databases; Space technology; Text categorization; Keywords-Bulletin Board System; Programming in Logic; crawler; data mining; house information;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science and Software Engineering, 2008 International Conference on
Conference_Location
Wuhan, Hubei
Print_ISBN
978-0-7695-3336-0
Type
conf
DOI
10.1109/CSSE.2008.1309
Filename
4722667
Link To Document