• DocumentCode
    480163
  • Title

    Extracting Structured House Information from BBS Based on Prolog

  • Author

    Chen, Shengshuang ; Wen, Lijuan ; Xiao, Lihua ; Li, Shijun

  • Author_Institution
    Sch. of Sci., Wuhan Univ. of Technol., Wuhan
  • Volume
    4
  • fYear
    2008
  • fDate
    12-14 Dec. 2008
  • Firstpage
    500
  • Lastpage
    503
  • Abstract
    BBS provides users with a space of free communication and plentiful information resources. However, to gain manually useful information from constantly updated, huge and unstructured data is very difficult for users. This paper applies Prolog to BBS data mining, and builds a housing information mining system based on Prolog, which extracts structured house leasing information from the large number of unstructured text in BBS. Firstly, we develop a BBS crawler, which obtains relevant documents from specific BBS sites and stores them to local directory. Then, by analyzing all crawled unstructured documents, we extract useful information (e.g. building name, type and price etc.) and store them to a relational database, on which an interface provided for users to query. We experiment on a real-world BBS, the result shows that this Prolog programs can be applied to BBS data mining effectively.
  • Keywords
    PROLOG; data mining; information retrieval; BBS crawler; BBS data mining; Prolog; relational database; structured house information extraction; Computer science; Crawlers; Data mining; Data structures; Information analysis; Information resources; Logic programming; Relational databases; Space technology; Text categorization; Keywords-Bulletin Board System; Programming in Logic; crawler; data mining; house information;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Software Engineering, 2008 International Conference on
  • Conference_Location
    Wuhan, Hubei
  • Print_ISBN
    978-0-7695-3336-0
  • Type

    conf

  • DOI
    10.1109/CSSE.2008.1309
  • Filename
    4722667