DocumentCode
1835142
Title
Information Extraction of Forum Based on Regular Expression
Author
Gang He ; Yingwei Zhang ; Xiaochun Wu
Author_Institution
Beijing Key Lab. of Network Syst. Archit. & Convergence, Beijing Univ. of Posts & Telecommun., Beijing, China
Volume
2
fYear
2013
fDate
26-27 Aug. 2013
Firstpage
118
Lastpage
122
Abstract
This paper introduces the popular universal forum systems in domestic mainstream forum and analyzes the unique characteristics of these forum systems. Based on these unique characteristics, we propose the concept of system fingerprint which can used to detect the different systems of forum exactly and extract the users´ information efficiently. It contributes to the development of network information auditing. Experimental results show that the approach can achieve high extraction accuracy. It has important application value and practical significance.
Keywords
Web sites; information retrieval; domestic mainstream forum; forum information extraction; network information auditing; popular universal forum systems; regular expression; system fingerprint; Data mining; Digital video broadcasting; Feature extraction; Fingerprint recognition; Information retrieval; Internet; Lead; forum system fingerprint; information extraction; regular expression matching;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Human-Machine Systems and Cybernetics (IHMSC), 2013 5th International Conference on
Conference_Location
Hangzhou
Print_ISBN
978-0-7695-5011-4
Type
conf
DOI
10.1109/IHMSC.2013.175
Filename
6642703
Link To Document