DocumentCode :
3241495
Title :
Genetic Algorithm Based to Improve HTML Document Retrieval
Author :
Al-Dallal, Ammar ; Abdul-Wahab, Rasha S.
Author_Institution :
Sch. of Inf. Syst. Comput. & Math., Brunel Univ., Uxbridge, UK
fYear :
2009
fDate :
14-16 Dec. 2009
Firstpage :
343
Lastpage :
348
Abstract :
This paper describes GAHWM, a new evolutionary algorithm that integrates genetic algorithm paradigm with an inverted index model to mine the content of HTML documents for effective Web document retrieval. This method is superior in terms of recall and precision over various real life datasets.
Keywords :
Internet; data mining; genetic algorithms; hypermedia markup languages; information retrieval; GAHWM; HTML Web content mining; HTML document retrieval; Web document retrieval; evolutionary algorithm; genetic algorithm; inverted index model; Biological cells; Content based retrieval; Data mining; Evolutionary computation; Genetic algorithms; HTML; Information retrieval; Search engines; Web mining; Web pages; AI; Genetic Algorithm; Inverted Index; Web Mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Developments in eSystems Engineering (DESE), 2009 Second International Conference on
Conference_Location :
Abu Dhabi
Print_ISBN :
978-1-4244-5401-3
Electronic_ISBN :
978-1-4244-5402-0
Type :
conf
DOI :
10.1109/DeSE.2009.57
Filename :
5395140
Link To Document :
بازگشت