DocumentCode :
1648416
Title :
Similarity score for information filtering thresholds in business processes
Author :
Lai, Jun ; Soh, Ben ; Ali, Saqib
Author_Institution :
Dept. of Comput. Sci. & Comput. Eng., La Trobe Univ., Australia
fYear :
2004
Firstpage :
743
Lastpage :
748
Abstract :
The tremendous growth in the amount of information available poses some key challenges for information filtering and retrieval. Users not only expect high quality and relevant information, but also wish that the information he presented in an as efficient way as possible. The traditional filtering methods, however, only consider the relevant values of document. These conventional methods fail to consider the efficiency of documents retrieval. In this paper, we propose a new algorithm to calculate an index called document similarity score based on elements of the document. Using the index, document profile is derived. Any documents with the similarity score above a given threshold is clustered. Using these pre-clustered documents, information filtering and retrieval can be made more efficient. Experimental results clearly show our proposed method tremendously improves the efficiency of information filtering and retrieval. We also give an example application of our proposed method in business processes.
Keywords :
electronic commerce; information filtering; business processes; document profile; document similarity score; information filtering; information retrieval; preclustered documents; Australia; Clustering algorithms; Computer science; Crawlers; Information filtering; Information filters; Information retrieval; Internet; Search engines; Web sites;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multitopic Conference, 2004. Proceedings of INMIC 2004. 8th International
Print_ISBN :
0-7803-8680-9
Type :
conf
DOI :
10.1109/INMIC.2004.1492988
Filename :
1492988
Link To Document :
بازگشت