DocumentCode
3627022
Title
Document Classification Based on the Topic Evaluation and Its Usage in Data Compression
Author
Jan Martinovic;Jiri Dvorsky
Author_Institution
VSB - Tech. Univ. of Ostrava, Ostrava
fYear
2007
Firstpage
204
Lastpage
207
Abstract
Several actions are usually performed when document is appended to textual database in information retrieval system. The most frequent actions are compression of the document and cluster analysis of the textual database to improve quality of answers to users´ queries. The information retrieved from the clustering can be very helpful in compression. Word-based compression using information about cluster hierarchy is presented in this paper. Some experimental results are provided at the end of the paper.
Keywords
"Data compression","Search engines","Information retrieval","Web search","Intelligent agent","Spatial databases","Clustering algorithms","Clustering methods","Extraterrestrial measurements","Conferences"
Publisher
ieee
Conference_Titel
Web Intelligence and Intelligent Agent Technology Workshops, 2007 IEEE/WIC/ACM International Conferences on
Print_ISBN
0-7695-3028-1
Type
conf
DOI
10.1109/WI-IATW.2007.109
Filename
4427572
Link To Document