Title :
A new approach to hash function construction for textual data: A comparison
Author :
Skala, Vaclav ; Petruska, Radek
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of West Bohemia, Plzen, Czech Republic
Abstract :
Many techniques for text processing are based on efficient data storing and retrieval techniques. Careful selection of data structures used and retrieval techniques play a significant role in efficiency of the whole system of data processing. Hashing technique is one very often used technique with O(1) run-time complexity for data storing and retrieval. A comparison of new technique for hash function construction is presented in the paper without need of division operation. The comparison of the proposed technique is especially convenient for large textual data sets processing. State of the art in hashing of textual data is given (the perfect hashing techniques are not included). The proposed hash function construction and hashing technique have been compared with other comparative techniques for different languages and textual data (chemical data sets etc.).
Keywords :
data structures; information retrieval; text analysis; data retrieval techniques; data storing techniques; data structures; hash function construction; hashing technique; run-time complexity; text processing; textual data; Chemicals; Data structures; Databases; Dictionaries; Geophysical measurement techniques; Ground penetrating radar; Java; Hashing function; data structure; information retrieval; large data processing; summarization; text mining; text processing;
Conference_Titel :
Information and Communication Technologies (WICT), 2014 Fourth World Congress on
Print_ISBN :
978-1-4799-8114-4
DOI :
10.1109/WICT.2014.7077299