• DocumentCode
    3584957
  • Title

    A new approach to hash function construction for textual data: A comparison

  • Author

    Skala, Vaclav ; Petruska, Radek

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Univ. of West Bohemia, Plzen, Czech Republic
  • fYear
    2014
  • Firstpage
    39
  • Lastpage
    44
  • Abstract
    Many techniques for text processing are based on efficient data storing and retrieval techniques. Careful selection of data structures used and retrieval techniques play a significant role in efficiency of the whole system of data processing. Hashing technique is one very often used technique with O(1) run-time complexity for data storing and retrieval. A comparison of new technique for hash function construction is presented in the paper without need of division operation. The comparison of the proposed technique is especially convenient for large textual data sets processing. State of the art in hashing of textual data is given (the perfect hashing techniques are not included). The proposed hash function construction and hashing technique have been compared with other comparative techniques for different languages and textual data (chemical data sets etc.).
  • Keywords
    data structures; information retrieval; text analysis; data retrieval techniques; data storing techniques; data structures; hash function construction; hashing technique; run-time complexity; text processing; textual data; Chemicals; Data structures; Databases; Dictionaries; Geophysical measurement techniques; Ground penetrating radar; Java; Hashing function; data structure; information retrieval; large data processing; summarization; text mining; text processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information and Communication Technologies (WICT), 2014 Fourth World Congress on
  • Print_ISBN
    978-1-4799-8114-4
  • Type

    conf

  • DOI
    10.1109/WICT.2014.7077299
  • Filename
    7077299