Title :
A signature file method for Korean text retrieval
Author :
Song, Byoungho ; Lee, Sukho
Author_Institution :
Dept. of Comput. Eng., Seoul Nat. Univ., South Korea
Abstract :
Many content-based retrieval methods for text have been proposed. Among them, signature file methods are useful when inverted files are not available. However, traditional word-oriented signature file methods cannot retrieve all the relevant text items in the variable-spacing environment such as Korean text. The authors describe a spacing-tolerant Korean signature extraction method using 2-syllable patterns. With this method, all the relevant text items are retrieved. The advantages of this method are presented by mathematical analysis and experimental performance comparison with a word-oriented method
Keywords :
information retrieval systems; natural languages; pattern recognition; word processing; 2-syllable patterns; Korean text; content-based retrieval methods; inverted files; mathematical analysis; signature file methods; spacing-tolerant Korean signature extraction method; variable-spacing environment; word-oriented method; Automation; Concatenated codes; Content based retrieval; Data mining; Dictionaries; Information retrieval; Mathematical analysis; Multimedia databases; Natural languages;
Conference_Titel :
Developing and Managing Intelligent System Projects, 1993., IEEE International Conference on
Conference_Location :
Washington, DC
Print_ISBN :
0-8186-3730-7
DOI :
10.1109/DMISP.1993.248620