Title :
Towards automatic multilevel indexing for Thai text information retrieval
Author :
Kawtrakul, Asanee ; Thumkanon, Chalathip ; Mcfetridge, Paul
Author_Institution :
Dept. of Comput. Eng, Kasetsart Univ., Bangkok, Thailand
Abstract :
To enhance the effectiveness of Thai text retrieval system, we need a significant improvement in automatic indexing. This paper is intended to provide the application of statistical and natural language processing techniques to obtain multilevel content identifiers: phrasal level, single term level and conceptual level. These multilevel indices will cover a very wide range of document retrieval without degradation of system performance. Automatic multilevel indexing for Thai text requires three processes: lexical token identification, phrase identification and extraction, and multilevel index generation. The results give a significant benefit both in precision and recall
Keywords :
indexing; information retrieval; natural languages; statistical analysis; text analysis; Thai text information retrieval; automatic multilevel indexing; conceptual level; document retrieval; lexical token identification; multilevel content identifiers; multilevel index generation; natural language processing techniques; phrasal level; phrase extraction; phrase identification; single term level; statistical techniques; Degradation; Information retrieval; Intelligent systems; Laboratories; Machine assisted indexing; Natural language processing; Natural languages; System performance; Telephony; Thesauri;
Conference_Titel :
Circuits and Systems, 1998. IEEE APCCAS 1998. The 1998 IEEE Asia-Pacific Conference on
Conference_Location :
Chiangmai
Print_ISBN :
0-7803-5146-0
DOI :
10.1109/APCCAS.1998.743879