DocumentCode
401781
Title
Realization and evaluation of a decaying co-occurrence model based on parallel computing
Author
Wang, Zhong ; Guo, Lan-Shen ; He, Pi-Lian ; Zheng, Xiao-Shen
Author_Institution
Sch. of Electron. & Inf., Tianjin Univ., China
Volume
4
fYear
2003
fDate
2-5 Nov. 2003
Firstpage
2112
Abstract
This paper proposes and realizes a decaying co-occurrence model, which can generate the thesaurus of similarity between words automatically. The model can get a more precise thesaurus because it considers the distance between words, which is unlike the common co-occurrence model. But its huge computation makes it impracticable in a very large corpus. In order to overcome this limitation, a parallel computing system for calculating the decaying co-occurrence model has been developed, and has been running successfully on the MPI cluster environment. High speedup and efficiency are obtained. This is a substantive step for the construction of basics resource of Chinese information processing.
Keywords
information retrieval; message passing; natural languages; parallel algorithms; thesauri; word processing; Chinese information processing; decaying cooccurrence model; information retrieval; message passing interface; natural language processing; parallel computing; thesaurus generation; word cooccurrence; High performance computing; Information processing; Information retrieval; Logic; Natural language processing; Parallel processing; Predictive models; Probability; Statistics; Thesauri;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2003 International Conference on
Print_ISBN
0-7803-8131-9
Type
conf
DOI
10.1109/ICMLC.2003.1259854
Filename
1259854
Link To Document