Title :
Realization and evaluation of a decaying co-occurrence model based on parallel computing
Author :
Wang, Zhong ; Guo, Lan-Shen ; He, Pi-Lian ; Zheng, Xiao-Shen
Author_Institution :
Sch. of Electron. & Inf., Tianjin Univ., China
Abstract :
This paper proposes and realizes a decaying co-occurrence model, which can generate the thesaurus of similarity between words automatically. The model can get a more precise thesaurus because it considers the distance between words, which is unlike the common co-occurrence model. But its huge computation makes it impracticable in a very large corpus. In order to overcome this limitation, a parallel computing system for calculating the decaying co-occurrence model has been developed, and has been running successfully on the MPI cluster environment. High speedup and efficiency are obtained. This is a substantive step for the construction of basics resource of Chinese information processing.
Keywords :
information retrieval; message passing; natural languages; parallel algorithms; thesauri; word processing; Chinese information processing; decaying cooccurrence model; information retrieval; message passing interface; natural language processing; parallel computing; thesaurus generation; word cooccurrence; High performance computing; Information processing; Information retrieval; Logic; Natural language processing; Parallel processing; Predictive models; Probability; Statistics; Thesauri;
Conference_Titel :
Machine Learning and Cybernetics, 2003 International Conference on
Print_ISBN :
0-7803-8131-9
DOI :
10.1109/ICMLC.2003.1259854