• DocumentCode
    401781
  • Title
    Realization and evaluation of a decaying co-occurrence model based on parallel computing

  • Author
    Wang, Zhong ; Guo, Lan-Shen ; He, Pi-Lian ; Zheng, Xiao-Shen

  • Author_Institution
    Sch. of Electron. & Inf., Tianjin Univ., China
  • Volume
    4
  • fYear
    2003
  • fDate
    2-5 Nov. 2003
  • Firstpage
    2112
  • Abstract
    This paper proposes and implements a decaying co-occurrence model that automatically generates a thesaurus of word similarity. Unlike the common co-occurrence model, it takes the distance between words into account, which yields a more precise thesaurus. However, its heavy computational cost makes it impractical for very large corpora. To overcome this limitation, a parallel computing system for calculating the decaying co-occurrence model was developed and has run successfully in an MPI cluster environment, achieving high speedup and efficiency. This is a substantive step toward building basic resources for Chinese information processing.
  • Keywords
    information retrieval; message passing; natural languages; parallel algorithms; thesauri; word processing; Chinese information processing; decaying cooccurrence model; information retrieval; message passing interface; natural language processing; parallel computing; thesaurus generation; word cooccurrence; High performance computing; Information processing; Information retrieval; Logic; Natural language processing; Parallel processing; Predictive models; Probability; Statistics; Thesauri;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Machine Learning and Cybernetics, 2003 International Conference on
  • Print_ISBN
    0-7803-8131-9
  • Type
    conf

  • DOI
    10.1109/ICMLC.2003.1259854
  • Filename
    1259854
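
The abstract describes a co-occurrence score that depends on the distance between words and is computed in parallel over a large corpus. The record does not give the paper's actual formula or partitioning scheme, so the following is only a minimal sketch under stated assumptions: an exponential decay weight, a fixed look-ahead window, and a plain merge step standing in for an MPI-style reduction of per-worker results. The names `decaying_cooccurrence`, `merge_scores`, `window`, and `decay` are hypothetical.

```python
import math
from collections import defaultdict


def decaying_cooccurrence(tokens, window=5, decay=0.5):
    """Accumulate distance-weighted co-occurrence scores for word pairs.

    Assumption (not from the paper): a pair at distance d contributes
    exp(-decay * (d - 1)), so adjacent words contribute 1.0.
    """
    scores = defaultdict(float)
    for i, left in enumerate(tokens):
        # Look ahead at most `window` positions from token i.
        for d in range(1, window + 1):
            j = i + d
            if j >= len(tokens):
                break
            right = tokens[j]
            if left == right:
                continue  # skip a word paired with itself
            pair = tuple(sorted((left, right)))  # unordered pair key
            scores[pair] += math.exp(-decay * (d - 1))
    return dict(scores)


def merge_scores(partials):
    """Sum per-chunk score tables, as a reduction step across workers would."""
    total = defaultdict(float)
    for part in partials:
        for pair, score in part.items():
            total[pair] += score
    return dict(total)


if __name__ == "__main__":
    # Each document plays the role of one worker's share of the corpus.
    corpus = [["parallel", "computing", "model"], ["model", "computing", "cluster"]]
    partials = [decaying_cooccurrence(doc, window=2) for doc in corpus]
    for pair, score in sorted(merge_scores(partials).items()):
        print(pair, round(score, 3))
```

Because each document is scored independently and the tables are combined by addition, the per-document work could be distributed across cluster nodes and reduced at the end; whether the paper partitions by document is not stated in this record.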