• DocumentCode
    2686182
  • Title
    A Text Representation and Retrieval Method Based on Concept Algebra
  • Author
    Ye, Feiyue ; Cao, Hongxin ; Luo, Xiangfeng
  • Author_Institution
    Sch. of Comput. Eng. & Sci., Shanghai Univ., Shanghai, China
  • fYear
    2012
  • fDate
    27-29 Oct. 2012
  • Firstpage
    1066
  • Lastpage
    1071
  • Abstract
    This paper uses concept algebra (CA) theory as the basis for conceptual representation and derivation in text processing, with the aim of realizing a semantic-based retrieval system. Hownet is used to create the concept attribute space for concept algebra. With the help of LTP, the key words of every sentence and their dependency relations are extracted to build a CA concept representation of the content as a five-tuple. A concept can thus express both the keyword itself and its semantic relations with its context. To meet the demands of text retrieval, some CA operations are optimized for calculating the relations and similarity between concepts. A text retrieval system framework that processes information at the concept level, based on concept relations, is also proposed to verify the advantages of the method.
  • Keywords
    information retrieval; knowledge representation; text analysis; Hownet; concept algebra; concept attribute space; concept similarity; conceptual representation; key words; semantic based retrieval system; semantic relation; sentence; text processing; text representation; text retrieval method; text retrieval system; Algebra; Cognition; Context; Educational institutions; Semantics; LTP; concept relation; semantic; text retrieval
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer and Information Technology (CIT), 2012 IEEE 12th International Conference on
  • Conference_Location
    Chengdu
  • Print_ISBN
    978-1-4673-4873-7
  • Type
    conf
  • DOI
    10.1109/CIT.2012.218
  • Filename
    6392054