  • DocumentCode
    3699219
  • Title
    A parallel cross-language retrieval system for patent documents
  • Author
    Xin Shen;Heyan Huang;Lingzhi Li;Yonggang Huang
  • Author_Institution
    Beijing Engineering Research Center of High Volume Language Information Processing &
  • fYear
    2015
  • Firstpage
    672
  • Lastpage
    676
  • Abstract
    To help people obtain useful information from patent documents in different languages, this paper proposes a cross-language retrieval system that searches Chinese and English patent documents simultaneously. The system consists of a query translation module, a document retrieval module, and a user interaction module. The query translation module translates queries using bilingual dictionaries. The document retrieval module consists of a monolingual retrieval system using the standard vector space model; to make retrieval highly parallel, we use the MapReduce model to calculate similarity. The user interaction module provides users with an interactive mechanism for improving retrieval accuracy. It contains two parts: second translation and relevance feedback. The experimental results show that the system performs well.
  • Keywords
    "Patents","Dictionaries","Context","Data processing","Accuracy","Machine learning algorithms"
  • Publisher
    IEEE
  • Conference_Title
    Software Engineering and Service Science (ICSESS), 2015 6th IEEE International Conference on
  • ISSN
    2327-0586
  • Print_ISBN
    978-1-4799-8352-0
  • Electronic_ISBN
    2327-0594
  • Type
    conf
  • DOI
    10.1109/ICSESS.2015.7339147
  • Filename
    7339147