• DocumentCode
    2350044
  • Title

    Biological question answering with syntactic and semantic feature matching and an improved mean reciprocal ranking measurement

  • Author

    Lin, Ryan T.K. ; Liang-Te Chiu, Justin ; Dai, Hong-Jei ; Day, Min-Yuh ; Tsai, Richard Tzong-Han ; Hsu, Wen-Lian

  • Author_Institution
    Institute of Information Science, Academia Sinica, Taipei, China
  • fYear
    2008
  • fDate
    13-15 July 2008
  • Firstpage
    184
  • Lastpage
    189
  • Abstract
    Specific information on biomolecular events such as protein-protein and gene-protein interactions is essential for molecular biology researchers. However, the results returned by current keyword-based information retrieval engines contain a great deal of noise, which forces biologists to combine several keywords to locate information. To resolve this problem, we propose a question answering (QA) system that offers a more efficient and user-friendly way to retrieve the desired information. In addition, QA system measurements may suffer from the same-score problem, so the evaluation of a QA system may be unfair. To address the same-score problem, we propose an improved mean reciprocal rank (MRR) measurement, the mean average reciprocal rank (MARR), together with an efficient formula that reduces the computational complexity of the MARR. With our syntactic and semantic features, our system achieves a Top-1 MARR of 74.11% and a Top-5 MARR of 76.68%. Compared to the baseline system, Top-1 MARR and Top-5 MARR increase by 16.17% and 18.61%, respectively.
  • Keywords
    Biomedical measurements; Computer science; DNA; Data mining; Engines; Feature extraction; Information retrieval; Information science; Protein engineering; RNA;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Information Reuse and Integration, 2008. IRI 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV, USA
  • Print_ISBN
    978-1-4244-2659-1
  • Electronic_ISBN
    978-1-4244-2660-7
  • Type

    conf

  • DOI
    10.1109/IRI.2008.4583027
  • Filename
    4583027
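
The abstract above names the same-score problem and an averaged variant of MRR, but this record does not give the MARR definition or its efficient formula. The following is only a minimal sketch of the general idea, under the assumption that answers with equal scores are treated as occupying every rank in their tie group with equal probability, so the reciprocal rank is averaged over that group. The function names, the `top_n` cutoff handling, and the input layout are all hypothetical and are not taken from the paper.

```python
def reciprocal_rank(scores, correct_idx):
    """Plain reciprocal rank: sort by descending score, ties keep input order."""
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    return 1.0 / (order.index(correct_idx) + 1)


def tie_averaged_reciprocal_rank(scores, correct_idx, top_n=None):
    """Assumed tie-aware variant (not necessarily the paper's MARR).

    The correct answer shares its score with `tied` candidates (itself
    included), which together occupy ranks higher+1 .. higher+tied.
    Averaging 1/rank over those ranks needs no enumeration of orderings.
    Ranks beyond `top_n`, if given, contribute zero.
    """
    target = scores[correct_idx]
    higher = sum(1 for s in scores if s > target)
    tied = sum(1 for s in scores if s == target)
    ranks = range(higher + 1, higher + tied + 1)
    total = sum(1.0 / r for r in ranks if top_n is None or r <= top_n)
    return total / tied


def mean_tie_averaged_rr(questions, top_n=None):
    """Mean over questions; each item is a (scores, correct_idx) pair."""
    values = [tie_averaged_reciprocal_rank(s, c, top_n) for s, c in questions]
    return sum(values) / len(values)


if __name__ == "__main__":
    scores = [0.9, 0.9, 0.5]  # the first two candidates have the same score
    print(reciprocal_rank(scores, 1))               # depends on input order
    print(tie_averaged_reciprocal_rank(scores, 1))  # independent of input order
```

With the plain measure, swapping the two tied candidates changes the score of the question even though the system output is identical, which is the unfairness the abstract refers to. The averaged version gives the same value for either ordering.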