• DocumentCode
    3141690
  • Title

    An answer extraction method based on discourse structure and rank learning

  • Author

    Zong, Huanyun ; Yu, Zhengtao ; Guo, Jianyi ; Xian, Yantuan ; Li, Jian

  • Author_Institution
    Sch. of Inf. Eng. & Autom., Kunming Univ. of Sci. & Technol., Kunming, China
  • fYear
    2011
  • fDate
    27-29 Nov. 2011
  • Firstpage
    132
  • Lastpage
    139
  • Abstract
    For the complex questions of Chinese question answering system such as `why´, `how´ these non-factoid questions, we proposed an answer extraction method using discourse structures features and ranking algorithm. This method takes the judge problem of answers relevance as learning to rank answers. First, the method analyses questions to generate the query string, and then uses rhetorical structure theory and the natural language processing technology of vocabulary, syntax, semantic analysis to analyze the retrieved documents, so as to determine the inherent relationship between paragraphs or sentences and generate the answer candidate paragraphs or sentences. Thirdly, construct the answer ranking model, extract five group features of similarity features, density and frequency features, translation features, discourse structure features and external knowledge features to train ranking model. Finally, re-ranking the answers with the training model and find the optimal answers. Experiments show that the proposed method can effectively improve the accuracy and quality of non-factoid answers.
  • Keywords
    document handling; natural language processing; query processing; question answering (information retrieval); vocabulary; Chinese question answering system; answer candidate paragraph generation; answer extraction method; answer rank learning; answer ranking model; discourse structures features; document; group feature extraction; judge problem; knowledge features; natural language processing technology; query string generation; ranking algorithm; rhetorical structure theory; semantic analysis; sentences; syntax; vocabulary; Analytical models; Biomedical optical imaging; Integrated optics; Optical reflection; Pragmatics; answer extracting; complex questions; discourse structure; learning to rank;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Language Processing andKnowledge Engineering (NLP-KE), 2011 7th International Conference on
  • Conference_Location
    Tokushima
  • Print_ISBN
    978-1-61284-729-0
  • Type

    conf

  • DOI
    10.1109/NLPKE.2011.6138181
  • Filename
    6138181