Knowledge Extraction Based on Sentence Matching and Analyzing

Author

Bolin, Hua

Author_Institution

Instn. of Sci. & Tech. Inf. of China, Beijing

fYear

2008

fDate

21-22 Dec. 2008

Firstpage

122

Lastpage

126

Abstract

Knowledge extraction extracts knowledge from documents by analyzing document contents, then markups knowledge attribute and stores them into knowledge base. This paper discusses definition of knowledge extraction at first, then presents goal, key technique, application and system architecture of knowledge extraction based on sentence matching and analyzing. The system identifies new sentences in a paper, and makes an analysis of the sentences from internal structure and subject semantics. Then justify knowledge metadata by analyzing association among sentences and pragmatic of the whole paper. At last, allocates knowledge attribute tag for extracted knowledge, such as definition, historic development, characteristics, key technique, classification, application and development trend in the future. Knowledge extraction based on sentence matching and analyzing can not only justify the academic copying or scientific citation automatically, as well as make a document reviewed automatically, but also achieve a conversion on analysis from paper and chapter to sentence and paragraph, which can induce a revolution in knowledge organization and management. Therefore, the study has far-reaching theoretical significance and wide application.

Keywords

document handling; knowledge acquisition; knowledge based systems; pattern matching; document content analysis; document knowledge extraction; knowledge attribute markup; knowledge attribute tag allocation; knowledge management; knowledge meta data; knowledge organization; sentence matching; subject semantics; Citation analysis; Data mining; Explosions; Graphics; Image converters; Information analysis; Knowledge acquisition; Knowledge management; Natural language processing; Tagging; content analysis; knowledge extraction; natural language processing; sentence similarity;

fLanguage

English

Publisher

ieee

Conference_Titel

Knowledge Acquisition and Modeling, 2008. KAM '08. International Symposium on

Conference_Location

Wuhan

Print_ISBN

978-0-7695-3488-6

Type

conf

DOI

10.1109/KAM.2008.156

Filename

4732799