Title :
A Rules and Statistical Learning Based Method for Chinese Patent Information Extraction
Author :
Guangpu, Feng ; Xu, Chen ; Zhiyong, Peng
Author_Institution :
Comput. Sch., Wuhan Univ., Wuhan, China
Abstract :
Patent documents, as a kind of open scientific literature protected by law, the abstracts of which often highly summarize the main information. Information extraction work and analysis of the abstracts can contribute to better protection of intellectual property rights and promotion of enterprise technological innovation. This paper focus on patent abstracts and view information extraction of patent documents as a short text categorization problem, a method based on the combination of rules and statistical learning is used to annotate and extract the information of patent features, composition and usage. Experiments show that our method can not only extract the above three types of information in the patent abstracts, but also has higher accuracy when compared to the rules based method or SVM, which is an efficient and commonly used statistical learning classification algorithm.
Keywords :
industrial property; information retrieval; learning (artificial intelligence); patents; pattern classification; statistical analysis; support vector machines; text analysis; Chinese patent information extraction; enterprise technological innovation; intellectual property right; patent abstract; patent document; rules based method; short text categorization problem; statistical learning based method; statistical learning classification algorithm; support vector machines; Abstracts; Classification algorithms; Data mining; Feature extraction; Libraries; Patents; Support vector machines; information extraction; patent document; rules-based method; statistic learning;
Conference_Titel :
Web Information Systems and Applications Conference (WISA), 2011 Eighth
Conference_Location :
Chongqing
Print_ISBN :
978-1-4577-1812-0
DOI :
10.1109/WISA.2011.29