Title :
The research and application of the Chinese sports information extraction
Author :
Zhang, Su-xiang ; Meng, Luo-ming ; Gao, Guo-yang ; Qin, Ying
Author_Institution :
Inst. of Network Technol., Beijing Univ. of Posts & Telecommun., Beijing, China
Abstract :
This paper proposes a new approach to Chinese sports information extraction that combines the model framework of unified conditional random fields with rules. We investigate two key technologies of Information Extraction (IE), Named Entity Recognition (NER) and automatic entity Relation Extraction (RE), and explore several new features for our model in the “sports game” scenario. The proposed method is applied to data collected from sohu.com and sina.com. Experimental results show that combining these elements improves IE performance, with recall, precision and F-measure of 95.70%, 93.00% and 94.33%, respectively.
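The three reported scores can be checked against each other. The sketch below assumes the F-measure is the balanced F1 score (the harmonic mean of precision and recall, beta = 1); the abstract does not state which variant was used, and the function name f_measure is illustrative only, not taken from the paper.

```python
def f_measure(precision: float, recall: float, beta: float = 1.0) -> float:
    """Weighted harmonic mean of precision and recall (F-beta).

    beta = 1.0 gives the balanced F1 score; this choice is an assumption,
    since the abstract does not name the variant.
    """
    if precision <= 0 or recall <= 0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Values reported in the abstract, in percent.
precision = 93.00
recall = 95.70

f1 = f_measure(precision, recall)
print(f"{f1:.2f}")  # prints 94.33
```

Under that assumption, the computed value agrees with the reported 94.33%.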
Keywords :
computational linguistics; data structures; feature extraction; natural language processing; sport; Chinese sports information extraction; F-measure; IE; NER; RE; automatic entity relation extraction; data collection; named entity recognition; sina.com; sohu.com; sports game scene; unified conditional random fields; Abstracts; Games; Manuals; Semantics; Conditional random fields; Entity relation extraction; Information extraction; Named entity recognition;
Conference_Title :
Machine Learning and Cybernetics (ICMLC), 2012 International Conference on
Conference_Location :
Xian
Print_ISBN :
978-1-4673-1484-8
DOI :
10.1109/ICMLC.2012.6358888