Title :
Research of auto-constructed implementation technology for information extraction patterns
Author :
Azgli, W. ; De-zheng Zhang ; Huan-sheng Zhang ; Xiao-Li Li ; Xi-xuan Chen
Author_Institution :
Sch. of Inf. Eng., Univ. of Sci. & Technol., Beijing
Abstract :
Based on the characteristics of traditional Chinese medicine (TCM) clinical cases, this paper proposes GEPTCMA, a bootstrapping-based algorithm for automatically generating extraction patterns. The algorithm comprises two models: TCM-RPAM and TCMW-MODEL. TCM-RPAM acquires bi relations and bi relation patterns; TCMW-MODEL acquires the sets of domain keywords in traditional Chinese medicine. Our experimental results show that the precision and recall of data extraction with this system are as good as those obtained with templates based on structured information extraction. Domain experts in traditional Chinese medicine have also endorsed the results.
Keywords :
information retrieval; medical computing; GEPTCMA; TCM-RPAM; TCMW-MODEL; auto-constructed implementation technology; automatic generation algorithm; bi relations; bootstrapping; information extraction patterns; traditional Chinese medicine clinical cases; Automation; Character generation; Data mining; Educational institutions; Hidden Markov models; Intelligent control; Pattern matching; Traditional Chinese Medicine; extraction patterns; information extraction
Conference_Title :
Intelligent Control and Automation, 2008. WCICA 2008. 7th World Congress on
Conference_Location :
Chongqing
Print_ISBN :
978-1-4244-2113-8
Electronic_ISBN :
978-1-4244-2114-5
DOI :
10.1109/WCICA.2008.4593804