Title :
Classification of PICO elements by text features systematically extracted from PubMed abstracts
Author :
Huang, Ke-Chun ; Liu, Charles Chih-Ho ; Yang, Shung-Shiang ; Liao, Chun-Chih ; Xiao, Furen ; Wong, Jau-Min ; Chiang, I-Jen
Author_Institution :
Grad. Inst. of Biomed. Eng., Nat. Taiwan Univ., Taipei, Taiwan
Abstract :
We propose and evaluate a systematic approach to detect and classify Patient/Problem, Intervention, Comparison and Outcome (PICO) from the medical literature. The training and test corpora were generated systematically and automatically from structured PubMed abstracts. 23,472 sentences by exact pattern match of head words of P-I-O categories. Afterward, the terms with top frequencies were used as the features of Naïve Bayesian classifier. This approach achieves F-measure values of 0.91 for Patient/Problem, 0.75 for Intervention and 0.88 for Outcome, comparable to previous studied based on mixed textural, paragraphical, and semantic features. In conclusion, we show that by stricter pattern matching criteria of training set, detection and classification of PICO elements can be reproducible with minimal expert intervention. The results of this work are higher than previous studies.
Keywords :
belief networks; information retrieval; pattern classification; pattern matching; text analysis; F-measure values; Naïve Bayesian classifier; P-I-O categories; PICO element classification; medical literature; mixed textural feature; paragraphical feature; patient-problem-intervention-comparison outcome; pattern matching criteria; semantic features; structured PubMed abstracts; test corpora; training corpora; training set; Abstracts; Bayesian methods; Informatics; Knowledge based systems; Pattern matching; Testing; Training; information extraction; natural language processing; question answering; text mining;
Conference_Titel :
Granular Computing (GrC), 2011 IEEE International Conference on
Conference_Location :
Kaohsiung
Print_ISBN :
978-1-4577-0372-0
DOI :
10.1109/GRC.2011.6122608