DocumentCode :
3132342
Title :
Classification of Cancer Stage from Free-text Histology Reports
Author :
McCowan, Iain ; Moore, Darren ; Fry, Mary-Jane
Author_Institution :
CSIRO eHealth Res. Centre, Brisbane, Qld.
fYear :
2006
fDate :
Aug. 30 2006-Sept. 3 2006
Firstpage :
5153
Lastpage :
5156
Abstract :
This article investigates the classification of a patient's lung cancer stage based on analysis of their free-text medical reports. The system uses natural language processing to transform the report text, including identification of UMLS terms and detection of negated findings. The transformed report is then classified using statistical machine learning techniques. A support vector machine is trained for each stage category based on word occurrences in a corpus of histology reports for pathologically staged patients. New reports can be classified according to the most likely stage, allowing the collection of population stage data for analysis of outcomes. While the system could in principle be applied to stage different cancer types, the current work focuses on lung cancer due to data availability. The article presents initial experiments quantifying system performance for T and N staging on a corpus of histology reports from more than 700 lung cancer patients.
Keywords :
cancer; classification; learning (artificial intelligence); lung; medical information systems; natural language processing; support vector machines; text analysis; tumours; N staging; SVM-based text classification; T staging; UMLS term identification; free-text histology reports; lung cancer stage classification; natural language processing; population stage data; statistical machine learning techniques; support vector machine; unified medical language system; Availability; Cancer; Data analysis; Lungs; Machine learning; Natural language processing; Support vector machine classification; Support vector machines; System performance; Unified modeling language;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Engineering in Medicine and Biology Society, 2006. EMBS '06. 28th Annual International Conference of the IEEE
Conference_Location :
New York, NY
ISSN :
1557-170X
Print_ISBN :
1-4244-0032-5
Electronic_ISBN :
1557-170X
Type :
conf
DOI :
10.1109/IEMBS.2006.259563
Filename :
4462964