Title :
Information extraction from legal documents
Author :
Cheng, Tin Tin ; Cua, Jeffrey Leonard ; Tan, Mark Davies ; Yao, Kenneth Gerard ; Roxas, Rachel Edita
Author_Institution :
BS Comput. Sci., De La Salle Univ., Manila, Philippines
Abstract :
Legal TRUTHS (turning unstructured texts to helpful structure) is a system that extracts relevant information from Philippine Supreme Court decisions, specifically on criminal cases. We describe here the processes involved in the development of Legal TRUTHS focusing on the issues relating to the domain and the geographical setting of the source documents, and the performance evaluation results are also presented. Pertinent information to be extracted for criminal cases such as the crime, the date and time of commission, the plaintiff, and the penalty were determined from a sample set of documents. Sections of these documents were identified for initial segmentation of the data. Automatic filtering of the data was involved in drawing out relevant information from the texts. From 25 training documents and also the same set for testing, performance showed over-all precision at 91.7%, recall at 99.5%, and F-measure at 95.6%. Testing on another 50 documents showed over-all precision at 84.3%, recall at 95.8%, and F-measure at 91.0%.
Keywords :
criminal law; document handling; information retrieval; Philippine Supreme Court decisions; automatic data filtering; criminal cases; helpful structure; information extraction; legal TRUTHS; legal documents; turning unstructured texts; Data mining; Information filtering; Information filters; Law; Legal factors; Natural language processing; Tagging; Testing; Tin; Turning;
Conference_Titel :
Natural Language Processing, 2009. SNLP '09. Eighth International Symposium on
Conference_Location :
Bangkok
Print_ISBN :
978-1-4244-4138-9
Electronic_ISBN :
978-1-4244-4139-6
DOI :
10.1109/SNLP.2009.5340925