DocumentCode
495196
Title
Information Extraction Using Link Grammar
Author
Zamin, Norshuhani
Author_Institution
Dept. of Comput. & Inf. Sci., Univ. Teknol. PETRONAS, Tronoh, Malaysia
Volume
5
fYear
2009
fDate
March 31 2009-April 2 2009
Firstpage
149
Lastpage
153
Abstract
In the last few years, information extraction (IE) has become a rapidly expanding field as the machine-readable documents keep growing exponentially. IE is the perfect solution to transform factual knowledge from publications into database entries. Many efforts have been made to automatically extract and mine scientific texts ranging from biochemical to terrorism attacks reports. This study is looking into the opportunity to extract important facts from the PETRONAS health safety and environment (HSE) reports for database construction and analysis purpose. The reports are currently managed by PETRONAS Group HSE in Malaysia which contain the information on incidents and accidents occurred during the design, construction, operation and maintenance by all the PETRONAS Operating Units locally and worldwide. The effort to automate PETRONAS HSE reports will greatly benefit the PETRONAS Group HSE to automatically populate the database entries in which traditionally the task is arduous and time consuming. Many algorithms have been reported for IE ranging from simple statistical methods to advanced natural language processing (NLP) methods. This study investigates one of the NLP approach known as link grammar (LG) for extracting relevant information. LG appears within limited literature search to be the most suitable candidate algorithm. However, an exhaustive literature search will reveal the algorithm best suited to this application work.
Keywords
grammars; information retrieval; natural language processing; statistical analysis; text analysis; PETRONAS health safety and environment; database analysis; database construction; information extraction; link grammar; literature search; machine-readable documents; natural language processing methods; statistical methods; Accidents; Computer science; Data analysis; Data mining; Databases; Health and safety; Information analysis; Natural languages; Statistical analysis; Terrorism; information extraction; link grammar; parser;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science and Information Engineering, 2009 WRI World Congress on
Conference_Location
Los Angeles, CA
Print_ISBN
978-0-7695-3507-4
Type
conf
DOI
10.1109/CSIE.2009.661
Filename
5170515
Link To Document