DocumentCode
2363217
Title
Automated dictionary construction for information extraction from text
Author
Riloff, Ellen ; Lehnert, Wendy
Author_Institution
Dept. of Comput. Sci., Massachusetts Univ., Amherst, MA, USA
fYear
1993
fDate
1-5 Mar 1993
Firstpage
93
Lastpage
99
Abstract
The authors have developed a tool called AutoSlog that automatically constructs domain-specific dictionaries given a set of annotated training texts. Using AutoSlog, a first-year graduate student who had minimal experience with the CIRCUS sentence analyzer, on which AutoSlog is based, created a dictionary for the domain of terrorism in 8 hours. In the experiments, the 8-hour AutoSlog dictionary achieved 90% of the performance of a hand-crafted dictionary that required 1500 person-hours of effort by 2 advanced graduate students who were highly skilled with the sentence analyzer.
Keywords
character recognition; feature extraction; glossaries; word processing; AutoSlog; CIRCUS sentence analyzer; annotated training texts; automated dictionary construction; domain-specific dictionaries; information extraction; Computer science; Data mining; Dictionaries; Industrial training; Knowledge engineering; Natural language processing; Performance analysis; Scalability; Terrorism; Text analysis
fLanguage
English
Publisher
IEEE
Conference_Titel
Artificial Intelligence for Applications, 1993. Proceedings., Ninth Conference on
Conference_Location
Orlando, FL
Print_ISBN
0-8186-3840-0
Type
conf
DOI
10.1109/CAIA.1993.366656
Filename
366656