DocumentCode :
3457474
Title :
Automated categorization of real-time newswire stories. Hooked on lexiconics: how I taught my Sun to read
Author :
Mitchell, Stephen ; Auernheimer, Brent
Author_Institution :
California Agric. Technol. Inst., Fresno, CA, USA
Volume :
5
fYear :
1996
fDate :
3-6 Jan 1996
Firstpage :
92
Abstract :
Modern international trade activities rely heavily on thousands of daily information artifacts reporting the state of the world´s trading blocks. These information artifacts often require handling by human operators for indexing, sorting, and categorization. Intervention by human operators costs precious hours in the dissemination of these artifacts to end users. The paper describes the information recognition capability that the California Agricultural Technology Institute (CATI) developed as part of its Advanced Technologies Information Network (ATI-Net). The capability includes software using statistical analysis of previously human-recognized documents in order to seed information recognition databases. The recognition databases are used by automated recognition software to classify and store information artifacts without human intervention. This software is discussed with reference to two ATI-Net projects: automatic storage of newspaper articles into categories of interest to the public, and the assignment of Department of Commerce industry codes to international trade lead reports. Each of these projects takes advantage of several years of previously human-recognized information artifacts in order to automate the recognition process
Keywords :
classification; indexing; information dissemination; international trade; pattern recognition; sorting; statistical analysis; Advanced Technologies Information Network; California Agricultural Technology Institute; Department of Commerce industry codes; automated real-time newswire story categorisation; automated recognition software; automatic newspaper article storage; categorization; classification; daily information artifacts; indexing; information artifact storage; information dissemination; information recognition capability; information recognition databases; international trade activities; international trade lead reports; lexiconics; sorting; statistical analysis; trading block; Costs; Databases; Humans; Indexing; International trade; Paper technology; Sorting; Statistical analysis; Storage automation; US Department of Commerce;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
System Sciences, 1996., Proceedings of the Twenty-Ninth Hawaii International Conference on ,
Conference_Location :
Wailea, HI
Print_ISBN :
0-8186-7324-9
Type :
conf
DOI :
10.1109/HICSS.1996.495302
Filename :
495302
Link To Document :
بازگشت