مرکز منطقه ای اطلاع رساني علوم و فناوري - Text classification and minimal-bias training vectors

DocumentCode :

1856219

Title :

Text classification and minimal-bias training vectors

Author :

Ahmad, Khurshid ; Bale, Tracey A. ; Burford, Darren

Author_Institution :

Surrey Univ., Guildford, UK

Volume :

fYear :

1999

fDate :

1999

Firstpage :

2816

Abstract :

The categorisation of text using neural networks has been pursued in the context of emerging digital libraries. Frequency of key terms in a document set are used to create training vectors and a feature map can be trained such that texts are topologically ordered. The choice of training vectors and training regimes remains an open question in neural network research. In order to minimise training bias we have developed a method which uses the linguistic characteristics of domain-specific texts for the creation of training vectors. This method has been evaluated by classifying a standard free-text document set (TIPSTER´s SUMMAC AP news wire collection) using a Kohonen feature map. This method is particularly relevant to the domain of financial prediction, considering the large volume of news reports available to financial analysts

Keywords :

classification; financial data processing; learning (artificial intelligence); self-organising feature maps; text analysis; topology; Kohonen feature map; SUMMAC AP news wire collection; TIPSTER; digital libraries; domain-specific texts; financial analysts; financial prediction; key term frequency; minimal-bias training vectors; neural networks; text categorisation; text classification; topological ordering; training bias minimisation; training vectors; Drugs; Environmental factors; Frequency; Instruments; Neural networks; Software libraries; Terrorism; Text categorization; Vocabulary; Wire;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Neural Networks, 1999. IJCNN '99. International Joint Conference on

Conference_Location :

Washington, DC

ISSN :

1098-7576

Print_ISBN :

0-7803-5529-6

Type :

conf

DOI :

10.1109/IJCNN.1999.833528

Filename :

833528

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1856219