DocumentCode :
3282046
Title :
Using a Probabilistic Neural Network for a Large Multi-label Problem
Author :
Oliveira, Elias ; Ciarelli, Patrick Marques ; Souza, Alvaro ; Badue, Claudine
Author_Institution :
Dept. of Inf. Sci., Univ. Fed. do Espirito Santo, Vitoria
fYear :
2008
fDate :
26-30 Oct. 2008
Firstpage :
195
Lastpage :
200
Abstract :
The automation of the categorization of economic activities from business descriptions in free text format is a huge challenge for the Brazilian governmental administration in the present day. When this problem is tackled by humans, the subjectivity on their classification brings another problem: different human classifiers can give different results when working on a set of the same business descriptions. This can cause a serious distortion on the information for the planning and taxation of the governmental administrations on the three levels: County, State and Federal. Furthermore, the number of possible categories considered is very large, more than 1000 in the Brazilian scenario. The large number of categories makes the problem even harder to be solved, as this is also a multi-labeled problem. In this work we compared the multi-label lazy learning technique, ML-kNN, to our probabilistic neural network approach. Our implementation overcome the ML-kNN algorithm in four metrics typically used in the literature for multi-label categorization problems.
Keywords :
commerce; government data processing; learning (artificial intelligence); neural nets; pattern classification; probability; public administration; taxation; Brazilian governmental administration; business descriptions; government planning; human classification; large multilabel categorization problem; multilabel lazy learning; probabilistic neural network; taxation; Companies; Computer science; Contracts; Humans; Law; Legal factors; Neural networks; Software libraries; US Government; US local government; Categorization multi-labeled problem; Categorization of economic activities; Content Analysis and Indexing; Information Search and Retrieval; Probabilistic Neural Network;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 2008. SBRN '08. 10th Brazilian Symposium on
Conference_Location :
Salvador
ISSN :
1522-4899
Print_ISBN :
978-1-4244-3219-6
Electronic_ISBN :
1522-4899
Type :
conf
DOI :
10.1109/SBRN.2008.38
Filename :
4665915
Link To Document :
بازگشت