DocumentCode :
566441
Title :
Optimizating one-class techniques applied to verify information extractors
Author :
De Viana, Iñaki Fernández ; Abad, Pedro J. ; Álvarez, José L. ; Arjona, José L.
Author_Institution :
Dept. de Tecnol. de le Informacion, Univ. de Huelva, Huelva, Spain
fYear :
2012
fDate :
20-23 June 2012
Firstpage :
1
Lastpage :
6
Abstract :
One-class techniques are classification algorithms, largely unsupervised, that learn using a single class. The problem of verify the information obtained by an informaction extractor, could be considered as a one-class problem because the verifier is build only using instances of classes we want to extract. We propose the use of a multi-level classifier based on One-class techniques to solve the problem of verify information. The need for this new proposal arises from the bad behavior of One-class techniques when they use categorical characteristics. As we shall see, its use significantly improves the performance of all of the algorithms studied. To evaluate the performance obtained by these techniques and modifications, we use different databases proposed in the literature as well as nonparametric statistical test that will help us strengthen the statistical significance of the results achieved.
Keywords :
natural language processing; optimisation; pattern classification; classification algorithms; information extractors verification; information verification; multilevel classifier; natural language processing; optimizating one class techniques; Classification algorithms; Data mining; Databases; Media; Proposals; Silicon; Software; Clasificadores One Class; Palabras Clave-Detección de Outlier; Reconocimiento de Novedades; Web Wrapper;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Systems and Technologies (CISTI), 2012 7th Iberian Conference on
Conference_Location :
Madrid
ISSN :
2166-0727
Print_ISBN :
978-1-4673-2843-2
Type :
conf
Filename :
6263212
Link To Document :
بازگشت