Title :
Optimizating one-class techniques applied to verify information extractors
Author :
De Viana, Iñaki Fernández ; Abad, Pedro J. ; Álvarez, José L. ; Arjona, José L.
Author_Institution :
Dept. de Tecnol. de le Informacion, Univ. de Huelva, Huelva, Spain
Abstract :
One-class techniques are classification algorithms, largely unsupervised, that learn using a single class. The problem of verify the information obtained by an informaction extractor, could be considered as a one-class problem because the verifier is build only using instances of classes we want to extract. We propose the use of a multi-level classifier based on One-class techniques to solve the problem of verify information. The need for this new proposal arises from the bad behavior of One-class techniques when they use categorical characteristics. As we shall see, its use significantly improves the performance of all of the algorithms studied. To evaluate the performance obtained by these techniques and modifications, we use different databases proposed in the literature as well as nonparametric statistical test that will help us strengthen the statistical significance of the results achieved.
Keywords :
natural language processing; optimisation; pattern classification; classification algorithms; information extractors verification; information verification; multilevel classifier; natural language processing; optimizating one class techniques; Classification algorithms; Data mining; Databases; Media; Proposals; Silicon; Software; Clasificadores One Class; Palabras Clave-Detección de Outlier; Reconocimiento de Novedades; Web Wrapper;
Conference_Titel :
Information Systems and Technologies (CISTI), 2012 7th Iberian Conference on
Conference_Location :
Madrid
Print_ISBN :
978-1-4673-2843-2