Title of article :
Multi-label Text Categorization using Error-correcting Output Coding with Weighted Probability
Author/Authors :
Balamurugan, V. Department of ECE - Sathyabama Institute of Science and Technology ; Vedanarayanan, V. Department of ECE - Sathyabama Institute of Science and Technology ; Sahaya Anselin Nisha, A. Department of ECE - Sathyabama Institute of Science and Technology ; Narmadha, R. Department of ECE - Sathyabama Institute of Science and Technology ; Amirthalakshmi, T. M. Department of Electronics and Communication Engineering - SRM Institute of Technology
From page :
1516
To page :
1523
Abstract :
In many real-world categorization problems, labeled data is hard to acquire even though a large amount of unlabeled data is available. Hence, it is important to devise novel approaches that choose the most valuable instances for labeling and thereby build a superior classifier. Most existing techniques are devised for binary categorization; only a limited number of algorithms are designed to handle multi-label cases. Multi-label classification is more complex because a sample may belong to multiple labels from the set of available classes. Text data, which is now abundant on the World Wide Web, is an obvious example of such tasks. This paper develops a novel technique for multi-label text categorization by modifying the Error-Correcting Output Coding (ECOC) approach. Here, a cluster of binary complementary classifiers is employed to make ECOC more effective for multi-class problems. In addition, a weighted posterior probability is computed to enhance multi-label text classification performance. The performance of the proposed ECOC with weighted probability is analyzed using precision, recall, and F-measure; it attains a maximum precision of 0.897, a recall of 0.896, and a maximum F-measure of 0.895.
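The abstract does not spell out how the weighted posterior probability is computed, so the following is only a generic sketch of weighted ECOC decoding, not the authors' method. It assumes each label has a binary codeword, each binary classifier reports a posterior probability that its bit is 1, and each classifier carries a reliability weight. The function names, the weighting scheme, and the 0.5 threshold are all hypothetical.

```python
def weighted_ecoc_scores(code_matrix, bit_probs, weights):
    """Return one weighted agreement score in [0, 1] per label.

    code_matrix: one codeword (list of 0/1 bits) per label.
    bit_probs:   posterior P(bit = 1) from each binary classifier.
    weights:     assumed per-classifier reliability weights (hypothetical).
    """
    total = sum(weights)
    scores = []
    for codeword in code_matrix:
        # A classifier agrees with the codeword to degree p when the
        # bit is 1, and to degree (1 - p) when the bit is 0.
        agree = sum(
            w * (p if bit == 1 else 1.0 - p)
            for bit, p, w in zip(codeword, bit_probs, weights)
        )
        scores.append(agree / total)
    return scores


def predict_labels(code_matrix, bit_probs, weights, threshold=0.5):
    """Multi-label decision: keep every label whose score reaches the threshold."""
    scores = weighted_ecoc_scores(code_matrix, bit_probs, weights)
    return [j for j, s in enumerate(scores) if s >= threshold]


# Toy example: three labels, three binary classifiers, equal weights.
code_matrix = [[1, 0, 1], [0, 1, 1], [1, 1, 0]]
bit_probs = [0.9, 0.2, 0.8]
weights = [1.0, 1.0, 1.0]
predicted = predict_labels(code_matrix, bit_probs, weights)
```

Because every label is scored independently against the threshold, a document can receive several labels at once, which is what separates this multi-label decoding from the usual single-winner ECOC decision.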
Keywords :
Text Categorization , Multi-label Classification , Multi-label Text Categorization , Error-correcting Output Coding , Posterior Probability
Journal title :
International Journal of Engineering
Record number :
2710349