Title :
Learning Phrase Patterns for Text Classification
Author :
Bin Zhang ; Marin, A. ; Hutchinson, Brian ; Ostendorf, Mari
Author_Institution :
Dept. of Electr. Eng., Univ. of Washington, Seattle, WA, USA
Abstract :
This paper introduces methods to discriminatively learn phrase patterns for use as features in text classification. An efficient solution is described using a recursive algorithm with a mutual information selection criterion. The algorithm automatically determines when word classes are useful in specific locations of a phrase pattern, allowing for variable specificity depending on the amount of labeled data available. Experiments are carried out on three text classification tasks in both English and Chinese, resulting in improved performance when adding the phrase patterns to the existing n-gram features.
Keywords :
feature extraction; text detection; feature extractor; learning phrase pattern; mutual information selection criterion; recursive algorithm; text classification; text detection; Abstracts; Context; Feature extraction; Materials; Mutual information; Natural language processing; Pattern matching; Mutual information; natural language processing; phrase pattern; text classification;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2013.2245651