DocumentCode :
48631
Title :
Learning Phrase Patterns for Text Classification
Author :
Bin Zhang ; Marin, A. ; Hutchinson, Brian ; Ostendorf, Mari
Author_Institution :
Dept. of Electr. Eng., Univ. of Washington, Seattle, WA, USA
Volume :
21
Issue :
6
fYear :
2013
fDate :
Jun-13
Firstpage :
1180
Lastpage :
1189
Abstract :
This paper introduces methods to discriminatively learn phrase patterns for use as features in text classification. An efficient solution is described using a recursive algorithm with a mutual information selection criterion. The algorithm automatically determines when word classes are useful in specific locations of a phrase pattern, allowing for variable specificity depending on the amount of labeled data available. Experiments are carried out on three text classification tasks in both English and Chinese, resulting in improved performance when adding the phrase patterns to the existing n-gram features.
Keywords :
feature extraction; text detection; feature extractor; learning phrase pattern; mutual information selection criterion; recursive algorithm; text classification; text detection; Abstracts; Context; Feature extraction; Materials; Mutual information; Natural language processing; Pattern matching; Mutual information; natural language processing; phrase pattern; text classification;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2013.2245651
Filename :
6457440
Link To Document :
بازگشت