DocumentCode
48631
Title
Learning Phrase Patterns for Text Classification
Author
Bin Zhang ; Marin, A. ; Hutchinson, Brian ; Ostendorf, Mari
Author_Institution
Dept. of Electr. Eng., Univ. of Washington, Seattle, WA, USA
Volume
21
Issue
6
fYear
2013
fDate
Jun-13
Firstpage
1180
Lastpage
1189
Abstract
This paper introduces methods to discriminatively learn phrase patterns for use as features in text classification. An efficient solution is described using a recursive algorithm with a mutual information selection criterion. The algorithm automatically determines when word classes are useful in specific locations of a phrase pattern, allowing for variable specificity depending on the amount of labeled data available. Experiments are carried out on three text classification tasks in both English and Chinese, resulting in improved performance when adding the phrase patterns to the existing n-gram features.
Keywords
feature extraction; text detection; feature extractor; learning phrase pattern; mutual information selection criterion; recursive algorithm; text classification; text detection; Abstracts; Context; Feature extraction; Materials; Mutual information; Natural language processing; Pattern matching; Mutual information; natural language processing; phrase pattern; text classification;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2013.2245651
Filename
6457440
Link To Document