DocumentCode :
1586669
Title :
Text classification in fragmented sublanguage domains
Author :
Frail, Robert P. ; Freedman, Roy S.
Author_Institution :
Dept. of Comput. Sci., Polytech. Univ., New York, NY, USA
fYear :
1991
Firstpage :
33
Lastpage :
36
Abstract :
The unique problems involved in developing text classification systems for texts that have low conceptual predictability are addressed. The authors present a shell called the FLUE (fragmented language understanding environment), which is capable of generating applications in fragmented sublanguage domains. The FLUE combines an expressive concept representation with a robust parsing technique called piecewise parsing. A common source of classification failure is unrecognized lexemes. The representation of concepts leverages differences in word class restrictions in order to learn unknown lexemes. The parser´s parallel search for concepts also gives it a large measure of immunity to the conceptual unpredictabilities of fragmented texts. Yet, the technique can be scaled to accommodate more grammatical texts, something not usually possible for other systems. Although the authors demonstrate the technique on English language texts, it is applicable to texts in other languages
Keywords :
classification; computational linguistics; grammars; natural languages; word processing; English language texts; FLUE; classification failure; conceptual unpredictabilities; expressive concept representation; fragmented language understanding environment; fragmented sublanguage domains; grammatical texts; low conceptual predictability; parallel search; piecewise parsing; robust parsing technique; text classification systems; unknown lexemes; unrecognized lexemes; word class restrictions; Artificial intelligence; Computer science; Content based retrieval; Data mining; Database languages; Electronic mail; Information retrieval; Natural language processing; Robustness; Text categorization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Artificial Intelligence Applications, 1991. Proceedings., Seventh IEEE Conference on
Conference_Location :
Miami Beach, FL
Print_ISBN :
0-8186-2135-4
Type :
conf
DOI :
10.1109/CAIA.1991.120842
Filename :
120842
Link To Document :
بازگشت