DocumentCode :
234361
Title :
Mixed method for extraction of domain terminology from text: Linguistic and statistical filtering
Author :
Lamrani, El Khadir ; Ben Lahmar, El Habib ; Marzak, Abdelaziz ; Ballaoui, Hammad
Author_Institution :
Lab. de Technol. de l'Inf. et Modelisation, Univ. Hassan II - Mohammedia, Casablanca, Morocco
fYear :
2014
fDate :
20-22 Oct. 2014
Firstpage :
291
Lastpage :
295
Abstract :
Extraction of identifier terminology from a specific domain is an indispensable task in extracting information from text. In this work we propose a hybrid method for extracting complex terms from Arabic texts that combines linguistic and statistical approaches. The method relies on a deep linguistic and morphosyntactic analysis of the Arabic language to introduce a linguistic filtering algorithm for complex terms.
Keywords :
computational linguistics; information filtering; natural language processing; text analysis; Arabic language; Arabic texts; domain terminology; identifier terminology; information extraction; linguistic; statistical filtering; Data mining; Decision support systems; Filtering; Filtering algorithms; Pragmatics; Syntactics; Terminology; extraction of terminology; extraction of the information; linguistic analysis; linguistic filter; morph syntactic analysis;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Information Science and Technology (CIST), 2014 Third IEEE International Colloquium in
Conference_Location :
Tetouan
Print_ISBN :
978-1-4799-5978-5
Type :
conf
DOI :
10.1109/CIST.2014.7016634
Filename :
7016634