DocumentCode :
2718629
Title :
Extraction of reported speeches from Arabic Lebanese newspapers
Author :
Al-Hajj, Moustafa ; Mourad, Ghassan
Author_Institution :
Center for Language Sci. & Commun., Lebanese Univ., Beirut, Lebanon
fYear :
2015
fDate :
April 29 2015-May 1 2015
Firstpage :
125
Lastpage :
128
Abstract :
This paper presents ongoing work on the extraction of Arabic reported speech, made by Lebanese politicians, from Arabic Lebanese newspapers. This work is part of a functional system for extraction, presentation and archiving of reported speech made by Lebanese politicians, which constitutes a valuable resource for political analysts, press agents, company researchers and political actors. The system automatically identifies about 280 reported speeches per day from about 1,000 newspaper articles, together with their referents, all are correctly identified as reported speech but only about 200 are correctly referred to their referents. The correctly identified reported speeches that refer to the correctly identified referents are then submitted to a web-based application, which is publicly accessible at http://citations-explorer.com/lpc/.
Keywords :
Internet; information retrieval; natural language processing; speech processing; Arabic Lebanese newspapers; Arabic reported speech extraction; Lebanese politicians; Web-based application; company researchers; functional system; newspaper articles; political actors; political analysts; press agents; reported speech archiving; reported speech presentation; Companies; Data mining; Internet; Manuals; Pragmatics; Presses; Speech; Arabic Language Processing; Contextual Exploration;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Digital Information and Communication Technology and its Applications (DICTAP), 2015 Fifth International Conference on
Conference_Location :
Beirut
Print_ISBN :
978-1-4799-4130-8
Type :
conf
DOI :
10.1109/DICTAP.2015.7113184
Filename :
7113184
Link To Document :
بازگشت