DocumentCode
2718629
Title
Extraction of reported speeches from Arabic Lebanese newspapers
Author
Al-Hajj, Moustafa ; Mourad, Ghassan
Author_Institution
Center for Language Sci. & Commun., Lebanese Univ., Beirut, Lebanon
fYear
2015
fDate
April 29 2015-May 1 2015
Firstpage
125
Lastpage
128
Abstract
This paper presents ongoing work on the extraction of Arabic reported speech, made by Lebanese politicians, from Arabic Lebanese newspapers. This work is part of a functional system for extraction, presentation and archiving of reported speech made by Lebanese politicians, which constitutes a valuable resource for political analysts, press agents, company researchers and political actors. The system automatically identifies about 280 reported speeches per day from about 1,000 newspaper articles, together with their referents, all are correctly identified as reported speech but only about 200 are correctly referred to their referents. The correctly identified reported speeches that refer to the correctly identified referents are then submitted to a web-based application, which is publicly accessible at http://citations-explorer.com/lpc/.
Keywords
Internet; information retrieval; natural language processing; speech processing; Arabic Lebanese newspapers; Arabic reported speech extraction; Lebanese politicians; Web-based application; company researchers; functional system; newspaper articles; political actors; political analysts; press agents; reported speech archiving; reported speech presentation; Companies; Data mining; Internet; Manuals; Pragmatics; Presses; Speech; Arabic Language Processing; Contextual Exploration;
fLanguage
English
Publisher
ieee
Conference_Titel
Digital Information and Communication Technology and its Applications (DICTAP), 2015 Fifth International Conference on
Conference_Location
Beirut
Print_ISBN
978-1-4799-4130-8
Type
conf
DOI
10.1109/DICTAP.2015.7113184
Filename
7113184
Link To Document