• DocumentCode
    566561
  • Title

    An arabic information management messaging system: Using information extraction for the proper flow of information within organizations

  • Author

    Al-Anesi, Bushra Abdullah ; Thabit, Khalid Omar

  • Author_Institution
    Fac. of Comput. & Inf. Technol., King Abdulaziz Univ., Jeddah, Saudi Arabia
  • Volume
    1
  • fYear
    2012
  • fDate
    24-26 April 2012
  • Firstpage
    321
  • Lastpage
    326
  • Abstract
    The current method for passing information at King Abdulaziz University (KAU) is by using general announcements; whereby the general announcement is written on a certain topic and then forwarded on to all employees regardless of whether or not each employee has an interest or need for the general announcement. If the university were to decide to only send the general announcement out to only the interested parties, the job for the person assigned this task would be long and arduous. The proposed solution to this problem is to automate this process. This can be done through the use of the Information Management Messaging System (IMMS). The automation of this process will result in a savings in time and money for the organization. KAU´s general announcements are written in the Arabic language. Therefore, the IMMS will make use of the information extraction technology and apply it to Arabic text in the form of organizational general announcements. Little work on Arabic information extraction has been done. There were many obstacles to overcome when applying the first task of information extraction, which is the named-entity recognition task, to the Arabic language. The IMMS, however, performed fairly well when it was applied on Arabic general announcements. The IMMS received an f-measure score of 84% when applied to unstructured text and a score of 97% when applied to semi-structured text. The general announcements were made up of two types of text: unstructured text and semi-structured text. The IMMS recognizes a total of seven named-entities in unstructured text; they are: names of people, organizations, locations, Hijjri dates, Gregorian dates, web addresses and email addresses. In the semi-structured text, the IMMS recognizes a total of nineteen named-entities; they are: organizations, locations, job-titles, faculties, centers, departments, deanships, majors, administrations, job Hijjri dates (two dates), contact addresses (two types), job application Hijjri date- (two dates), contact number, interview Hijjri date, interview Gregorian date, and email addresses. Eight of the named-entities from the semi-structured text received f-measures of 100%.
  • Keywords
    educational administrative data processing; educational institutions; electronic messaging; information management; natural language processing; text analysis; Arabic general announcement; Arabic information extraction; Arabic information management messaging system; Arabic language; Arabic text; Gregorian dates recognition; King Abdulaziz University; Web addresses recognition; administration recognition; center recognition; contact address recognition; contact number recognition; deanship recognition; department recognition; email address recognition; email addresses recognition; f-measure score; faculties recognition; information passing; interview Gregorian date recognition; interview Hijjri date recognition; job Hijjri date recognition; job application Hijjri dates recognition; job-titles recognition; location recognition; named-entity recognition task; organization recognition; organizational general announcement; people name recognition; semistructured text; unstructured text; Electronic mail; Interviews; Organizations; Postal services; arabic named-entity recognition; information extraction; information management; information messaging;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computing Technology and Information Management (ICCM), 2012 8th International Conference on
  • Conference_Location
    Seoul
  • Print_ISBN
    978-1-4673-0893-9
  • Type

    conf

  • Filename
    6268516