• DocumentCode
    469253
  • Title

    Ontology Based Structured Representation for Domain Specific Unstructured Documents

  • Author

    Shashirekha, H.L. ; Murali, S.

  • Author_Institution
    P.E.S. Coll. of Eng., Mandya
  • Volume
    1
  • fYear
    2007
  • fDate
    13-15 Dec. 2007
  • Firstpage
    50
  • Lastpage
    54
  • Abstract
    Extracting information from brief, unstructured text composed of short phrases, incomplete sentences, unordered word sequences, and abbreviated words that follow no regular syntax is a challenging task. This paper describes an approach to automatically extract information from data-rich unstructured text documents, based on a domain-dependent ontology, and populate a database. Pattern matching on keywords/constants is applied to extract the patterns and generate a structured text representation with respect to a domain-specific ontology. The approach is illustrated on one such brief, unstructured text: the classified matrimonial advertisement. A performance analysis of the approach on this case study is presented.
  • Keywords
    data structures; document handling; information retrieval; ontologies (artificial intelligence); text analysis; classified matrimonial advertisement; domain dependent ontology; domain specific unstructured documents; information extraction; ontology based structured representation; pattern matching; structured text representation; text documents; Computational intelligence; Data mining; Databases; Educational institutions; IEEE news; Information retrieval; Natural languages; Ontologies; Pattern matching; Text recognition;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    International Conference on Computational Intelligence and Multimedia Applications, 2007
  • Conference_Location
    Sivakasi, Tamil Nadu
  • Print_ISBN
    0-7695-3050-8
  • Type

    conf

  • DOI
    10.1109/ICCIMA.2007.255
  • Filename
    4426552
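
The abstract above describes keyword/constant pattern matching against a domain ontology to turn a short advertisement into a structured record. The record does not give the paper's actual ontology or patterns, so the following is only a minimal sketch under stated assumptions: the slot names, the keyword lists, the two regular expressions, and the function name extract_record are all hypothetical and chosen purely for illustration.

```python
import re

# Hypothetical mini-ontology: each slot maps to keywords/constants that signal it.
# Slots and keywords are illustrative assumptions, not taken from the paper.
ONTOLOGY = {
    "religion": ["hindu", "muslim", "christian"],
    "profession": ["engineer", "doctor", "teacher"],
    "marital_status": ["unmarried", "divorced", "widowed"],
}

# Assumed surface patterns for numeric attributes (also illustrative).
AGE_PATTERN = re.compile(r"\b(\d{2})\s*(?:yrs?|years?)\b", re.IGNORECASE)
HEIGHT_PATTERN = re.compile(r"\b(\d{3})\s*cm\b", re.IGNORECASE)


def extract_record(ad_text):
    """Return a dict with one entry per ontology slot found in the ad text."""
    record = {}
    lowered = ad_text.lower()

    # Keyword matching: the first keyword found for a slot fills that slot.
    for slot, keywords in ONTOLOGY.items():
        for keyword in keywords:
            if re.search(r"\b" + re.escape(keyword) + r"\b", lowered):
                record[slot] = keyword
                break

    # Constant matching: numeric values identified by their unit words.
    age = AGE_PATTERN.search(ad_text)
    if age:
        record["age"] = int(age.group(1))
    height = HEIGHT_PATTERN.search(ad_text)
    if height:
        record["height_cm"] = int(height.group(1))

    return record


if __name__ == "__main__":
    ad = "Hindu boy 28 yrs 175 cm engineer unmarried seeks suitable match"
    print(extract_record(ad))
```

Because matching is done per slot rather than by position, the word order of the advertisement does not matter, which suits text with no regular syntax. Each returned dict could then be written as one database row, with absent slots left empty.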