• DocumentCode
    3523621
  • Title

    Automated semantic annotation and retrieval based on sharable ontology and case-based learning techniques

  • Author

    Soo, Von-Wun ; Lee, Chen-Yu ; Li, Chung-Cheng ; Chen, Shu Lei ; Chen, Ching-Chih

  • Author_Institution
    Dept. of Comput. Sci., Nat. Tsing Hua Univ., Hsinchu, Taiwan
  • fYear
    2003
  • fDate
    27-31 May 2003
  • Firstpage
    61
  • Lastpage
    72
  • Abstract
    Effective information retrieval (IR) using domain knowledge and semantics is one of the major challenges in IR. We propose a framework that can facilitate image retrieval based on a sharable domain ontology and thesaurus. In particular, case-based learning (CBL) using a natural language phrase parser is proposed to convert a natural language query into resource description framework (RDF) format, a semantic-web standard of metadata description that supports machine readable semantic representation. This same parser also is extended to perform semantic annotation on the descriptive metadata of images and convert metadata automatically into the same RDF representation. The retrieval of images then can be conducted by matching the semantic and structural descriptions of the user query with those of the annotated descriptive metadata of images. We tested in our problem domain by retrieving the historical and cultural images taken from Dr. Ching-chih Chen\´s "First Emperor of China" CD-ROM (1991) as part of our productive international digital library collaboration. We have constructed and implemented the domain ontology, a Mandarin Chinese thesaurus, as well as the similarity match and retrieval algorithms in order to test our proposed framework. Our experiments have shown the feasibility and usability of these approaches.
  • Keywords
    case-based reasoning; digital libraries; grammars; image retrieval; indexing; learning (artificial intelligence); meta data; natural languages; semantic networks; string matching; thesauri; CBL technique; IR; Mandarin Chinese thesaurus; RDF format; automated semantic annotation; case-based learning; digital library; domain knowledge; image retrieval; information retrieval; machine readable semantic representation; metadata description; natural language phrase parser; natural language query; resource description framework; semantic-Web standard; sharable domain ontology; structural description matching; thesaurus; user query; Cultural differences; Image converters; Image retrieval; Information retrieval; Machine learning; Natural languages; Ontologies; Resource description framework; Testing; Thesauri;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Digital Libraries, 2003. Proceedings. 2003 Joint Conference on
  • Print_ISBN
    0-7695-1939-3
  • Type

    conf

  • DOI
    10.1109/JCDL.2003.1204844
  • Filename
    1204844