DocumentCode :
813696
Title :
On embedding machine-processable semantics into documents
Author :
Thirunarayan, Krishnaprasad
Author_Institution :
Dept. of Comput. Sci. & Eng., Wright State Univ., Dayton, OH, USA
Volume :
17
Issue :
7
fYear :
2005
fDate :
7/1/2005
Firstpage :
1014
Lastpage :
1018
Abstract :
Most Web and legacy paper-based documents are available in human-comprehensible text form, not readily accessible to or understood by computer programs. Here, we investigate an approach that amalgamates XML technology with programming languages for representational purposes and can enhance traceability, thereby facilitating semiautomatic extraction and update. Specifically, we propose a modular technique to embed machine-processable semantics into a text document with tabular data via annotations, sometimes resulting in ill-formed XML fragments, and evaluate this technique vis-à-vis document querying, manipulation, and integration. The ultimate aim is to be able to author and extract the human-readable and machine-comprehensible parts of a document hand in hand and keep them side by side.
Keywords :
XML; authoring systems; data mining; knowledge representation languages; programming language semantics; semantic Web; Web-based documents; XML-based programming language; document integration; document manipulation; document querying; human comprehensible text; knowledge representation; machine-processable semantics embedding; paper-based documents; semiautomatic extraction; semiautomatic update; structured data; Computer languages; Documentation; Footwear; Humans; Natural languages; Web sites; Structured data and knowledge representation
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
IEEE
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2005.113
Filename :
1432709