Title :
From information to knowledge: introducing WebStract´s knowledge engineering approach
Author :
Babowal, David ; Joerg, Werner
Author_Institution :
Dept. of Electr. & Comput. Eng., Alberta Univ., Edmonton, Alta., Canada
Abstract :
Information overload is a problem because of the overwhelming volume of data that has become accessible through the Internet or other mass communication media. It is difficult for users to sift through this data and to locate useful knowledge because large amounts of unorganized, raw data are confusing to search engines as well as people. WebStract is an experimental tool to assist in the qualification, organization and distribution of information. It offers semi-automated mechanisms to transform raw electronic data into domain knowledge and it provides multiple views for easier user consumption. After introducing WebStract, the paper focuses on the three stage transformation process that is used to transform raw data into domain knowledge. The three stages are knowledge extraction, knowledge elucidation and knowledge presentation. Knowledge extraction retrieves information from electronic documents (e.g., accessible through the WWW) and analyzes it for useful syntactical patterns that are stored in a database. Knowledge elucidation analyzes the syntactical patterns, in a semi-automated fashion, to produce a prioritized, hierarchical summary of the original documents. A fuzzy filtering mechanism allows retrieval of the stored knowledge and the resulting organized summary is presented using the familiar "book" metaphor. Each book can provide several viewpoints for users to review the information. A main application of WebStract is the support of problem based learning in on-line course delivery. WebStract is currently in its third generation of development.
Keywords :
information needs; information resources; knowledge acquisition; search engines; Internet; WebStract knowledge engineering approach; book metaphor; database; domain knowledge; electronic documents; fuzzy filtering mechanism; information distribution; information organization; information overload; information qualification; knowledge elucidation; knowledge extraction; knowledge presentation; mass communication media; multiple views; on-line course delivery; organized summary; prioritized hierarchical summary; problem based learning; raw electronic data transformation; search engines; semi-automated mechanisms; syntactical patterns; Data analysis; Data mining; Databases; Information analysis; Information retrieval; Internet; Pattern analysis; Qualifications; Search engines; World Wide Web;
Conference_Titel :
Electrical and Computer Engineering, 1999 IEEE Canadian Conference on
Conference_Location :
Edmonton, Alberta, Canada
Print_ISBN :
0-7803-5579-2
DOI :
10.1109/CCECE.1999.804938