DocumentCode :
144450
Title :
Research on Novel Methodology for Documentation Taxonomy for English IDIOLECT to GAIN Lesser Ambiguity Results
Author :
Lohi, Snehal A. ; Motwani, Anand
fYear :
2014
fDate :
7-9 April 2014
Firstpage :
369
Lastpage :
372
Abstract :
Information in the form of textual documents would have a huge growth due to the factors like digitization of libraries, exponential rise in internet usage, use of e-mail for various reasons, acceptability of soft copies for many works even critical financial work and other things. In view of these reasons, this paper discusses a new approach for Documentation Taxonomy (DT) based on combining efficient algorithms to be used for english language. The important aspect of automatically classifying and providing taxonomy to a set of documents into any taxonomical structure with the help of predefined categories is termed as Documentation Taxonomy. Automated Documentation Taxonomy is gaining prominence since it frees organizations from the frantic and time consuming need of manually organizing documents. This manual process can be expensive and given the time constraints of the application or the number of documents involved, simply not feasible. In terms of accuracy, modern documentation taxonomy systems proves better than that of trained human professionals, which is made possible by a combination of information retrieval technology and machine learning technology. This research put forwards a mechanism that shows that the use of combination of different algorithms brings down the ambiguity in taxonomical issues. There are numerable useful applications of this approach spanning various scientific and general fields of work.
Keywords :
inference mechanisms; information retrieval; learning (artificial intelligence); natural language processing; pattern classification; text analysis; English idiolect; English language; Internet usage; deductive inference; document classification; document organization; documentation taxonomy; e-mail; information retrieval technology; library digitization; machine learning technology; taxonomical structure; textual documents; Communication systems; Deductive Inference; Documentation Taxonomy; Lazy Evaluation Method (Fast-KNN); Space Reduction; pair-relation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communication Systems and Network Technologies (CSNT), 2014 Fourth International Conference on
Conference_Location :
Bhopal
Print_ISBN :
978-1-4799-3069-2
Type :
conf
DOI :
10.1109/CSNT.2014.79
Filename :
6821420
Link To Document :
بازگشت