Title :
Technical documents classification
Author :
Chagheri, Samaneh ; Roussey, Catherine ; Calabretto, Sylvie ; Dumoulin, Cyril
Author_Institution :
LIRIS, Univ. de LYON, Lyon, France
Abstract :
This research is set in an industrial context, that of the CONTINEW company, which ensures the storage and security of critical data and technical documentation. The term “technical documentation” refers to documents containing product-related data and information that are used and stored for different purposes, such as user manuals and product specifications. These documents are strongly structured, but their authors have used different styles and models to construct them. Managing this growing volume of documents requires document classification, both to retrieve information quickly and to construct a standard model for each category of documents.
Keywords :
document handling; information retrieval; pattern classification; CONTINEW company; data security; data storage; technical documents classification; Computational modeling; Documentation; Feature extraction; Kernel; Support vector machine classification; XML; Structural document; document classification; support vector machine; vector space model
Conference_Title :
2011 15th International Conference on Computer Supported Cooperative Work in Design (CSCWD)
Conference_Location :
Lausanne
Print_ISBN :
978-1-4577-0386-7
DOI :
10.1109/CSCWD.2011.5960211