Title :
Enabling Analysis and Measurement of Conventional Software Development Documents Using Project-Specific Formalism
Author :
Nakamura, Taiga ; Takeuchi, Hironori ; Iwama, Futoshi ; Mizuno, Ken
Author_Institution :
IBM Res. Tokyo, Yamato, Japan
Abstract :
We describe a new approach to modeling and analyzing software development documents that are typically written using conventional office applications. Our approach brings automation to content extraction, quality checking and measurement of massive document artifacts that tend to be handled by labor-intensive manual work in industry today. Rather than seeking an approach based on creation or rewriting of contents using more rigid, machine-friendly representations such as standardized formal models and restricted languages, we provide a method to deal with the diversity of document artifacts by making use of project-specific formalism that exists in target documents. We demonstrate that such project-specific formalism often tends to "naturally" exist at syntactic levels, and it is possible to define a "document model", a logical data representation gained by transformation rule from the physical, syntactic structure to the logical, semantic structure. With this transformation, various quality checking rules for completeness, consistency, traceability, etc., are realized by evaluating constraints for data items in the logical structure, and measurement of these quality aspects is automated. We developed a tool to allow a user to easily define document models and checking rules, and provide the insights on transformations when defining document models for various industry specification documents written in word processor files, spreadsheets and presentations. We also demonstrate the use of natural language processing can improve document modeling and quality checking by compensating for a weakness of formalism and applying analysis to specific parts of the target documents.
Keywords :
formal verification; software quality; word processing; content extraction; document artifact; document modeling; industry specification documents; logical data representation; logical structure; natural language processing; physical structure; project specific formalism; quality checking rules; semantic structure; software development document; spreadsheets; syntactic structure; target document; transformation rule; word processor files; Analytical models; Business; Data models; Estimation; Industries; Programming; Text analysis; document modeling; document quality metrics; project document analysis; quality; quality measurement; text analytics;
Conference_Titel :
Software Measurement, 2011 Joint Conference of the 21st Int'l Workshop on and 6th Int'l Conference on Software Process and Product Measurement (IWSM-MENSURA)
Conference_Location :
Nara
Print_ISBN :
978-1-4577-1930-1
DOI :
10.1109/IWSM-MENSURA.2011.51