Title :
Investigating influence of data storage organization on structured code search performance
Author :
Bernau, Daniel ; Mordvinova, Olga ; Karstens, Jan ; Hickl, Susan
Author_Institution :
Univ. of Appl. Sci. Ludwigshafen, Ludwigshafen, Germany
Abstract :
Code search in an industrial environment is driven by the programmers wish to scan huge source code repositories with high precision in a very short time. Given a challenging scenario of a huge software repository, the question for an efficient code search backend is relevant. This paper discusses the question of an appropriate data storage model for a structured code search engine applied in an industrial development scenario, where a search on large software repositories is common. To investigate this, a search engine approach with integrated Abstract Syntax Trees is adapted. Using the capabilities of a hybrid in-memory database, we stored a big amount of structured data obtained from the source code repository into column-, row-, and a hybrid store layout and performed a set of typical queries using an SQL interface on them. The results have shown the superiority of the column-oriented approach for the investigated scenario.
Keywords :
SQL; data structures; query processing; search engines; source coding; storage management; SQL interface; abstract syntax trees; column-oriented approach; data storage organization; data structure; hybrid in-memory database; hybrid store layout; industrial development scenario; software repository; source code repositories; structured code search engine; structured code search performance; Databases; Layout; Memory; Organizations; Search engines; Software; Software engineering;
Conference_Titel :
Computing, Control and Industrial Engineering (CCIE), 2011 IEEE 2nd International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-9599-3
DOI :
10.1109/CCIENG.2011.6008112