• DocumentCode
    2177124
  • Title

    MUDABlue: an automatic categorization system for open source repositories

  • Author

    Kawaguchi, Shinji ; Garg, Pankaj K. ; Matsushita, Makoto ; Inoue, Katsuro

  • Author_Institution
    Graduate Sch. of Inf. Sci. & Technol., Osaka Univ., Japan
  • fYear
    2004
  • fDate
    30 Nov.-3 Dec. 2004
  • Firstpage
    184
  • Lastpage
    193
  • Abstract
    Open source communities typically use a software repository to archive various software projects with their source code, mailing list discussions, documentation, bug reports, and so forth. For example, SourceForge currently hosts over seventy thousand open source software systems. Because of the size of the rich information content, such repositories offer numerous opportunities for sharing information among projects. For example, one would like to know a set of projects that are related or similar to each other, so that the project groups can collaborate and share their work. With thousands of projects in typical repositories, however, manually locating related projects can be difficult. Hence, we propose MUDABlue, a tool that automatically categorizes software systems. MUDABlue has three major aspects: 1) it relies on no other information than the source code, 2) it determines category sets automatically, and 3) it allows a software system to be a member of multiple categories. MUDABlue has a Web interface to visualize determined categories, which eases browsing a software repository. We show the effectiveness of MUDABlue´s categorization capability by comparing its generated categories with that of some other existing research tools.
  • Keywords
    Internet; information retrieval systems; public domain software; software development management; MUDABlue; Web browsing; automatic categorization system; open source community; software archiving services; software project; software repository; Collaborative work; Documentation; Graphical user interfaces; Information science; Open source software; Software engineering; Software systems; Software tools; Visualization; Web and internet services;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Engineering Conference, 2004. 11th Asia-Pacific
  • ISSN
    1530-1362
  • Print_ISBN
    0-7695-2245-9
  • Type

    conf

  • DOI
    10.1109/APSEC.2004.69
  • Filename
    1371919