• DocumentCode
    3193226
  • Title

    Digging for Diamonds: Identifying Valuable Web Automation Programs in Repositories

  • Author

    Jackson, Jarrod ; Scaffidi, Christopher ; Stolee, Kathryn T.

  • Author_Institution
    Sch. of Electr. Eng. & Comput. Sci., Oregon State Univ., Corvallis, OR, USA
  • fYear
    2011
  • fDate
    26-29 April 2011
  • Firstpage
    1
  • Lastpage
    10
  • Abstract
    Web automation programs offer a means for users to enhance the usability of the web. These programs can be published on a wiki or other repository, thereby making them available for use by other users. However, in addition to programs of broad usefulness to the community at large, these repositories also contain many programs that are unreliable or highly specialized to the needs of very small sub- communities. These less valuable programs clutter the repository and make it difficult to find the valuable web automation programs. In this paper, we evaluate a machine learning model that can distinguish between high-value and low-value web automation programs. We find that the model performs well for a wide range of different languages, purposes and configurations, indicating that the model could serve as an effective basis for future repository enhancements.
  • Keywords
    Internet; Web sites; learning (artificial intelligence); natural language processing; search engines; software reliability; software reusability; Web automation programs; Web usability; different languages; high-value web automation programs; less valuable programs; low-value web automation programs; machine learning model; repository enhancements; wiki; Accuracy; Automation; Communities; Mashups; Predictive models; Search engines; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Science and Applications (ICISA), 2011 International Conference on
  • Conference_Location
    Jeju Island
  • Print_ISBN
    978-1-4244-9222-0
  • Electronic_ISBN
    978-1-4244-9223-7
  • Type

    conf

  • DOI
    10.1109/ICISA.2011.5772326
  • Filename
    5772326