论文标题
这个GitHub项目是否维持?衡量开源项目的维护活动水平
Is this GitHub Project Maintained? Measuring the Level of Maintenance Activity of Open-Source Projects
论文作者
论文摘要
上下文:GitHub拥有大量高质量的OSS项目。但是,选择“工作的正确工具”是一项具有挑战性的任务,因为我们没有有关这些高质量项目的精确信息。目的:在本文中,我们提出了一种数据驱动的方法来衡量GitHub项目的维护活动水平。我们的目标是提醒用户使用不太细的项目的风险,并可能激励其他开发人员承担此类项目的维护。方法:我们训练机器学习模型来定义一个指标,以表达GitHub项目的维护活动水平。接下来,我们分析了一年的时间范围内2,927个活跃项目的历史演变。结果:从2,927个活跃项目中,一年间的16%的人都不是不到的。我们还发现,与其他语言中实施的项目相比,Objective-C项目的维护活动往往较低。最后,软件工具(例如编译器和编辑器)随着时间的推移维护活动最高。结论:关于GITHUB项目的维护活动水平的指标可以帮助开发人员选择开源项目。
Context: GitHub hosts an impressive number of high-quality OSS projects. However, selecting "the right tool for the job" is a challenging task, because we do not have precise information about those high-quality projects. Objective: In this paper, we propose a data-driven approach to measure the level of maintenance activity of GitHub projects. Our goal is to alert users about the risks of using unmaintained projects and possibly motivate other developers to assume the maintenance of such projects. Method: We train machine learning models to define a metric to express the level of maintenance activity of GitHub projects. Next, we analyze the historical evolution of 2,927 active projects in the time frame of one year. Results: From 2,927 active projects, 16% become unmaintained in the interval of one year. We also found that Objective-C projects tend to have lower maintenance activity than projects implemented in other languages. Finally, software tools---such as compilers and editors---have the highest maintenance activity over time. Conclusions: A metric about the level of maintenance activity of GitHub projects can help developers to select open source projects.
