Paper Title

The Unexplored Treasure Trove of Phabricator Code Review

Authors

Gunnar Kudrjavets, Nachiappan Nagappan, Ayushi Rastogi

Abstract

Phabricator is a modern code collaboration tool used by popular projects like FreeBSD and Mozilla. However, unlike the other well-known code review environments, such as Gerrit or GitHub, there is no readily accessible public code review dataset for Phabricator. This paper describes our experience mining code reviews from five different projects that use Phabricator (Blender, FreeBSD, KDE, LLVM, and Mozilla). We discuss the challenges associated with the data retrieval process and our solutions, resulting in a dataset with details regarding 317,476 Phabricator code reviews. Our dataset is available in both JSON and MySQL database dump formats. The dataset enables analyses of the history of code reviews at a more granular level than other platforms. In addition, given that the projects we mined are publicly accessible via the Conduit API, our dataset can be used as a foundation to fetch additional details and insights.
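As an illustration of how additional details might be fetched through the Conduit API, the following is a minimal sketch, not the authors' mining pipeline. It assumes the standard Conduit method `differential.revision.search`, a form-encoded POST with an `api.token` field, and cursor-based paging via `result.cursor.after`. The host name, the token value, and the helper names `http_post` and `fetch_revisions` are hypothetical.

```python
import json
import urllib.parse
import urllib.request


def http_post(url, fields):
    """Send a form-encoded POST and return the decoded JSON response."""
    body = urllib.parse.urlencode(fields).encode("utf-8")
    with urllib.request.urlopen(url, data=body) as response:
        return json.load(response)


def fetch_revisions(base_url, api_token, post=http_post, page_size=100):
    """Yield revisions one by one, following the Conduit result cursor.

    Assumption: the server answers with
    {"result": {"data": [...], "cursor": {"after": ...}}, "error_code": ...}.
    """
    url = base_url.rstrip("/") + "/api/differential.revision.search"
    after = None
    while True:
        fields = {"api.token": api_token, "limit": page_size}
        if after is not None:
            fields["after"] = after  # resume from the previous page
        payload = post(url, fields)
        if payload.get("error_code"):
            raise RuntimeError(
                f"{payload['error_code']}: {payload.get('error_info')}"
            )
        result = payload["result"]
        yield from result["data"]
        after = result["cursor"]["after"]
        if after is None:  # no further pages
            break
```

A call such as `for rev in fetch_revisions("https://phabricator.example.org", "api-xxxx"): ...` would then iterate over revisions page by page. The `post` parameter is injectable so that the paging logic can be exercised without network access, and a real crawl would likely also need rate limiting and retries.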
