Analyzing Rich-Club Behavior in Open Source Projects

Title: Analyzing Rich-Club Behavior in Open Source Projects

Authors: Mattia Gasparini (Politecnico di Milano), Javier Luis Canovas Izquierdo (Universtat Oberto de Catalunya), Robert Clariso (Universtat Oberto de Catalunya), Marco Brambilla (Politecnico di Milano), Jordi Cabot (ICREA-UOC)

Abstract: The network of collaborations in an open source project can reveal relevant emergent properties that influence its prospects of success. In this work, we analyze open source projects to determine whether they exhibit a rich-club behavior, i.e., a phenomenon where contributors with a high number of collaborations (i.e., strongly connected within the collaboration network) are likely to cooperate with other well-connected individuals. The presence or absence of a rich-club has an impact on the sustainability and robustness of the project. For this analysis, we build and study a dataset with the 100 most popular projects in GitHub, exploiting connectivity patterns in the graph structure of collaborations that arise from commits, issues and pull requests. Results show that rich-club behavior is present in all the projects, but only few of them have an evident club structure. We compute coefficients both for single source graphs and the overall interaction graph, showing that rich-club behavior varies across different layers of software development. We provide possible explanations of our results, as well as implications for further analysis.

Download: This contribution is part of the OpenSym 2019 proceedings and is available as a PDF file.

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.