The secret life of hackathon code where does it come from and where does it go?
Background: Hackathons have become popular events for teams to collaborate on projects and develop software prototypes. Most existing research focuses on activities during an event with limited attention to the evolution of the code brought to or created during a hackathon. Aim: We aim to understand the evolution of hackathon-related code, specifically, how much hackathon teams rely on pre-existing code or how much new code they develop during a hackathon. Moreover, we aim to understand if and where that code gets reused, and what factors affect reuse. Method: We collected information about 22,183 hackathon projects from DEVPOST– a hackathon database – and obtained related code (blobs), authors, and project characteristics from the WORLD OF CODE. We investigated if code blobs in hackathon projects were created before, during, or after an event by identifying the original blob creation date and author, and also checked if the original author was a hackathon project member. We tracked code reuse by first identifying all commits containing blobs created during an event before determining all projects that contain those commits. Result: While only approximately 9.14% of the code blobs are created during hackathons, this amount is still significant considering time and member constraints of such events. Approximately a third of these code blobs get reused in other projects. The number of associated technologies and the number of participants in a project increase reuse probability. Conclusion: Our study demonstrates to what extent pre-existing code is used and new code is created during a hackathon and how much of it is reused elsewhere afterwards. Our findings help to better understand code reuse as a phenomenon and the role of hackathons in this context and can serve as a starting point for further studies in this area.
Funding
BIGDATA: Collaborative Research: IA: OSCAR - Open Source Supply Chains and Avoidance of Risk: An Evidence Based Approach to Improve FLOSS Supply Chains
Directorate for Computer & Information Science & Engineering
Find out more...CHS: Medium: Collaborative Research: SDI-CPR: Sustaining Digital Infrastructure as a Common Pool Resource
Directorate for Computer & Information Science & Engineering
Find out more...CCRI: Collaborative Research: Planning for World Of Code (WoC): An Infrastructure for Open Source Software Census
Directorate for Computer & Information Science & Engineering
Find out more...History
Publication
2021 IEEE/ACM 18th International Conference on Mining Software Repositories (MSR), pp. 68-79Publisher
Institute of Electrical and Electronics EngineersRights
© 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.”Also affiliated with
- LERO - The Science Foundation Ireland Research Centre for Software
Sustainable development goals
- (4) Quality Education