Source code analysis dataset

The data in this article pair source code with three artifacts from 108,568 projects downloaded from GitHub that have a redistributable license and at least 10 stars. The first set of pairs connects snippets of source code in C, C++, Java, and Python with their corresponding comments, which are extr...


Bibliographic Details
Main Authors: Ben Gelman, Banjo Obayomi, Jessica Moore, David Slater
Format: Article
Language: English
Published: Elsevier 2019-12-01
Series: Data in Brief
Online Access: http://www.sciencedirect.com/science/article/pii/S2352340919310674