Source code analysis dataset
The data in this article pair source code with three artifacts from 108,568 projects downloaded from Github that have a redistributable license and at least 10 stars. The first set of pairs connects snippets of source code in C, C++, Java, and Python with their corresponding comments, which are extr...
Main Authors: | Ben Gelman, Banjo Obayomi, Jessica Moore, David Slater |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2019-12-01
|
Series: | Data in Brief |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2352340919310674 |
Similar Items
Similar Items
-
Collecting Vulnerable Source Code from Open-Source Repositories for Dataset Generation
by: Razvan Raducu, et al.
Published: (2020-02-01) -
Source Code Plagiarism Detection in Academia with Information Retrieval: Dataset and the Observation
by: Oscar KARNALIM, et al.
Published: (2019-10-01) -
Super-orthogonal space-time turbo coded OFDM systems.
by: Oluwafemi, Ilesanmi Banjo.
Published: (2013) -
Dataset of coded handwriting features for use in statistical modelling
by: Anna Agius, et al.
Published: (2018-02-01) -
LDGM Codes for Channel Coding and Joint Source-Channel Coding of Correlated Sources
by: Javier Garcia-Frias, et al.
Published: (2005-05-01)