Improving visualization on code repository issues for tasks understanding
Abstract. Understanding the tasks and bug locating are extremely challenging and time-consuming. Achieving a new state of the art of understanding the tasks or issues and provide a high-level visualization to the users would be an incredible asset to both developers and research communities. Open Gi...
Main Author: | |
---|---|
Format: | Dissertation |
Language: | English |
Published: |
University of Oulu
2019
|
Online Access: | http://jultika.oulu.fi/Record/nbnfioulu-201907202706 |
id |
ndltd-oulo.fi-oai-oulu.fi-nbnfioulu-201907202706 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-oulo.fi-oai-oulu.fi-nbnfioulu-2019072027062019-08-06T03:16:37ZImproving visualization on code repository issues for tasks understandingSomkiadcharoen, R. (Robroo)info:eu-repo/semantics/openAccess© Robroo Somkiadcharoen, 2019Abstract. Understanding the tasks and bug locating are extremely challenging and time-consuming. Achieving a new state of the art of understanding the tasks or issues and provide a high-level visualization to the users would be an incredible asset to both developers and research communities. Open Github archive are gathered, and the data is programmatically labelled. The Fasttext embedding model was trained to map the words to together based on semantics. Then, both CNN and RNN types of deep learning architectures are trained to classify whether each tokenized instance is a source file attribute or not. The word embedding and LSTM models worked well and did generalize in the real-world usage up to an extent. The models could achieve around 0.80 F1 scores on the test set. Along with the model, the generated usage graphs are presented that are the final output of the thesis work. Some types of issues were suitable for this workflow and did produce reasonable graphs which might be useful for the users to see the big picture of an issue.University of Oulu2019-07-12info:eu-repo/semantics/masterThesisinfo:eu-repo/semantics/publishedVersionapplication/pdfhttp://jultika.oulu.fi/Record/nbnfioulu-201907202706eng |
collection |
NDLTD |
language |
English |
format |
Dissertation |
sources |
NDLTD |
description |
Abstract. Understanding the tasks and bug locating are extremely challenging and time-consuming. Achieving a new state of the art of understanding the tasks or issues and provide a high-level visualization to the users would be an incredible asset to both developers and research communities. Open Github archive are gathered, and the data is programmatically labelled. The Fasttext embedding model was trained to map the words to together based on semantics. Then, both CNN and RNN types of deep learning architectures are trained to classify whether each tokenized instance is a source file attribute or not. The word embedding and LSTM models worked well and did generalize in the real-world usage up to an extent. The models could achieve around 0.80 F1 scores on the test set. Along with the model, the generated usage graphs are presented that are the final output of the thesis work. Some types of issues were suitable for this workflow and did produce reasonable graphs which might be useful for the users to see the big picture of an issue. |
author |
Somkiadcharoen, R. (Robroo) |
spellingShingle |
Somkiadcharoen, R. (Robroo) Improving visualization on code repository issues for tasks understanding |
author_facet |
Somkiadcharoen, R. (Robroo) |
author_sort |
Somkiadcharoen, R. (Robroo) |
title |
Improving visualization on code repository issues for tasks understanding |
title_short |
Improving visualization on code repository issues for tasks understanding |
title_full |
Improving visualization on code repository issues for tasks understanding |
title_fullStr |
Improving visualization on code repository issues for tasks understanding |
title_full_unstemmed |
Improving visualization on code repository issues for tasks understanding |
title_sort |
improving visualization on code repository issues for tasks understanding |
publisher |
University of Oulu |
publishDate |
2019 |
url |
http://jultika.oulu.fi/Record/nbnfioulu-201907202706 |
work_keys_str_mv |
AT somkiadcharoenrrobroo improvingvisualizationoncoderepositoryissuesfortasksunderstanding |
_version_ |
1719233497598525440 |