Improving visualization on code repository issues for tasks understanding

Abstract. Understanding the tasks and bug locating are extremely challenging and time-consuming. Achieving a new state of the art of understanding the tasks or issues and provide a high-level visualization to the users would be an incredible asset to both developers and research communities. Open Gi...

Full description

Bibliographic Details
Main Author: Somkiadcharoen, R. (Robroo)
Format: Dissertation
Language:English
Published: University of Oulu 2019
Online Access:http://jultika.oulu.fi/Record/nbnfioulu-201907202706
id ndltd-oulo.fi-oai-oulu.fi-nbnfioulu-201907202706
record_format oai_dc
spelling ndltd-oulo.fi-oai-oulu.fi-nbnfioulu-2019072027062019-08-06T03:16:37ZImproving visualization on code repository issues for tasks understandingSomkiadcharoen, R. (Robroo)info:eu-repo/semantics/openAccess© Robroo Somkiadcharoen, 2019Abstract. Understanding the tasks and bug locating are extremely challenging and time-consuming. Achieving a new state of the art of understanding the tasks or issues and provide a high-level visualization to the users would be an incredible asset to both developers and research communities. Open Github archive are gathered, and the data is programmatically labelled. The Fasttext embedding model was trained to map the words to together based on semantics. Then, both CNN and RNN types of deep learning architectures are trained to classify whether each tokenized instance is a source file attribute or not. The word embedding and LSTM models worked well and did generalize in the real-world usage up to an extent. The models could achieve around 0.80 F1 scores on the test set. Along with the model, the generated usage graphs are presented that are the final output of the thesis work. Some types of issues were suitable for this workflow and did produce reasonable graphs which might be useful for the users to see the big picture of an issue.University of Oulu2019-07-12info:eu-repo/semantics/masterThesisinfo:eu-repo/semantics/publishedVersionapplication/pdfhttp://jultika.oulu.fi/Record/nbnfioulu-201907202706eng
collection NDLTD
language English
format Dissertation
sources NDLTD
description Abstract. Understanding the tasks and bug locating are extremely challenging and time-consuming. Achieving a new state of the art of understanding the tasks or issues and provide a high-level visualization to the users would be an incredible asset to both developers and research communities. Open Github archive are gathered, and the data is programmatically labelled. The Fasttext embedding model was trained to map the words to together based on semantics. Then, both CNN and RNN types of deep learning architectures are trained to classify whether each tokenized instance is a source file attribute or not. The word embedding and LSTM models worked well and did generalize in the real-world usage up to an extent. The models could achieve around 0.80 F1 scores on the test set. Along with the model, the generated usage graphs are presented that are the final output of the thesis work. Some types of issues were suitable for this workflow and did produce reasonable graphs which might be useful for the users to see the big picture of an issue.
author Somkiadcharoen, R. (Robroo)
spellingShingle Somkiadcharoen, R. (Robroo)
Improving visualization on code repository issues for tasks understanding
author_facet Somkiadcharoen, R. (Robroo)
author_sort Somkiadcharoen, R. (Robroo)
title Improving visualization on code repository issues for tasks understanding
title_short Improving visualization on code repository issues for tasks understanding
title_full Improving visualization on code repository issues for tasks understanding
title_fullStr Improving visualization on code repository issues for tasks understanding
title_full_unstemmed Improving visualization on code repository issues for tasks understanding
title_sort improving visualization on code repository issues for tasks understanding
publisher University of Oulu
publishDate 2019
url http://jultika.oulu.fi/Record/nbnfioulu-201907202706
work_keys_str_mv AT somkiadcharoenrrobroo improvingvisualizationoncoderepositoryissuesfortasksunderstanding
_version_ 1719233497598525440