Categorization of Frequent Errors in Solution Codes Created by Novice Programmers

In recent times, e-learning has become indispensable for both technical and general education. Among all the subjects, programming education has drawn attention because of its importance for continuous development in the ICT sector. Finding errors in a solution code is a laborious task for novice pr...

Full description

Bibliographic Details
Main Authors: Rahman Md. Mostafizer, Kawabayashi Shunsuke, Watanobe Yutaka
Format: Article
Language:English
Published: EDP Sciences 2021-01-01
Series:SHS Web of Conferences
Online Access:https://www.shs-conferences.org/articles/shsconf/pdf/2021/13/shsconf_etltc2021_04014.pdf
id doaj-3585424711174569b77f6afe7835bd1e
record_format Article
spelling doaj-3585424711174569b77f6afe7835bd1e2021-05-04T12:25:01ZengEDP SciencesSHS Web of Conferences2261-24242021-01-011020401410.1051/shsconf/202110204014shsconf_etltc2021_04014Categorization of Frequent Errors in Solution Codes Created by Novice ProgrammersRahman Md. Mostafizer0Kawabayashi Shunsuke1Watanobe Yutaka2Graduate Department of Computer and Information Systems, The University of AizuGraduate Department of Computer and Information Systems, The University of AizuGraduate Department of Computer and Information Systems, The University of AizuIn recent times, e-learning has become indispensable for both technical and general education. Among all the subjects, programming education has drawn attention because of its importance for continuous development in the ICT sector. Finding errors in a solution code is a laborious task for novice programmers, teachers and instructors. Novice programmers are spending a lot of valuable time to search errors in the solution codes. In this paper, a method for the categorization of frequent errors in solution codes is presented. In the proposed method, the differences between wrong solutions and accepted solutions are used to define feature vectors for a clustering algorithm. A longest common subsequence (LCS) algorithm is leveraged to find the differences between wrong and accepted codes, then all the inequalities are converted into feature vectors. The k-mean clustering algorithm is applied to cluster the elements of the feature vector to find the most common errors in solution codes. In our experiment, the method was applied to a set of program solution codes accumulated in an e-learning system. Experimental results show that the proposed method is efficient and capable to detect the most common errors occurred in solution codes that can be helpful for novice programmers to resolve errors quickly as well as useful for teachers to prepare lesson plan.https://www.shs-conferences.org/articles/shsconf/pdf/2021/13/shsconf_etltc2021_04014.pdf
collection DOAJ
language English
format Article
sources DOAJ
author Rahman Md. Mostafizer
Kawabayashi Shunsuke
Watanobe Yutaka
spellingShingle Rahman Md. Mostafizer
Kawabayashi Shunsuke
Watanobe Yutaka
Categorization of Frequent Errors in Solution Codes Created by Novice Programmers
SHS Web of Conferences
author_facet Rahman Md. Mostafizer
Kawabayashi Shunsuke
Watanobe Yutaka
author_sort Rahman Md. Mostafizer
title Categorization of Frequent Errors in Solution Codes Created by Novice Programmers
title_short Categorization of Frequent Errors in Solution Codes Created by Novice Programmers
title_full Categorization of Frequent Errors in Solution Codes Created by Novice Programmers
title_fullStr Categorization of Frequent Errors in Solution Codes Created by Novice Programmers
title_full_unstemmed Categorization of Frequent Errors in Solution Codes Created by Novice Programmers
title_sort categorization of frequent errors in solution codes created by novice programmers
publisher EDP Sciences
series SHS Web of Conferences
issn 2261-2424
publishDate 2021-01-01
description In recent times, e-learning has become indispensable for both technical and general education. Among all the subjects, programming education has drawn attention because of its importance for continuous development in the ICT sector. Finding errors in a solution code is a laborious task for novice programmers, teachers and instructors. Novice programmers are spending a lot of valuable time to search errors in the solution codes. In this paper, a method for the categorization of frequent errors in solution codes is presented. In the proposed method, the differences between wrong solutions and accepted solutions are used to define feature vectors for a clustering algorithm. A longest common subsequence (LCS) algorithm is leveraged to find the differences between wrong and accepted codes, then all the inequalities are converted into feature vectors. The k-mean clustering algorithm is applied to cluster the elements of the feature vector to find the most common errors in solution codes. In our experiment, the method was applied to a set of program solution codes accumulated in an e-learning system. Experimental results show that the proposed method is efficient and capable to detect the most common errors occurred in solution codes that can be helpful for novice programmers to resolve errors quickly as well as useful for teachers to prepare lesson plan.
url https://www.shs-conferences.org/articles/shsconf/pdf/2021/13/shsconf_etltc2021_04014.pdf
work_keys_str_mv AT rahmanmdmostafizer categorizationoffrequenterrorsinsolutioncodescreatedbynoviceprogrammers
AT kawabayashishunsuke categorizationoffrequenterrorsinsolutioncodescreatedbynoviceprogrammers
AT watanobeyutaka categorizationoffrequenterrorsinsolutioncodescreatedbynoviceprogrammers
_version_ 1721478884503846912