Categorization of Frequent Errors in Solution Codes Created by Novice Programmers

In recent times, e-learning has become indispensable for both technical and general education. Among all the subjects, programming education has drawn attention because of its importance for continuous development in the ICT sector. Finding errors in a solution code is a laborious task for novice pr...

Full description

Bibliographic Details
Main Authors:	Rahman Md. Mostafizer, Kawabayashi Shunsuke, Watanobe Yutaka
Format:	Article
Language:	English
Published:	EDP Sciences 2021-01-01
Series:	SHS Web of Conferences
Online Access:	https://www.shs-conferences.org/articles/shsconf/pdf/2021/13/shsconf_etltc2021_04014.pdf

id	doaj-3585424711174569b77f6afe7835bd1e
record_format	Article
spelling	doaj-3585424711174569b77f6afe7835bd1e2021-05-04T12:25:01ZengEDP SciencesSHS Web of Conferences2261-24242021-01-011020401410.1051/shsconf/202110204014shsconf_etltc2021_04014Categorization of Frequent Errors in Solution Codes Created by Novice ProgrammersRahman Md. Mostafizer0Kawabayashi Shunsuke1Watanobe Yutaka2Graduate Department of Computer and Information Systems, The University of AizuGraduate Department of Computer and Information Systems, The University of AizuGraduate Department of Computer and Information Systems, The University of AizuIn recent times, e-learning has become indispensable for both technical and general education. Among all the subjects, programming education has drawn attention because of its importance for continuous development in the ICT sector. Finding errors in a solution code is a laborious task for novice programmers, teachers and instructors. Novice programmers are spending a lot of valuable time to search errors in the solution codes. In this paper, a method for the categorization of frequent errors in solution codes is presented. In the proposed method, the differences between wrong solutions and accepted solutions are used to define feature vectors for a clustering algorithm. A longest common subsequence (LCS) algorithm is leveraged to find the differences between wrong and accepted codes, then all the inequalities are converted into feature vectors. The k-mean clustering algorithm is applied to cluster the elements of the feature vector to find the most common errors in solution codes. In our experiment, the method was applied to a set of program solution codes accumulated in an e-learning system. Experimental results show that the proposed method is efficient and capable to detect the most common errors occurred in solution codes that can be helpful for novice programmers to resolve errors quickly as well as useful for teachers to prepare lesson plan.https://www.shs-conferences.org/articles/shsconf/pdf/2021/13/shsconf_etltc2021_04014.pdf
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Rahman Md. Mostafizer Kawabayashi Shunsuke Watanobe Yutaka
spellingShingle	Rahman Md. Mostafizer Kawabayashi Shunsuke Watanobe Yutaka Categorization of Frequent Errors in Solution Codes Created by Novice Programmers SHS Web of Conferences
author_facet	Rahman Md. Mostafizer Kawabayashi Shunsuke Watanobe Yutaka
author_sort	Rahman Md. Mostafizer
title	Categorization of Frequent Errors in Solution Codes Created by Novice Programmers
title_short	Categorization of Frequent Errors in Solution Codes Created by Novice Programmers
title_full	Categorization of Frequent Errors in Solution Codes Created by Novice Programmers
title_fullStr	Categorization of Frequent Errors in Solution Codes Created by Novice Programmers
title_full_unstemmed	Categorization of Frequent Errors in Solution Codes Created by Novice Programmers
title_sort	categorization of frequent errors in solution codes created by novice programmers
publisher	EDP Sciences
series	SHS Web of Conferences
issn	2261-2424
publishDate	2021-01-01
description	In recent times, e-learning has become indispensable for both technical and general education. Among all the subjects, programming education has drawn attention because of its importance for continuous development in the ICT sector. Finding errors in a solution code is a laborious task for novice programmers, teachers and instructors. Novice programmers are spending a lot of valuable time to search errors in the solution codes. In this paper, a method for the categorization of frequent errors in solution codes is presented. In the proposed method, the differences between wrong solutions and accepted solutions are used to define feature vectors for a clustering algorithm. A longest common subsequence (LCS) algorithm is leveraged to find the differences between wrong and accepted codes, then all the inequalities are converted into feature vectors. The k-mean clustering algorithm is applied to cluster the elements of the feature vector to find the most common errors in solution codes. In our experiment, the method was applied to a set of program solution codes accumulated in an e-learning system. Experimental results show that the proposed method is efficient and capable to detect the most common errors occurred in solution codes that can be helpful for novice programmers to resolve errors quickly as well as useful for teachers to prepare lesson plan.
url	https://www.shs-conferences.org/articles/shsconf/pdf/2021/13/shsconf_etltc2021_04014.pdf
work_keys_str_mv	AT rahmanmdmostafizer categorizationoffrequenterrorsinsolutioncodescreatedbynoviceprogrammers AT kawabayashishunsuke categorizationoffrequenterrorsinsolutioncodescreatedbynoviceprogrammers AT watanobeyutaka categorizationoffrequenterrorsinsolutioncodescreatedbynoviceprogrammers
_version_	1721478884503846912

Categorization of Frequent Errors in Solution Codes Created by Novice Programmers

Similar Items