Explaining Individual and Collective Programming Students’ Behavior by Interpreting a Black-Box Predictive Model

Predicting student performance as early as possible and analysing to which extent initial student behaviour could lead to failure or success is critical in introductory programming (CS1) courses, for allowing prompt intervention in a move towards alleviating their high failure rate. However, in CS1...

Full description

Bibliographic Details
Main Authors:	Filipe Dwan Pereira, Samuel C. Fonseca, Elaine H. T. Oliveira, Alexandra I. Cristea, Henrik Bellhauser, Luiz Rodrigues, David B. F. Oliveira, Seiji Isotani, Leandro S. G. Carvalho
Format:	Article
Language:	English
Published:	IEEE 2021-01-01
Series:	IEEE Access
Subjects:	Explainable artificial intelligence online judges learning analytics CS1 computing in education early prediction
Online Access:	https://ieeexplore.ieee.org/document/9517104/

id	doaj-7618ab10e2ba4630a140e36ea8cc49b9
record_format	Article
spelling	doaj-7618ab10e2ba4630a140e36ea8cc49b92021-08-27T23:00:29ZengIEEEIEEE Access2169-35362021-01-01911709711711910.1109/ACCESS.2021.31059569517104Explaining Individual and Collective Programming Students’ Behavior by Interpreting a Black-Box Predictive ModelFilipe Dwan Pereira0https://orcid.org/0000-0003-4914-3347Samuel C. Fonseca1Elaine H. T. Oliveira2https://orcid.org/0000-0003-2884-9359Alexandra I. Cristea3Henrik Bellhauser4https://orcid.org/0000-0003-4414-7795Luiz Rodrigues5David B. F. Oliveira6Seiji Isotani7https://orcid.org/0000-0003-1574-0784Leandro S. G. Carvalho8https://orcid.org/0000-0003-2970-2084Department of Computer Science, Federal University of Roraima, Boa Vista, BrazilInstitute of Computing, Federal University of Amazonas, Manaus, BrazilInstitute of Computing, Federal University of Amazonas, Manaus, BrazilDepartment of Computer Science, Durham University, Durham, U.K.Department of Psychology, Johannes Gutenberg-University Mainz, Mainz, GermanyInstitute of Mathematics and Computer Science, University of São Paulo, São Carlos, BrazilInstitute of Computing, Federal University of Amazonas, Manaus, BrazilInstitute of Mathematics and Computer Science, University of São Paulo, São Carlos, BrazilInstitute of Computing, Federal University of Amazonas, Manaus, BrazilPredicting student performance as early as possible and analysing to which extent initial student behaviour could lead to failure or success is critical in introductory programming (CS1) courses, for allowing prompt intervention in a move towards alleviating their high failure rate. However, in CS1 performance prediction, there is a serious lack of studies that interpret the predictive model’s decisions. In this sense, we designed a long-term study using very fine-grained log-data of 2056 students, collected from the first two weeks of CS1 courses. We extract features that measure how students deal with deadlines, how they fix errors, how much time they spend programming, and so forth. Subsequently, we construct a predictive model that achieved cutting-edge results with area under the curve (AUC) of.89, and an average accuracy of 81.3%. To allow an effective intervention and to facilitate human-AI collaboration towards prescriptive analytics, we, for the first time, to the best of our knowledge, go a step further than the prediction itself and leverage this field by proposing an approach to explaining our predictive model decisions individually and collectively using a game-theory based framework (SHAP), (Lundberg <italic>et al.</italic>, 2020) that allows interpreting our black-box non-linear model linearly. In other words, we explain the feature effects, clearly by visualising and analysing individual predictions, the overall importance of features, and identification of typical prediction paths. This method can be further applied to other emerging competitive models, as the CS1 prediction field progresses, ensuring transparency of the process for key stakeholders: administrators, teachers, and students.https://ieeexplore.ieee.org/document/9517104/Explainable artificial intelligenceonline judgeslearning analyticsCS1computing in educationearly prediction
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Filipe Dwan Pereira Samuel C. Fonseca Elaine H. T. Oliveira Alexandra I. Cristea Henrik Bellhauser Luiz Rodrigues David B. F. Oliveira Seiji Isotani Leandro S. G. Carvalho
spellingShingle	Filipe Dwan Pereira Samuel C. Fonseca Elaine H. T. Oliveira Alexandra I. Cristea Henrik Bellhauser Luiz Rodrigues David B. F. Oliveira Seiji Isotani Leandro S. G. Carvalho Explaining Individual and Collective Programming Students’ Behavior by Interpreting a Black-Box Predictive Model IEEE Access Explainable artificial intelligence online judges learning analytics CS1 computing in education early prediction
author_facet	Filipe Dwan Pereira Samuel C. Fonseca Elaine H. T. Oliveira Alexandra I. Cristea Henrik Bellhauser Luiz Rodrigues David B. F. Oliveira Seiji Isotani Leandro S. G. Carvalho
author_sort	Filipe Dwan Pereira
title	Explaining Individual and Collective Programming Students’ Behavior by Interpreting a Black-Box Predictive Model
title_short	Explaining Individual and Collective Programming Students’ Behavior by Interpreting a Black-Box Predictive Model
title_full	Explaining Individual and Collective Programming Students’ Behavior by Interpreting a Black-Box Predictive Model
title_fullStr	Explaining Individual and Collective Programming Students’ Behavior by Interpreting a Black-Box Predictive Model
title_full_unstemmed	Explaining Individual and Collective Programming Students’ Behavior by Interpreting a Black-Box Predictive Model
title_sort	explaining individual and collective programming students’ behavior by interpreting a black-box predictive model
publisher	IEEE
series	IEEE Access
issn	2169-3536
publishDate	2021-01-01
description	Predicting student performance as early as possible and analysing to which extent initial student behaviour could lead to failure or success is critical in introductory programming (CS1) courses, for allowing prompt intervention in a move towards alleviating their high failure rate. However, in CS1 performance prediction, there is a serious lack of studies that interpret the predictive model’s decisions. In this sense, we designed a long-term study using very fine-grained log-data of 2056 students, collected from the first two weeks of CS1 courses. We extract features that measure how students deal with deadlines, how they fix errors, how much time they spend programming, and so forth. Subsequently, we construct a predictive model that achieved cutting-edge results with area under the curve (AUC) of.89, and an average accuracy of 81.3%. To allow an effective intervention and to facilitate human-AI collaboration towards prescriptive analytics, we, for the first time, to the best of our knowledge, go a step further than the prediction itself and leverage this field by proposing an approach to explaining our predictive model decisions individually and collectively using a game-theory based framework (SHAP), (Lundberg <italic>et al.</italic>, 2020) that allows interpreting our black-box non-linear model linearly. In other words, we explain the feature effects, clearly by visualising and analysing individual predictions, the overall importance of features, and identification of typical prediction paths. This method can be further applied to other emerging competitive models, as the CS1 prediction field progresses, ensuring transparency of the process for key stakeholders: administrators, teachers, and students.
topic	Explainable artificial intelligence online judges learning analytics CS1 computing in education early prediction
url	https://ieeexplore.ieee.org/document/9517104/
work_keys_str_mv	AT filipedwanpereira explainingindividualandcollectiveprogrammingstudentsx2019behaviorbyinterpretingablackboxpredictivemodel AT samuelcfonseca explainingindividualandcollectiveprogrammingstudentsx2019behaviorbyinterpretingablackboxpredictivemodel AT elainehtoliveira explainingindividualandcollectiveprogrammingstudentsx2019behaviorbyinterpretingablackboxpredictivemodel AT alexandraicristea explainingindividualandcollectiveprogrammingstudentsx2019behaviorbyinterpretingablackboxpredictivemodel AT henrikbellhauser explainingindividualandcollectiveprogrammingstudentsx2019behaviorbyinterpretingablackboxpredictivemodel AT luizrodrigues explainingindividualandcollectiveprogrammingstudentsx2019behaviorbyinterpretingablackboxpredictivemodel AT davidbfoliveira explainingindividualandcollectiveprogrammingstudentsx2019behaviorbyinterpretingablackboxpredictivemodel AT seijiisotani explainingindividualandcollectiveprogrammingstudentsx2019behaviorbyinterpretingablackboxpredictivemodel AT leandrosgcarvalho explainingindividualandcollectiveprogrammingstudentsx2019behaviorbyinterpretingablackboxpredictivemodel
_version_	1721187985695703040

Explaining Individual and Collective Programming Students&#x2019; Behavior by Interpreting a Black-Box Predictive Model

Similar Items

Explaining Individual and Collective Programming Students’ Behavior by Interpreting a Black-Box Predictive Model