Explaining Individual and Collective Programming Students’ Behavior by Interpreting a Black-Box Predictive Model
Predicting student performance as early as possible and analysing to which extent initial student behaviour could lead to failure or success is critical in introductory programming (CS1) courses, for allowing prompt intervention in a move towards alleviating their high failure rate. However, in CS1...
Main Authors: | , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2021-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9517104/ |
id |
doaj-7618ab10e2ba4630a140e36ea8cc49b9 |
---|---|
record_format |
Article |
spelling |
doaj-7618ab10e2ba4630a140e36ea8cc49b92021-08-27T23:00:29ZengIEEEIEEE Access2169-35362021-01-01911709711711910.1109/ACCESS.2021.31059569517104Explaining Individual and Collective Programming Students’ Behavior by Interpreting a Black-Box Predictive ModelFilipe Dwan Pereira0https://orcid.org/0000-0003-4914-3347Samuel C. Fonseca1Elaine H. T. Oliveira2https://orcid.org/0000-0003-2884-9359Alexandra I. Cristea3Henrik Bellhauser4https://orcid.org/0000-0003-4414-7795Luiz Rodrigues5David B. F. Oliveira6Seiji Isotani7https://orcid.org/0000-0003-1574-0784Leandro S. G. Carvalho8https://orcid.org/0000-0003-2970-2084Department of Computer Science, Federal University of Roraima, Boa Vista, BrazilInstitute of Computing, Federal University of Amazonas, Manaus, BrazilInstitute of Computing, Federal University of Amazonas, Manaus, BrazilDepartment of Computer Science, Durham University, Durham, U.K.Department of Psychology, Johannes Gutenberg-University Mainz, Mainz, GermanyInstitute of Mathematics and Computer Science, University of São Paulo, São Carlos, BrazilInstitute of Computing, Federal University of Amazonas, Manaus, BrazilInstitute of Mathematics and Computer Science, University of São Paulo, São Carlos, BrazilInstitute of Computing, Federal University of Amazonas, Manaus, BrazilPredicting student performance as early as possible and analysing to which extent initial student behaviour could lead to failure or success is critical in introductory programming (CS1) courses, for allowing prompt intervention in a move towards alleviating their high failure rate. However, in CS1 performance prediction, there is a serious lack of studies that interpret the predictive model’s decisions. In this sense, we designed a long-term study using very fine-grained log-data of 2056 students, collected from the first two weeks of CS1 courses. We extract features that measure how students deal with deadlines, how they fix errors, how much time they spend programming, and so forth. Subsequently, we construct a predictive model that achieved cutting-edge results with area under the curve (AUC) of.89, and an average accuracy of 81.3%. To allow an effective intervention and to facilitate human-AI collaboration towards prescriptive analytics, we, for the first time, to the best of our knowledge, go a step further than the prediction itself and leverage this field by proposing an approach to explaining our predictive model decisions individually and collectively using a game-theory based framework (SHAP), (Lundberg <italic>et al.</italic>, 2020) that allows interpreting our black-box non-linear model linearly. In other words, we explain the feature effects, clearly by visualising and analysing individual predictions, the overall importance of features, and identification of typical prediction paths. This method can be further applied to other emerging competitive models, as the CS1 prediction field progresses, ensuring transparency of the process for key stakeholders: administrators, teachers, and students.https://ieeexplore.ieee.org/document/9517104/Explainable artificial intelligenceonline judgeslearning analyticsCS1computing in educationearly prediction |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Filipe Dwan Pereira Samuel C. Fonseca Elaine H. T. Oliveira Alexandra I. Cristea Henrik Bellhauser Luiz Rodrigues David B. F. Oliveira Seiji Isotani Leandro S. G. Carvalho |
spellingShingle |
Filipe Dwan Pereira Samuel C. Fonseca Elaine H. T. Oliveira Alexandra I. Cristea Henrik Bellhauser Luiz Rodrigues David B. F. Oliveira Seiji Isotani Leandro S. G. Carvalho Explaining Individual and Collective Programming Students’ Behavior by Interpreting a Black-Box Predictive Model IEEE Access Explainable artificial intelligence online judges learning analytics CS1 computing in education early prediction |
author_facet |
Filipe Dwan Pereira Samuel C. Fonseca Elaine H. T. Oliveira Alexandra I. Cristea Henrik Bellhauser Luiz Rodrigues David B. F. Oliveira Seiji Isotani Leandro S. G. Carvalho |
author_sort |
Filipe Dwan Pereira |
title |
Explaining Individual and Collective Programming Students’ Behavior by Interpreting a Black-Box Predictive Model |
title_short |
Explaining Individual and Collective Programming Students’ Behavior by Interpreting a Black-Box Predictive Model |
title_full |
Explaining Individual and Collective Programming Students’ Behavior by Interpreting a Black-Box Predictive Model |
title_fullStr |
Explaining Individual and Collective Programming Students’ Behavior by Interpreting a Black-Box Predictive Model |
title_full_unstemmed |
Explaining Individual and Collective Programming Students’ Behavior by Interpreting a Black-Box Predictive Model |
title_sort |
explaining individual and collective programming students’ behavior by interpreting a black-box predictive model |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2021-01-01 |
description |
Predicting student performance as early as possible and analysing to which extent initial student behaviour could lead to failure or success is critical in introductory programming (CS1) courses, for allowing prompt intervention in a move towards alleviating their high failure rate. However, in CS1 performance prediction, there is a serious lack of studies that interpret the predictive model’s decisions. In this sense, we designed a long-term study using very fine-grained log-data of 2056 students, collected from the first two weeks of CS1 courses. We extract features that measure how students deal with deadlines, how they fix errors, how much time they spend programming, and so forth. Subsequently, we construct a predictive model that achieved cutting-edge results with area under the curve (AUC) of.89, and an average accuracy of 81.3%. To allow an effective intervention and to facilitate human-AI collaboration towards prescriptive analytics, we, for the first time, to the best of our knowledge, go a step further than the prediction itself and leverage this field by proposing an approach to explaining our predictive model decisions individually and collectively using a game-theory based framework (SHAP), (Lundberg <italic>et al.</italic>, 2020) that allows interpreting our black-box non-linear model linearly. In other words, we explain the feature effects, clearly by visualising and analysing individual predictions, the overall importance of features, and identification of typical prediction paths. This method can be further applied to other emerging competitive models, as the CS1 prediction field progresses, ensuring transparency of the process for key stakeholders: administrators, teachers, and students. |
topic |
Explainable artificial intelligence online judges learning analytics CS1 computing in education early prediction |
url |
https://ieeexplore.ieee.org/document/9517104/ |
work_keys_str_mv |
AT filipedwanpereira explainingindividualandcollectiveprogrammingstudentsx2019behaviorbyinterpretingablackboxpredictivemodel AT samuelcfonseca explainingindividualandcollectiveprogrammingstudentsx2019behaviorbyinterpretingablackboxpredictivemodel AT elainehtoliveira explainingindividualandcollectiveprogrammingstudentsx2019behaviorbyinterpretingablackboxpredictivemodel AT alexandraicristea explainingindividualandcollectiveprogrammingstudentsx2019behaviorbyinterpretingablackboxpredictivemodel AT henrikbellhauser explainingindividualandcollectiveprogrammingstudentsx2019behaviorbyinterpretingablackboxpredictivemodel AT luizrodrigues explainingindividualandcollectiveprogrammingstudentsx2019behaviorbyinterpretingablackboxpredictivemodel AT davidbfoliveira explainingindividualandcollectiveprogrammingstudentsx2019behaviorbyinterpretingablackboxpredictivemodel AT seijiisotani explainingindividualandcollectiveprogrammingstudentsx2019behaviorbyinterpretingablackboxpredictivemodel AT leandrosgcarvalho explainingindividualandcollectiveprogrammingstudentsx2019behaviorbyinterpretingablackboxpredictivemodel |
_version_ |
1721187985695703040 |