Explaining Individual and Collective Programming Students’ Behavior by Interpreting a Black-Box Predictive Model

Predicting student performance as early as possible and analysing to which extent initial student behaviour could lead to failure or success is critical in introductory programming (CS1) courses, for allowing prompt intervention in a move towards alleviating their high failure rate. However, in CS1...

Full description

Bibliographic Details
Main Authors: Filipe Dwan Pereira, Samuel C. Fonseca, Elaine H. T. Oliveira, Alexandra I. Cristea, Henrik Bellhauser, Luiz Rodrigues, David B. F. Oliveira, Seiji Isotani, Leandro S. G. Carvalho
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
CS1
Online Access:https://ieeexplore.ieee.org/document/9517104/
id doaj-7618ab10e2ba4630a140e36ea8cc49b9
record_format Article
spelling doaj-7618ab10e2ba4630a140e36ea8cc49b92021-08-27T23:00:29ZengIEEEIEEE Access2169-35362021-01-01911709711711910.1109/ACCESS.2021.31059569517104Explaining Individual and Collective Programming Students&#x2019; Behavior by Interpreting a Black-Box Predictive ModelFilipe Dwan Pereira0https://orcid.org/0000-0003-4914-3347Samuel C. Fonseca1Elaine H. T. Oliveira2https://orcid.org/0000-0003-2884-9359Alexandra I. Cristea3Henrik Bellhauser4https://orcid.org/0000-0003-4414-7795Luiz Rodrigues5David B. F. Oliveira6Seiji Isotani7https://orcid.org/0000-0003-1574-0784Leandro S. G. Carvalho8https://orcid.org/0000-0003-2970-2084Department of Computer Science, Federal University of Roraima, Boa Vista, BrazilInstitute of Computing, Federal University of Amazonas, Manaus, BrazilInstitute of Computing, Federal University of Amazonas, Manaus, BrazilDepartment of Computer Science, Durham University, Durham, U.K.Department of Psychology, Johannes Gutenberg-University Mainz, Mainz, GermanyInstitute of Mathematics and Computer Science, University of S&#x00E3;o Paulo, S&#x00E3;o Carlos, BrazilInstitute of Computing, Federal University of Amazonas, Manaus, BrazilInstitute of Mathematics and Computer Science, University of S&#x00E3;o Paulo, S&#x00E3;o Carlos, BrazilInstitute of Computing, Federal University of Amazonas, Manaus, BrazilPredicting student performance as early as possible and analysing to which extent initial student behaviour could lead to failure or success is critical in introductory programming (CS1) courses, for allowing prompt intervention in a move towards alleviating their high failure rate. However, in CS1 performance prediction, there is a serious lack of studies that interpret the predictive model&#x2019;s decisions. In this sense, we designed a long-term study using very fine-grained log-data of 2056 students, collected from the first two weeks of CS1 courses. We extract features that measure how students deal with deadlines, how they fix errors, how much time they spend programming, and so forth. Subsequently, we construct a predictive model that achieved cutting-edge results with area under the curve (AUC) of.89, and an average accuracy of 81.3&#x0025;. To allow an effective intervention and to facilitate human-AI collaboration towards prescriptive analytics, we, for the first time, to the best of our knowledge, go a step further than the prediction itself and leverage this field by proposing an approach to explaining our predictive model decisions individually and collectively using a game-theory based framework (SHAP), (Lundberg <italic>et al.</italic>, 2020) that allows interpreting our black-box non-linear model linearly. In other words, we explain the feature effects, clearly by visualising and analysing individual predictions, the overall importance of features, and identification of typical prediction paths. This method can be further applied to other emerging competitive models, as the CS1 prediction field progresses, ensuring transparency of the process for key stakeholders: administrators, teachers, and students.https://ieeexplore.ieee.org/document/9517104/Explainable artificial intelligenceonline judgeslearning analyticsCS1computing in educationearly prediction
collection DOAJ
language English
format Article
sources DOAJ
author Filipe Dwan Pereira
Samuel C. Fonseca
Elaine H. T. Oliveira
Alexandra I. Cristea
Henrik Bellhauser
Luiz Rodrigues
David B. F. Oliveira
Seiji Isotani
Leandro S. G. Carvalho
spellingShingle Filipe Dwan Pereira
Samuel C. Fonseca
Elaine H. T. Oliveira
Alexandra I. Cristea
Henrik Bellhauser
Luiz Rodrigues
David B. F. Oliveira
Seiji Isotani
Leandro S. G. Carvalho
Explaining Individual and Collective Programming Students&#x2019; Behavior by Interpreting a Black-Box Predictive Model
IEEE Access
Explainable artificial intelligence
online judges
learning analytics
CS1
computing in education
early prediction
author_facet Filipe Dwan Pereira
Samuel C. Fonseca
Elaine H. T. Oliveira
Alexandra I. Cristea
Henrik Bellhauser
Luiz Rodrigues
David B. F. Oliveira
Seiji Isotani
Leandro S. G. Carvalho
author_sort Filipe Dwan Pereira
title Explaining Individual and Collective Programming Students&#x2019; Behavior by Interpreting a Black-Box Predictive Model
title_short Explaining Individual and Collective Programming Students&#x2019; Behavior by Interpreting a Black-Box Predictive Model
title_full Explaining Individual and Collective Programming Students&#x2019; Behavior by Interpreting a Black-Box Predictive Model
title_fullStr Explaining Individual and Collective Programming Students&#x2019; Behavior by Interpreting a Black-Box Predictive Model
title_full_unstemmed Explaining Individual and Collective Programming Students&#x2019; Behavior by Interpreting a Black-Box Predictive Model
title_sort explaining individual and collective programming students&#x2019; behavior by interpreting a black-box predictive model
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2021-01-01
description Predicting student performance as early as possible and analysing to which extent initial student behaviour could lead to failure or success is critical in introductory programming (CS1) courses, for allowing prompt intervention in a move towards alleviating their high failure rate. However, in CS1 performance prediction, there is a serious lack of studies that interpret the predictive model&#x2019;s decisions. In this sense, we designed a long-term study using very fine-grained log-data of 2056 students, collected from the first two weeks of CS1 courses. We extract features that measure how students deal with deadlines, how they fix errors, how much time they spend programming, and so forth. Subsequently, we construct a predictive model that achieved cutting-edge results with area under the curve (AUC) of.89, and an average accuracy of 81.3&#x0025;. To allow an effective intervention and to facilitate human-AI collaboration towards prescriptive analytics, we, for the first time, to the best of our knowledge, go a step further than the prediction itself and leverage this field by proposing an approach to explaining our predictive model decisions individually and collectively using a game-theory based framework (SHAP), (Lundberg <italic>et al.</italic>, 2020) that allows interpreting our black-box non-linear model linearly. In other words, we explain the feature effects, clearly by visualising and analysing individual predictions, the overall importance of features, and identification of typical prediction paths. This method can be further applied to other emerging competitive models, as the CS1 prediction field progresses, ensuring transparency of the process for key stakeholders: administrators, teachers, and students.
topic Explainable artificial intelligence
online judges
learning analytics
CS1
computing in education
early prediction
url https://ieeexplore.ieee.org/document/9517104/
work_keys_str_mv AT filipedwanpereira explainingindividualandcollectiveprogrammingstudentsx2019behaviorbyinterpretingablackboxpredictivemodel
AT samuelcfonseca explainingindividualandcollectiveprogrammingstudentsx2019behaviorbyinterpretingablackboxpredictivemodel
AT elainehtoliveira explainingindividualandcollectiveprogrammingstudentsx2019behaviorbyinterpretingablackboxpredictivemodel
AT alexandraicristea explainingindividualandcollectiveprogrammingstudentsx2019behaviorbyinterpretingablackboxpredictivemodel
AT henrikbellhauser explainingindividualandcollectiveprogrammingstudentsx2019behaviorbyinterpretingablackboxpredictivemodel
AT luizrodrigues explainingindividualandcollectiveprogrammingstudentsx2019behaviorbyinterpretingablackboxpredictivemodel
AT davidbfoliveira explainingindividualandcollectiveprogrammingstudentsx2019behaviorbyinterpretingablackboxpredictivemodel
AT seijiisotani explainingindividualandcollectiveprogrammingstudentsx2019behaviorbyinterpretingablackboxpredictivemodel
AT leandrosgcarvalho explainingindividualandcollectiveprogrammingstudentsx2019behaviorbyinterpretingablackboxpredictivemodel
_version_ 1721187985695703040