Predicting and Interpreting Students Performance using Supervised Learning and Shapley Additive Explanations

abstract: Due to large data resources generated by online educational applications, Educational Data Mining (EDM) has improved learning effects in different ways: Students Visualization, Recommendations for students, Students Modeling, Grouping Students, etc. A lot of programming assignments have th...

Full description

Bibliographic Details
Other Authors: Tian, Wenbo (Author)
Format: Dissertation
Language:English
Published: 2019
Subjects:
Online Access:http://hdl.handle.net/2286/R.I.53452
id ndltd-asu.edu-item-53452
record_format oai_dc
spelling ndltd-asu.edu-item-534522019-05-16T03:01:14Z Predicting and Interpreting Students Performance using Supervised Learning and Shapley Additive Explanations abstract: Due to large data resources generated by online educational applications, Educational Data Mining (EDM) has improved learning effects in different ways: Students Visualization, Recommendations for students, Students Modeling, Grouping Students, etc. A lot of programming assignments have the features like automating submissions, examining the test cases to verify the correctness, but limited studies compared different statistical techniques with latest frameworks, and interpreted models in a unified approach. In this thesis, several data mining algorithms have been applied to analyze students’ code assignment submission data from a real classroom study. The goal of this work is to explore and predict students’ performances. Multiple machine learning models and the model accuracy were evaluated based on the Shapley Additive Explanation. The Cross-Validation shows the Gradient Boosting Decision Tree has the best precision 85.93% with average 82.90%. Features like Component grade, Due Date, Submission Times have higher impact than others. Baseline model received lower precision due to lack of non-linear fitting. Dissertation/Thesis Tian, Wenbo (Author) Hsiao, Ihan (Advisor) Bazzi, Rida (Committee member) Davulcu, Hasan (Committee member) Arizona State University (Publisher) Computer science Statistics Computer engineering Data Educational Data Mining Machine Learning Shapley Additive Explanations Students Supervised Learning eng 43 pages Masters Thesis Computer Science 2019 Masters Thesis http://hdl.handle.net/2286/R.I.53452 http://rightsstatements.org/vocab/InC/1.0/ 2019
collection NDLTD
language English
format Dissertation
sources NDLTD
topic Computer science
Statistics
Computer engineering
Data
Educational Data Mining
Machine Learning
Shapley Additive Explanations
Students
Supervised Learning
spellingShingle Computer science
Statistics
Computer engineering
Data
Educational Data Mining
Machine Learning
Shapley Additive Explanations
Students
Supervised Learning
Predicting and Interpreting Students Performance using Supervised Learning and Shapley Additive Explanations
description abstract: Due to large data resources generated by online educational applications, Educational Data Mining (EDM) has improved learning effects in different ways: Students Visualization, Recommendations for students, Students Modeling, Grouping Students, etc. A lot of programming assignments have the features like automating submissions, examining the test cases to verify the correctness, but limited studies compared different statistical techniques with latest frameworks, and interpreted models in a unified approach. In this thesis, several data mining algorithms have been applied to analyze students’ code assignment submission data from a real classroom study. The goal of this work is to explore and predict students’ performances. Multiple machine learning models and the model accuracy were evaluated based on the Shapley Additive Explanation. The Cross-Validation shows the Gradient Boosting Decision Tree has the best precision 85.93% with average 82.90%. Features like Component grade, Due Date, Submission Times have higher impact than others. Baseline model received lower precision due to lack of non-linear fitting. === Dissertation/Thesis === Masters Thesis Computer Science 2019
author2 Tian, Wenbo (Author)
author_facet Tian, Wenbo (Author)
title Predicting and Interpreting Students Performance using Supervised Learning and Shapley Additive Explanations
title_short Predicting and Interpreting Students Performance using Supervised Learning and Shapley Additive Explanations
title_full Predicting and Interpreting Students Performance using Supervised Learning and Shapley Additive Explanations
title_fullStr Predicting and Interpreting Students Performance using Supervised Learning and Shapley Additive Explanations
title_full_unstemmed Predicting and Interpreting Students Performance using Supervised Learning and Shapley Additive Explanations
title_sort predicting and interpreting students performance using supervised learning and shapley additive explanations
publishDate 2019
url http://hdl.handle.net/2286/R.I.53452
_version_ 1719183363431989248