On predicting academic performance with process mining in learning analytics
Purpose - The purpose of this paper is to propose a process mining approach to help in making early predictions to improve students’ learning experience in massive open online courses (MOOCs). It investigates the impact of various machine learning techniques in combination with process mining featur...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Emerald Publishing
2018-05-01
|
Series: | Journal of Research in Innovative Teaching & Learning |
Subjects: | |
Online Access: | https://www.emeraldinsight.com/doi/pdfplus/10.1108/JRIT-09-2017-0022 |
id |
doaj-dfce69246aec4cd9a9901b5c89255ba4 |
---|---|
record_format |
Article |
spelling |
doaj-dfce69246aec4cd9a9901b5c89255ba42020-11-24T20:52:52ZengEmerald PublishingJournal of Research in Innovative Teaching & Learning2397-76042018-05-0110216017610.1108/JRIT-09-2017-0022606383On predicting academic performance with process mining in learning analyticsRahila Umer0Teo Susnjak1Anuradha Mathrani2Suriadi Suriadi3Institute of Natural and Mathematical Sciences, College of Sciences, Massey University, Auckland, New ZealandMassey University, Auckland, New ZealandMassey University, Auckland, New ZealandFaculty of Science and Engineering, Queensland University of Technology, Brisbane, AustraliaPurpose - The purpose of this paper is to propose a process mining approach to help in making early predictions to improve students’ learning experience in massive open online courses (MOOCs). It investigates the impact of various machine learning techniques in combination with process mining features to measure effectiveness of these techniques. Design/methodology/approach - Student’s data (e.g. assessment grades, demographic information) and weekly interaction data based on event logs (e.g. video lecture interaction, solution submission time, time spent weekly) have guided this design. This study evaluates four machine learning classification techniques used in the literature (logistic regression (LR), Naïve Bayes (NB), random forest (RF) and K-nearest neighbor) to monitor weekly progression of students’ performance and to predict their overall performance outcome. Two data sets – one, with traditional features and second, with features obtained from process conformance testing – have been used. Findings - The results show that techniques used in the study are able to make predictions on the performance of students. Overall accuracy (F1-score, area under curve) of machine learning techniques can be improved by integrating process mining features with standard features. Specifically, the use of LR and NB classifiers outperforms other techniques in a statistical significant way. Practical implications - Although MOOCs provide a platform for learning in highly scalable and flexible manner, they are prone to early dropout and low completion rate. This study outlines a data-driven approach to improve students’ learning experience and decrease the dropout rate. Social implications - Early predictions based on individual’s participation can help educators provide support to students who are struggling in the course. Originality/value - This study outlines the innovative use of process mining techniques in education data mining to help educators gather data-driven insight on student performances in the enrolled courses.https://www.emeraldinsight.com/doi/pdfplus/10.1108/JRIT-09-2017-0022PredictionMOOCsMachine learningLearning analyticsProcess miningEducation data mining |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Rahila Umer Teo Susnjak Anuradha Mathrani Suriadi Suriadi |
spellingShingle |
Rahila Umer Teo Susnjak Anuradha Mathrani Suriadi Suriadi On predicting academic performance with process mining in learning analytics Journal of Research in Innovative Teaching & Learning Prediction MOOCs Machine learning Learning analytics Process mining Education data mining |
author_facet |
Rahila Umer Teo Susnjak Anuradha Mathrani Suriadi Suriadi |
author_sort |
Rahila Umer |
title |
On predicting academic performance with process mining in learning analytics |
title_short |
On predicting academic performance with process mining in learning analytics |
title_full |
On predicting academic performance with process mining in learning analytics |
title_fullStr |
On predicting academic performance with process mining in learning analytics |
title_full_unstemmed |
On predicting academic performance with process mining in learning analytics |
title_sort |
on predicting academic performance with process mining in learning analytics |
publisher |
Emerald Publishing |
series |
Journal of Research in Innovative Teaching & Learning |
issn |
2397-7604 |
publishDate |
2018-05-01 |
description |
Purpose - The purpose of this paper is to propose a process mining approach to help in making early predictions to improve students’ learning experience in massive open online courses (MOOCs). It investigates the impact of various machine learning techniques in combination with process mining features to measure effectiveness of these techniques. Design/methodology/approach - Student’s data (e.g. assessment grades, demographic information) and weekly interaction data based on event logs (e.g. video lecture interaction, solution submission time, time spent weekly) have guided this design. This study evaluates four machine learning classification techniques used in the literature (logistic regression (LR), Naïve Bayes (NB), random forest (RF) and K-nearest neighbor) to monitor weekly progression of students’ performance and to predict their overall performance outcome. Two data sets – one, with traditional features and second, with features obtained from process conformance testing – have been used. Findings - The results show that techniques used in the study are able to make predictions on the performance of students. Overall accuracy (F1-score, area under curve) of machine learning techniques can be improved by integrating process mining features with standard features. Specifically, the use of LR and NB classifiers outperforms other techniques in a statistical significant way. Practical implications - Although MOOCs provide a platform for learning in highly scalable and flexible manner, they are prone to early dropout and low completion rate. This study outlines a data-driven approach to improve students’ learning experience and decrease the dropout rate. Social implications - Early predictions based on individual’s participation can help educators provide support to students who are struggling in the course. Originality/value - This study outlines the innovative use of process mining techniques in education data mining to help educators gather data-driven insight on student performances in the enrolled courses. |
topic |
Prediction MOOCs Machine learning Learning analytics Process mining Education data mining |
url |
https://www.emeraldinsight.com/doi/pdfplus/10.1108/JRIT-09-2017-0022 |
work_keys_str_mv |
AT rahilaumer onpredictingacademicperformancewithprocessmininginlearninganalytics AT teosusnjak onpredictingacademicperformancewithprocessmininginlearninganalytics AT anuradhamathrani onpredictingacademicperformancewithprocessmininginlearninganalytics AT suriadisuriadi onpredictingacademicperformancewithprocessmininginlearninganalytics |
_version_ |
1716798706798297088 |