DMCNet: Diversified model combination network for understanding engagement from video screengrabs

Engagement is an essential indicator of the Quality-of-Learning Experience (QoLE) and plays a major role in developing intelligent educational interfaces. The number of people learning through Massively Open Online Courses (MOOCs) and other online resources has been increasing rapidly because they p...

Full description

Bibliographic Details
Main Authors: Batra, S. (Author), Brodeur, P. (Author), Checkley, M. (Author), Dev, S. (Author), Klinkert, A. (Author), Nag, A. (Author), Wang, H. (Author)
Format: Article
Language:English
Published: Academic Press 2022
Subjects:
Online Access:View Fulltext in Publisher
LEADER 02686nam a2200277Ia 4500
001 10.1016-j.sasc.2022.200039
008 220630s2022 CNT 000 0 und d
020 |a 27729419 (ISSN) 
245 1 0 |a DMCNet: Diversified model combination network for understanding engagement from video screengrabs 
260 0 |b Academic Press  |c 2022 
520 3 |a Engagement is an essential indicator of the Quality-of-Learning Experience (QoLE) and plays a major role in developing intelligent educational interfaces. The number of people learning through Massively Open Online Courses (MOOCs) and other online resources has been increasing rapidly because they provide us with the flexibility to learn from anywhere at any time. This provides a good learning experience for the students. However, such learning interface requires the ability to recognize the level of engagement of the students for a holistic learning experience. This is useful for both students and educators alike. However, understanding engagement is a challenging task, because of its subjectivity and ability to collect data. In this paper, we propose a variety of models that have been trained on an open-source dataset of video screengrabs. Our non-deep learning models are based on the combination of popular algorithms such as Histogram of Oriented Gradient (HOG), Support Vector Machine (SVM), Scale Invariant Feature Transform (SIFT) and Speeded Up Robust Features (SURF). The deep learning methods include Densely Connected Convolutional Networks (DenseNet-121), Residual Network (ResNet-18) and MobileNetV1. We show the performance of each models using a variety of metrics such as the Gini Index, Adjusted F-Measure (AGF), and Area Under receiver operating characteristic Curve (AUC). We use various dimensionality reduction techniques such as Principal Component Analysis (PCA) and t-Distributed Stochastic Neighbor Embedding (t-SNE) to understand the distribution of data in the feature sub-space. Our work will thereby assist the educators and students in obtaining a fruitful and efficient online learning experience. © 2022 The Author(s) 
650 0 4 |a convolutional neural network 
650 0 4 |a deep learning 
650 0 4 |a engagement 
650 0 4 |a facial expression recognition 
650 0 4 |a histogram of oriented gradient 
650 0 4 |a support vector machine 
700 1 0 |a Batra, S.  |e author 
700 1 0 |a Brodeur, P.  |e author 
700 1 0 |a Checkley, M.  |e author 
700 1 0 |a Dev, S.  |e author 
700 1 0 |a Klinkert, A.  |e author 
700 1 0 |a Nag, A.  |e author 
700 1 0 |a Wang, H.  |e author 
773 |t Systems and Soft Computing 
856 |z View Fulltext in Publisher  |u https://doi.org/10.1016/j.sasc.2022.200039