Dropout Prediction Model for College Students in MOOCs Based on Weighted Multi-feature and SVM

Main Article Content

Zhang Yujiao
Ling Weay Ang
Shi Shaomin
Sellappan Palaniappan

Abstract

Due to the COVID -19 pandemic, MOOCs have become a popular form of learning for college students. However, unlike traditional face-to-face courses, MOOCs offer little faculty supervision, which may result in students being insufficiently motivated to continue learning, ultimately leading to a high dropout rate. Consequently, the problem of high dropout rates in MOOCs requires urgent attention in MOOC research. Predicting dropout rates is the first step to address this problem, and MOOCs have a large amount of behavioral data that can be used for such predictions. Most existing models for predicting MOOC dropout based on behavioral data assign equal weights to each behavioral characteristic, despite the fact that each behavioral characteristic has a different effect on predicting dropout. To address this problem, this paper proposes a dropout prediction model based on the fusion of behavioral data and Support Vector Machine (SVM). This innovative model assigns different weights to different behavior features based on Pearson principle and integrates them as data inputs to the model. Dropout prediction is essentially a binary problem, Support Vector Machine Classifier is then trained using the training dataset 1 and dataset 2. Experimental results on both datasets show that this predictive model outperforms previous models that assign the same weights to the behavior features.

Article Details

How to Cite
Yujiao, Z., Ang, L. W., Shaomin, S., & Palaniappan, S. (2023). Dropout Prediction Model for College Students in MOOCs Based on Weighted Multi-feature and SVM. Journal of Informatics and Web Engineering, 2(2), 29–42. https://doi.org/10.33093/jiwe.2023.2.2.3
Section
Regular issue

References

H. Haron, A. R. M. Yusof, H. Samad, N. Ismail, A. Juanita, and H. Yusof, "The platform of MOOC (Massive Open Online Course) on open learning: Issues and challenges", International Journal of Modern Education, vol. 1, pp. 1-9,2019.

C. M. Stracke, S. Downes, G. Conole, D. Burgos and F. Nascimbeni, "Are MOOCs Open Educational Resources? A Literature Review on History, Definitions and Typologies of OER and MOOCs", Open Praxis, vol. 11, pp. 331-341, 2019.

N. Alhazzani, "MOOC’s impact on higher education", Social Sciences & Humanities Open, vol. 2, pp. 1-6, 2020.

J. Kennedy, "Characteristics of massive open online courses (MOOCs): A research review", Journal of Interactive Online Learning, vol. 13, pp. 1-15, 2014.

Y. Goel, and R. Goyal, "On the effectiveness of self-training in MOOC dropout prediction", Open Computer Science, vol. 10, pp. 246-258. 2020.

S. Abdelali, "Education data mining: mining MOOCs videos using metadata based approach". 2016 4th IEEE International Colloquium on Information Science and Technology (CiSt), 2016, pp. 531-534

A. Ezen-Can, K. E. Boyer, S. Kellogg, and S. Booth, "Unsupervised modeling for understanding MOOC discussion forums: a learning analytics approach", In Proceedings of the Fifth International Conference on Learning Analytics and Knowledge, 2015, pp. 146-150.

J. Chen, J. Feng, X. Sun, N. Wu, Z. Yang and S. Chen, "MOOC Dropout Prediction Using a Hybrid Algorithm Based on Decision Tree and Extreme Learning Machine", Mathematical Problems in Engineering, vol.2019, pp. 1-11, 2019.

V. Muthukumar and N. Bhalaji, "MOOCVERSITY-deep learning based dropout prediction in MOOCs over weeks", Journal of Soft Computing Paradigm (JSCP), vol. 2, pp. 140-152, 2020.

Y. Wen, Y. Tian, B. Wen, Q. Zhou, G. Cai and S. Liu, "Consideration of the Local Correlation of Learning Behaviors to Predict Dropouts from MOOCs", Tsinghua Science and Technology, vol. 25, pp. 336-347, 2020.

Q. Fu, Z. Gao, J. Zhou and Y. Zheng, "CLSA: A novel deep learning model for MOOC dropout prediction", Computers & Electrical Engineering, vol. 94, pp. 1-12, 2021.

I. Nitta, R. Ishizaki, M. Shingu, S. Nakashima, K. Maruhashi, A. Tolmachev and M. Todoriki, "Graph-based massive open online course (MOOC) dropout prediction using clickstream data in virtual learning environment", 2021 16th International Conference on Computer Science & Education (ICCSE), 2021, pp. 48-52.

S. Nithya and S. Umarani, "MOOC Dropout Prediction using FIAR-ANN Model based on Learner Behavioral Features", International Journal of Advanced Computer Science and Applications, vol.13, pp. 607-617, 2022.

M. T. Sultan, H. El Sayed, M. A. Khan and M. Abduljabar, "A Deep Learning Model for MOOC Dropout Prediction Using Learner's Course-relevant Activities", 2022 IEEE Global Conference on Artificial Intelligence and Internet of Things (GCAIoT), 2022, pp. 13-18.

S. Sun, X. Qian, L. Mu, H. Zan and Q. Zhang, "Performance Prediction Based on Analysis of Learning Behavior", Data Science: 4th International Conference of Pioneering Computer Scientists, Engineers and Educators (ICPCSEE), 2018, pp. 632-644.

V. Cherkassky and Y. Ma, "Practical selection of SVM parameters and noise estimation for SVM regression", Neural networks, vol. 17, pp. 113-126, 2004.

Y. Lim, K. W. Ng, P. Naveen and S. C. Haw, "Emotion Recognition by Facial Expression and Voice: Review and Analysis", Journal of Informatics and Web Engineering, vol.1, pp. 45-54. September 2022.

F. F. Chua, T. Y. Lim, B. Tajuddin and A. P. Yanuarifiani, "Incorporating Semi-Automated Approach for Effective Software Requirements Prioritization: A Framework Design", Journal of Informatics and Web Engineering, vol. 1, pp. 1-15, March 2022.

M. Shafiq, H. Ng, T. T. V. Yap and V. T. Goh, "Performance of Sentiment Classifiers on Tweets of Different Clothing Brands", Journal of Informatics and Web Engineering, vol. 1, pp. 16-22, March 2022.

J. Zhao, "The development and application of support vector machine", Journal of Physics: Conference Series, vol. 1748, pp. 1-6, 2021.

R. S. Chauhan and D. Ghosh, "An erratum to ‘Extended Karush-Kuhn-Tucker condition for constrained interval optimization problems and its application in support vector machines’", Information Sciences, vol. 559, pp. 309-313, 2021.

B. Hong, Z. Wei and Y. Yang, "Discovering learning behavior patterns to predict dropout in MOOC", 2017 12th International Conference on Computer Science and Education (ICCSE), 2017, pp. 700-704.

B. Wu, "Influence of MOOC learners discussion forum social interactions on online reviews of MOOC", Education and Information Technologies, vol. 26, pp. 3483-3496, 2021.

S. Nithya and S. Umarani, "Comparative Analysis of the Learning on KDD Cup 2015 Dataset", Webology, vol. 19, pp. 705-717, 2022.

D. Powers, "Evaluation: From Precision, Recall and F-Measure to ROC, Informedness, Markedness & Correlation", Journal of Machine Learning Technologies, vol. 2, pp. 37-63, 2011.