• Abstract

    The literature on the role of machine learning in forecasting student performance in education lacks comprehensive data quality, equitable and interpretable models, consideration of contextual and causal factors, and the integration of human expertise. This review will explore machine learning types, algorithms, predictive performance, and impact on student performance in education. A systematic literature review based on articles published in the past 2019–2023 period Methods: The IEEE Xplore database was searched by using keywords such as "Educational Data Mining," "Student Performance Prediction," "Evaluations of Students," "Performance Analysis of Students," and "Learning Curve Prediction." were employed and 50 papers were selected. Results: The analysis of the results highlighted prominent patterns. Half of the studies favored supervised learning methods, with decision trees leading (19 instances), followed by Long Short-Term Memory and Random Forest (16 each), and K-Nearest Neighbor and Naive Bayes (12 and 11 times). Support Vector Machine and Logistic Regression were noted 10 and 9 times, respectively. Noteworthy were ANN, CNN, and Xgboost. Positive impacts were evident in 36 cases; only one showed negative effects, while 13 indicated intricate relationships. This study helped in understanding the prevalent machine learning methods used for predicting student performance, provides a benchmark for assessing the effectiveness of new or alternative techniques. Conclusion: This review highlights the varied machine learning uses in predicting student performance in education, emphasizing supervised methods, diverse algorithms, and complex intervention impacts.

  • References

    1. Adnan, M., Habib, A., Ashraf, J., Mussadiq, S., Raza, A. A., Abid, M., Bashir, M., & Khan, S. U. (2021). Predicting at-Risk Students at Different Percentages of Course Length for Early Intervention Using Machine Learning Models. IEEE Access, 9, 7519–7539. https://doi.org/10.1109/access.2021.3049446
    2. Adnan, M., Uddin, M. I., Khan, E., Alharithi, F. S., Amin, S., & Alzahrani, A. A. (2022). Earliest Possible Global and Local Interpretation of Students’ Performance in Virtual Learning Environment by Leveraging Explainable AI. IEEE Access, 10, 129843–129864. https://doi.org/10.1109/access.2022.3227072
    3. Alalawi, K., Athauda, R., & Chiong, R. (2023). Contextualizing the current state of research on the use of machine learning for student performance prediction: A systematic literature review. Engineering Reports, e12699. https://doi.org/10.1002/ENG2.12699
    4. Alamri, R., & Alharbi, B. (2021). Explainable Student Performance Prediction Models: A Systematic Review. IEEE Access, 9, 33132–33143. https://doi.org/10.1109/access.2021.3061368
    5. Albreiki, B., Zaki, N., & Alashwal, H. (2021). A Systematic Literature Review of Student’ Performance Prediction Using Machine Learning Techniques. Education Sciences 2021, Vol. 11, Page 552, 11(9), 552. https://doi.org/10.3390/EDUCSCI11090552
    6. Alhazmi, E., & Sheneamer, A. (2023). Early Predicting of Students Performance in Higher Education. IEEE Access, 11, 27579–27589. https://doi.org/10.1109/access.2023.3250702
    7. Aljaloud, A. S., Uliyan, D. M., Alkhalil, A., Elrhman, M. A., Alogali, A. F. M., Altameemi, Y. M., Altamimi, M., & Kwan, P. (2022). A Deep Learning Model to Predict Student Learning Outcomes in LMS Using CNN and LSTM. IEEE Access, 10, 85255–85265. https://doi.org/10.1109/access.2022.3196784
    8. Alshanqiti, A., & Namoun, A. (2020). Predicting Student Performance and Its Influential Factors Using Hybrid Regression and Multi-Label Classification. IEEE Access, 8, 203827–203844. https://doi.org/10.1109/access.2020.3036572
    9. Ara Shaikh, A., Kumar, A., Jani, K., Mitra, S., García-Tadeo, D. A., & Devarajan, A. (2022). The Role of Machine Learning and Artificial Intelligence for making a Digital Classroom and its sustainable Impact on Education during Covid-19. Materials Today: Proceedings, 56, 3211–3215. https://doi.org/10.1016/J.MATPR.2021.09.368
    10. Asthana, P., Mishra, S., Gupta, N., Derawi, M., & Kumar, A. (2023). Prediction of Student’s Performance With Learning Coefficients Using Regression Based Machine Learning Models. IEEE Access, 11. https://doi.org/10.1109/access.2023.3294700
    11. Bujang, S. D. A., Selamat, A., Ibrahim, R., Krejcar, O., Herrera-Viedma, E., Fujita, H., & Ghani, N. A. M. (2021). Multiclass Prediction Model for Student Grade Prediction Using Machine Learning. IEEE Access, 9, 95608–95621. https://doi.org/10.1109/access.2021.3093563
    12. Chan, R. Y.-Y., Wong, C. M. V., & Yum, Y. N. (2023). Predicting Behavior Change in Students With Special Education Needs Using Multimodal Learning Analytics. IEEE Access, 11, 63238–63251. https://doi.org/10.1109/access.2023.3288695
    13. Chui, K. T., Liu, R. W., Zhao, M., & Ordóñez de Pablos, P. (2020). Predicting Students’ Performance With School and Family Tutoring Using Generative Adversarial Network-Based Deep Support Vector Machine. IEEE Access, 8, 86745–86752. https://doi.org/10.1109/access.2020.2992869
    14. Feng, G., Fan, M., & Chen, Y. (2022). Analysis and Prediction of Students’ Academic Performance Based on Educational Data Mining. IEEE Access, 10, 19558–19571. https://doi.org/10.1109/access.2022.3151652
    15. Gao, L., Zhao, Z., Qi, L., Liang, Y., & Du, J. (2019). Modeling the Effort and Learning Ability of Students in MOOCs. IEEE Access, 7, 128035–128042. https://doi.org/10.1109/access.2019.2937985
    16. Ghorbani, R., & Ghousi, R. (2020). Comparing Different Resampling Methods in Predicting Students’ Performance Using Machine Learning Techniques. IEEE Access, 8, 67899–67911. https://doi.org/10.1109/access.2020.2986809
    17. Haleem, A., Javaid, M., Qadri, M. A., & Suman, R. (2022). Understanding the role of digital technologies in education: A review. Sustainable Operations and Computers, 3, 275–285. https://doi.org/10.1016/J.SUSOC.2022.05.004
    18. Hassan, Y. M. I., Elkorany, A., & Wassif, K. (2022). Utilizing Social Clustering-Based Regression Model for Predicting Student’s GPA. IEEE Access, 10, 48948–48963. https://doi.org/10.1109/access.2022.3172438
    19. Hilbert, S., Coors, S., Kraus, E., Bischl, B., Lindl, A., Frei, M., Wild, J., Krauss, S., Goretzko, D., & Stachl, C. (2021). Machine learning for the educational sciences. Review of Education, 9(3), e3310. https://doi.org/10.1002/REV3.3310
    20. Hussain, W., Spady, W. G., Khan, S. Z., Khawaja, B. A., Naqash, T., & Conner, L. (2021). Impact Evaluations of Engineering Programs Using Abet Student Outcomes. IEEE Access, 9, 46166–46190. https://doi.org/10.1109/access.2021.3066921
    21. Issah, I., Appiah, O., Appiahene, P., & Inusah, F. (2023). A systematic review of the literature on machine learning application of determining the attributes influencing academic performance. Decision Analytics Journal, 7, 100204. https://doi.org/10.1016/J.DAJOUR.2023.100204
    22. Kastrati, Z., Imran, A. S., & Kurti, A. (2020). Weakly Supervised Framework for Aspect-Based Sentiment Analysis on Students’ Reviews of MOOCs. IEEE Access, 8, 106799–106810. https://doi.org/10.1109/access.2020.3000739
    23. Kim, J., Lee, H., & Cho, Y. H. (2022). Learning design to support student-AI collaboration: perspectives of leading teachers for AI in education. Education and Information Technologies, 27(5), 6069–6104. https://doi.org/10.1007/S10639-021-10831-6/TABLES/1
    24. Kusumawardani, S. S., & Alfarozi, S. A. I. (2023). Transformer Encoder Model for Sequential Prediction of Student Performance Based on Their Log Activities. IEEE Access, 11, 18960–18971. https://doi.org/10.1109/access.2023.3246122
    25. Latif, G., Abdelhamid, S. E., Fawagreh, K. S., Brahim, G. Ben, & Alghazo, R. (2023). Machine Learning in Higher Education: Students’ Performance Assessment Considering Online Activity Logs. IEEE Access, 11. https://doi.org/10.1109/access.2023.3287972
    26. Liu, D., Zhang, Y., Zhang, J., Li, Q., Zhang, C., & Yin, Y. (2020). Multiple Features Fusion Attention Mechanism Enhanced Deep Knowledge Tracing for Student Performance Prediction. IEEE Access, 8, 194894–194903. https://doi.org/10.1109/access.2020.3033200
    27. Mengash, H. A. (2020). Using Data Mining Techniques to Predict Student Performance to Support Decision Making in University Admission Systems. IEEE Access, 8, 55462–55470. https://doi.org/10.1109/access.2020.2981905
    28. Motz, B. A., Mallon, M. G., & Quick, J. D. (2021). Automated Educative Nudges to Reduce Missed Assignments in College. IEEE Transactions on Learning Technologies, 14(2), 189–200. https://doi.org/10.1109/tlt.2021.3064613
    29. Munshi, A. A., & Alhindi, A. (2021). Big Data Platform for Educational Analytics. IEEE Access, 9, 52883–52890. https://doi.org/10.1109/access.2021.3070737
    30. Nabil, A., Seyam, M., & Abou-Elfetouh, A. (2021). Prediction of Students’ Academic Performance Based on Courses’ Grades Using Deep Neural Networks. IEEE Access, 9, 140731–140746. https://doi.org/10.1109/access.2021.3119596
    31. Nguyen-Huy, T., Deo, R. C., Khan, S., Devi, A., Adeyinka, A. A., Apan, A. A., & Yaseen, Z. M. (2022). Student Performance Predictions for Advanced Engineering Mathematics Course With New Multivariate Copula Models. IEEE Access, 10, 45112–45136. https://doi.org/10.1109/access.2022.3168322
    32. Okoye, K., Daruich, S. D. N., De La O, J. F. E., Castano, R., Escamilla, J., & Hosseini, S. (2023). A Text Mining and Statistical Approach for Assessment of Pedagogical Impact of Students’ Evaluation of Teaching and Learning Outcome in Education. IEEE Access, 11, 9577–9596. https://doi.org/10.1109/access.2023.3239779
    33. Orlando, S., Gaudioso, E., & De La Paz, F. (2020). Supporting Teachers to Monitor Student’s Learning Progress in an Educational Environment With Robotics Activities. IEEE Access, 8, 48620–48631. https://doi.org/10.1109/access.2020.2978979
    34. Pek, R. Z., Ozyer, S. T., Elhage, T., Ozyer, T., & Alhajj, R. (2023). The Role of Machine Learning in Identifying Students At-Risk and Minimizing Failure. IEEE Access, 11, 1224–1243. https://doi.org/10.1109/access.2022.3232984
    35. Pinto, A., Abreu, A., Costa, E., Information, J. P.-J. of, & 2023, undefined. (2023). How Machine Learning (ML) is Transforming Higher Education: A Systematic Literature Review. Comum.Rcaap.PtAS Pinto, A Abreu, E Costa, J PaivaJournal of Information Systems Engineering and Management, 2023•comum.Rcaap.Pt, 2023(2), 21168. https://doi.org/10.55267/iadt.07.13227
    36. Prabowo, H., Hidayat, A. A., Cenggoro, T. W., Rahutomo, R., Purwandari, K., & Pardamean, B. (2021). Aggregating Time Series and Tabular Data in Deep Learning Model for University Students’ GPA Prediction. IEEE Access, 9, 87370–87377. https://doi.org/10.1109/access.2021.3088152
    37. Rafique, A., Khan, M. S., Jamal, M. H., Tasadduq, M., Rustam, F., Lee, E., Washington, P. B., & Ashraf, I. (2021). Integrating Learning Analytics and Collaborative Learning for Improving Student’s Academic Performance. IEEE Access, 9, 167812–167826. https://doi.org/10.1109/access.2021.3135309
    38. Rahman, M. M., Watanobe, Y., Matsumoto, T., Kiran, R. U., & Nakamura, K. (2022). Educational Data Mining to Support Programming Learning Using Problem-Solving Data. IEEE Access, 10, 26186–26202. https://doi.org/10.1109/access.2022.3157288
    39. Raza, M., & Faria, J. P. (2020). The ProcessPAIR Method for Automated Software Process Performance Analysis. IEEE Access, 8, 141569–141583. https://doi.org/10.1109/access.2020.3013328
    40. Ros, S., Gonzalez, S., Robles, A., Tobarra, L. L., Caminero, A., & Cano, J. (2020). Analyzing Students’ Self-Perception of Success and Learning Effectiveness Using Gamification in an Online Cybersecurity Course. IEEE Access, 8, 97718–97728. https://doi.org/10.1109/access.2020.2996361
    41. Sahlaoui, H., Alaoui, E. A. A., Nayyar, A., Agoujil, S., & Jaber, M. M. (2021). Predicting and Interpreting Student Performance Using Ensemble Models and Shapley Additive Explanations. IEEE Access, 9, 152688–152703. https://doi.org/10.1109/access.2021.3124270
    42. Samin, H., & Azim, T. (2019). Knowledge Based Recommender System for Academia Using Machine Learning: A Case Study on Higher Education Landscape of Pakistan. IEEE Access, 7, 67081–67093. https://doi.org/10.1109/access.2019.2912012
    43. Sanusi, I. T., Oyelere, S. S., Vartiainen, H., Suhonen, J., & Tukiainen, M. (2023). A systematic review of teaching and learning machine learning in K-12 education. Education and Information Technologies, 28(5), 5967–5997. https://doi.org/10.1007/S10639-022-11416-7/TABLES/5
    44. Schiff, D. (2021). Out of the laboratory and into the classroom: the future of artificial intelligence in education. Ai & Society, 36(1), 331. https://doi.org/10.1007/S00146-020-01033-8
    45. Sekeroglu, B., Abiyev, R., Ilhan, A., Arslan, M., & Idoko, J. B. (2021). Systematic Literature Review on Machine Learning and Student Performance Prediction: Critical Gaps and Possible Remedies. Applied Sciences 2021, Vol. 11, Page 10907, 11(22), 10907. https://doi.org/10.3390/APP112210907
    46. Setiawan, R., Arif, F. A. S., Putro, J. O., Princes, E., Silalahi, F. T. R., Geraldina, I., Julianti, E., & Safitri, J. (2023). E-Learning Pricing Model Policy for Higher Education. IEEE Access, 11, 38370–38384. https://doi.org/10.1109/access.2023.3266954
    47. Shao, Z., Sun, H., Wang, X., & Sun, Z. (2020). An Optimized Mining Algorithm for Analyzing Students’ Learning Degree Based on Dynamic Data. IEEE Access, 8, 113543–113556. https://doi.org/10.1109/access.2020.3001749
    48. Shi, Y., Peng, F., & Sun, F. (2022). A Blended Learning Model Based on Smart Learning Environment to Improve College Students’ Information Literacy. IEEE Access, 10, 89485–89498. https://doi.org/10.1109/access.2022.3201105
    49. Shi, Y., Sun, F., Zuo, H., & Peng, F. (2023). Analysis of Learning Behavior Characteristics and Prediction of Learning Effect for Improving College Students’ Information Literacy Based on Machine Learning. IEEE Access, 11, 50447–50461. https://doi.org/10.1109/access.2023.3278370
    50. Sindhu, I., Muhammad Daudpota, S., Badar, K., Bakhtyar, M., Baber, J., & Nurunnabi, M. (2019). Aspect-Based Opinion Mining on Student’s Feedback for Faculty Teaching Performance Evaluation. IEEE Access, 7, 108729–108741. https://doi.org/10.1109/access.2019.2928872
    51. Singelmann, L. N., & Ewert, D. L. (2022). Leveraging the Innovation-Based Learning Framework to Predict and Understand Student Success in Innovation. IEEE Access, 10, 36123–36139. https://doi.org/10.1109/access.2022.3163744
    52. Sun, Q., Wu, J., & Liu, K. (2020). Toward Understanding Students’ Learning Performance in an Object-Oriented Programming Course: The Perspective of Program Quality. IEEE Access, 8, 37505–37517. https://doi.org/10.1109/access.2020.2973470
    53. Tao, X., Shannon-Honson, A., Delaney, P., Li, L., Dann, C., Li, Y., & Xie, H. (2022). Data Analytics on Online Student Engagement Data for Academic Performance Modeling. IEEE Access, 10, 103176–103186. https://doi.org/10.1109/access.2022.3208953
    54. Webb, M. E., Fluck, A., Magenheim, J., Malyn-Smith, J., Waters, J., Deschênes, M., & Zagami, J. (2021). Machine learning for human learners: opportunities, issues, tensions and threats. Educational Technology Research and Development, 69(4), 2109–2130. https://doi.org/10.1007/S11423-020-09858-2/TABLES/1
    55. Wei, Y., Lv, H., Chen, M., Wang, M., Heidari, A. A., Chen, H., & Li, C. (2020). Predicting Entrepreneurial Intention of Students: An Extreme Learning Machine With Gaussian Barebone Harris Hawks Optimizer. IEEE Access, 8, 76841–76855. https://doi.org/10.1109/access.2020.2982796
    56. Wu, Y. C., & Liao, W. H. (2020). Analysis of Learning Patterns and Performance—A Case Study of 3-D Modeling Lessons in the K-12 Classrooms. IEEE Access, 8, 186976–186992. https://doi.org/10.1109/access.2020.3029947
    57. Yanes, N., Mostafa, A. M., Ezz, M., & Almuayqil, S. N. (2020). A Machine Learning-Based Recommender System for Improving Students Learning Experiences. IEEE Access, 8, 201218–201235. https://doi.org/10.1109/access.2020.3036336
    58. Yi, A. C. Y., Ying, T. K., Yee, S. J., Chin, W. M., & Tin, T. T. (2022). InPath Forum: A Real-Time Learning Analytics and Performance Ranking Forum System. IEEE Access, 10, 128536–128542. https://doi.org/10.1109/access.2022.3227430
    59. Yoo, J. E., Rho, M., & Lee, Y. (2022). Online Students’ Learning Behaviors and Academic Success: An Analysis of LMS Log Data From Flipped Classrooms via Regularization. IEEE Access, 10, 10740–10753. https://doi.org/10.1109/access.2022.3144625
    60. Yue, J., Tian, F., Chao, K. M., Shah, N., Li, L., Chen, Y., & Zheng, Q. (2019). Learning Unit State Recognition Based on Multi-channel Data Fusion. IEEE Access, 7, 149554–149567. https://doi.org/10.1109/access.2019.2947091
    61. Zhao, L., Chen, K., Song, J., Zhu, X., Sun, J., Caulfield, B., & Namee, B. Mac. (2021). Academic Performance Prediction Based on Multisource, Multifeature Behavioral Data. IEEE Access, 9, 5453–5465. https://doi.org/10.1109/access.2020.3002791
    62. Zhen, Y., Luo, J. Der, & Chen, H. (2023). Prediction of Academic Performance of Students in Online Live Classroom Interactions—An Analysis Using Natural Language Processing and Deep Learning Methods. Journal of Social Computing, 4(1), 12–29. https://doi.org/10.23919/jsc.2023.0007

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Copyright (c) 2023 Malque Publishing

How to cite

Pandian, B. R., Aziz, A. A., Subramaniam, H., & Nawi, H. S. A. (2024). Exploring the role of machine learning in forecasting student performance in education: An in-depth review of literature. Multidisciplinary Reviews, 6, 2023ss043. https://doi.org/10.31893/multirev.2023ss043
  • Article viewed - 436
  • PDF downloaded - 265