Volume 7 , Issue 2 , PP: 51-62, 2024 | Cite this article as | XML | Html | PDF | Full Length Article
Manish Kumar Singla 1 * , Faris H. Rizk 2 , Mahmoud Elshabrawy Mohamed 3 , Ahmed Mohamed Zaki 4
Doi: https://doi.org/10.54216/JAIM.070205
Student performance prediction is essential so that institutions can assist in identifying weak performers and initiate corrective measures. This research assesses different regression models by applying data from Kaggle, which involves data cleaning like managing missing values and scaling of the data, hence feature extraction, then model imposition and authenticity. The models followed are Linear Regression, SVR, MLPRegressor, Gradient Boosting, Catboost, Xgboost, Random Forest, Extratrees, Decision Tree and K-neighbors. The analysis shows that Linear Regression produced the best result as it has the lowest MSE score of 0.000521 and high accuracy regarding other measures, including RMSE, MAE, and R². The results reveal that regression models can be used to predict students’ performance and be helpful to the various stakeholders in the system. The findings of this study will help develop required models for decision-making to improve students’performance.
Student performance prediction, regression models, educational data, data preprocessing, predictive analytics
[1] Raed Alamri and Basma Alharbi. Explainable student performance prediction models: A systematic review. IEEE Access, 9:33132–33143, 2021.
[2] Bashayer Albreiki, Nazar Zaki, and Hamdi Alashwal. A systematic literature review of student’ performance prediction using machine learning techniques. Education Sciences, 11(9):Article 9, 2021.
[3] Abderrahmane Asselman, Mohamed Khaldi, and Salma Aammou. Enhancing the prediction of student performance based on the machine learning xgboost algorithm. Interactive Learning Environments, 31(6):3360–3379, 2023.
[4] S,enay Aydo˘gdu. Predicting student final performance using artificial neural networks in online learning environments. Education and Information Technologies, 25(3):1913–1927, 2020.
[5] Sobia Batool, Jibran Rashid, Muhammad Waqas Nisar, Jinyoung Kim, Hyung-Yoon Kwon, and Abid Hussain. Educational data mining to predict students’ academic performance: A survey study. Education and Information Technologies, 28(1):905–971, 2023.
[6] Ghaith Ben Brahim. Predicting student performance from online engagement activities using novel statistical features. Arabian Journal for Science and Engineering, 47(8):10225–10243, 2022.
[7] Prashant Dabhade, Ruchir Agarwal, K. P. Alameen, A. T. Fathima, R. Sridharan, and G. Gopakumar. Educational data mining for predicting students’ academic performance using machine learning algorithms. Materials Today: Proceedings, 47:5260–5267, 2021.
[8] Caroline M. A. Gomes, Antonio Amantes, and Edmilson G. Jelihovschi. Applying the regression tree method to predict students’ science achievement. Trends in Psychology, 28(1):99–117, 2020.
[9] Shabbir Hussain and Muhammad Qamar Khan. Student-performulator: Predicting students’ academic performance at secondary and intermediate level using machine learning. Annals of Data Science, 10(3):637–655, 2023.
[10] Peng Jiao, Fang Ouyang, Qiang Zhang, and Amir H. Alavi. Artificial intelligence-enabled prediction model of student academic performance in online engineering education. Artificial Intelligence Review, 55(8):6321–6344, 2022.
[11] Ajmal Khan and Suman Kumar Ghosh. Student performance analysis and prediction in classroom learning: A review of educational data mining studies. Education and Information Technologies, 26(1):205– 240, 2021.
[12] Imtiaz Khan, A. R. Ahmad, Noura Jabeur, and Mohammed N. Mahdi. An artificial intelligence approach to monitor student performance and devise preventive measures. Smart Learning Environments, 8(1):17, 2021.
[13] Ahmed Nabil, Mina Seyam, and Alaa Abou-Elfetouh. Prediction of students’ academic performance based on courses’ grades using deep neural networks. IEEE Access, 9:140731–140746, 2021.
[14] Ahmad Namoun and Abdullah Alshanqiti. Predicting student performance using data mining and learning analytics techniques: A systematic literature review. Applied Sciences, 11(1):Article 1, 2021.
[15] Salma Rebai, Faycal Ben Yahia, and Hatem Essid. A graphically based machine learning approach to predict secondary schools performance in tunisia. Socio-Economic Planning Sciences, 70:100724, 2020.
[16] Marta Riestra-Gonz´alez, Marta del P. Paule-Ru´ız, and Fernando Ortin. Massive lms log data analysis for the early prediction of course-agnostic student performance. Computers & Education, 163:104108, 2021.
[17] Camilo F. Rodr´ıguez-Hern´andez, Mariel Musso, Eva Kyndt, and Eduardo Cascallar. Artificial neural networks in academic performance prediction: Systematic implementation and predictor evaluation. Computers and Education: Artificial Intelligence, 2:100018, 2021.
[18] Nikola Tomasevic, Nataˇsa Gvozdenovic, and Sanja Vranes. An overview and comparison of supervised data mining techniques for student exam performance prediction. Computers & Education, 143:103676, 2020.
[19] Huma Waheed, Sameer-Ul Hassan, Nasser R. Aljohani, Jan Hardman, Saleh Alelyani, and Rameez Nawaz. Predicting academic performance of students from vle big data using deep learning models. Computers in Human Behavior, 104:106189, 2020.
[20] Mustafa Ya˘gcı. Educational data mining: Prediction of students’ academic performance using machine learning algorithms. Smart Learning Environments, 9(1):11, 2022.
[21] Bilal Khushal Yousafzai, Muhammad Hayat, and Sadaf Afzal. Application of machine learning and data mining in predicting the performance of intermediate and secondary education level student. Education and Information Technologies, 25(6):4677–4697, 2020.
[22] Hussein Zeineddine, Udo Braendle, and Anton Farah. Enhancing prediction of student success: Automated machine learning approach. Computers & Electrical Engineering, 89:106903, 2021.