Mining student information system records to predict students’ academic performance

Educational Data Mining (EDM) is an emerging field that is concerned with mining and exploring the useful patterns in educational data. The main objective of this study is to predict the students’ academic performance based on a new dataset extracted from a student information system. The dataset wa...

Full description

Bibliographic Details
Main Authors: Saa, Amjad Abu, Al-Emran, Mostafa, Shaalan, Khaled
Format: Conference or Workshop Item
Language:English
Published: Springer Verlag 2020
Subjects:
Online Access:http://umpir.ump.edu.my/id/eprint/24977/
http://umpir.ump.edu.my/id/eprint/24977/
http://umpir.ump.edu.my/id/eprint/24977/1/Mining%20student%20information%20system%20records%20to%20predict%20students%E2%80%99.pdf
Description
Summary:Educational Data Mining (EDM) is an emerging field that is concerned with mining and exploring the useful patterns in educational data. The main objective of this study is to predict the students’ academic performance based on a new dataset extracted from a student information system. The dataset was extracted from a private university in the United Arab of Emirates (UAE). The dataset includes 34 attributes and 56,000 records related to students’ information. The empirical results indicated that the Random Forest (RF) algorithm was the most appropriate data mining technique used to predict the students’ academic performance. It is also revealed that the most important attributes that have a direct effect on the students’ academic performance are belonged to four main categories, namely students’ demographics, student previous performance information, course and instructor information, and student general information. The evidence from this study would assist the higher educational institutions by allowing the instructors and students to identify the weaknesses and factors affecting the students’ performance, and act as an early warning system for predicting the students’ failures and low academic performance.