Educational data mining is widely deployed to extract valuable information and patterns from academic data. This research explores new features that can help predict the future performance of undergraduate students and identify at-risk students early on. It answers some crucial and intuitive questions that are not addressed by previous studies. Most of the existing research is conducted on data from 2-3 years in an absolute grading scheme. We examined the effects of historical academic data of 15 years on predictive modeling. Additionally, we explore the performance of undergraduate students in a relative grading scheme and examine the effects of grades in core courses and initial semesters on future performances. As a pilot study, we analyzed the academic performance of Computer Science university students. Many exciting discoveries were made; the duration and size of the historical data play a significant role in predicting future performance, mainly due to changes in curriculum, faculty, society, and evolving trends. Furthermore, predicting grades in advanced courses based on initial pre-requisite courses is challenging in a relative grading scheme, as students’ performance depends not only on their efforts but also on their peers. In short, educational data mining can come to the rescue by uncovering valuable insights from academic data to predict future performance and identify the critical areas that need significant improvement.
Distance Learning has enabled educational practices based on digital platforms, generating massive amounts of data. Several initiatives use this data to identify dropout contexts, mainly providing teacher support about student behavior. Approaches such as Active Methodologies are known as having good potential to involve and motivate students. This article presents a systematic mapping aiming to identify current Educational Data Mining and Learning Analytics methods. Besides, we identify Active Methodologies’ application to mitigate dropout in Distance Learning. We evaluated 668 papers published from January 2015 to March 2020. The results indicate a growing application of Educational Data Mining and Learning Analytics to identify and mitigate students’ abandonment in Distance Learning. However, studies with Active Methodologies to minimize dropout and enhance student permanence are scarce. Some works suggest Active Methods as a possible complement of Learning Analytics in dropout.
The paper aims to present application of Educational Data Mining and particularly Case-Based Reasoning (CBR) for students profiling and further to design a personalised intelligent learning system. The main aim here is to develop a recommender system which should help the learners to create learning units (scenarios) that are the most suitable for them. First of all, systematic literature review on application of CBR and its possible implementation to personalise learning was performed in the paper. After that, methodology on CBR application to personalise learning is presented where learning styles play a dominate role as key factor in proposed personalised intelligent learning system model based on students profiling and personalised learning process model. The algorithm (the sequence of steps) to implement this model is also presented in the paper.
This paper considers the use of log data provided by learning management systems when studying whether students obey the problem-based learning (PBL) method. Log analysis turns out to be a valuable tool in measuring the use of the learning material of interest. It gives reliable figures concerning not only the number of use sessions but also the interlocking of various course activities. The longitudinal study based on log analysis makes use of a new software tool, SPY US. Our study concentrates on using log data analysis in improving the PBL method used in learning diagnostic skills with the help of Virtual Patients.
Student evaluations to measure the teaching effectiveness of instructor's are very frequently applied in higher education for many years. This study investigates the factors associated with the assessment of instructors teaching performance using two different data mining techniques; stepwise regression and decision trees. The data collected anonymously from students' evaluations of Management Information Systems department at Bogazici University. Additionally, variables related to other instructor and course characteristics are also included in the study. The results show that, a factor summarizing the instructor related questions in the evaluation form, the employment status of the instructor, the workload of the course, the attendance of the students, and the percentage of the students filling the form are significant dimensions of instructor's teaching performance.