Volume 03 Issue 06 June 2024
¹Lukman Nadjamuddin, ²Arwansyah Arwansyah, ³Sukmawati Sukmawati, ⁴Windayanti
¹,⁴Department of History Education, Faculty of Teacher Training and Education, Tadulako University, Palu, Indonesia.
²Department of Chemistry Education, Faculty of Teacher Training and Education, Tadulako University, Palu, Indonesia.
³Department of Civic Education, Faculty of Teacher Training and Education, Tadulako University, Palu, Indonesia.
DOI: https://doi.org/10.58806/ijsshmr.2024.v3i6n13
ABSTRACT
In this study, data mining was used to identify which common variables, namely “gender”, “age”, “GPA first semester”, “GPA second semester”, “organization activity”, “part time job”, “living place”, “family income”, “father education”, and “mother education”, influence the grade point average (GPA) of third-semester students at the Department of History Education. Three methods were employed: logistic regression (LR), decision tree (DT), and support vector machine (SVM). According to the validation results, the decision tree model performed best, with an accuracy of 0.96, and all three models discriminate better than chance, since the AUC value for every class is greater than 0.5. These findings indicate that the above variables are linked to student achievement. Attention to these aspects is therefore critical for improving academic performance.
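The two validation metrics named in the abstract, accuracy and per-class AUC, can be illustrated with a minimal sketch in plain Python. This is not the authors' pipeline: the three GPA categories ("low", "medium", "high"), the six students, and the score values are hypothetical toy data, and the function names are chosen only for this illustration. The per-class AUC is computed one-vs-rest as the probability that a randomly chosen member of the class receives a higher score than a randomly chosen non-member, which is why 0.5 corresponds to chance-level ranking.

```python
from typing import Dict, Hashable, Sequence


def accuracy(y_true: Sequence[Hashable], y_pred: Sequence[Hashable]) -> float:
    """Fraction of predictions that match the true labels."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def binary_auc(is_positive: Sequence[bool], scores: Sequence[float]) -> float:
    """AUC as the chance that a random positive outscores a random negative.

    Ties count as half a win (the Mann-Whitney U formulation).
    """
    pos = [s for flag, s in zip(is_positive, scores) if flag]
    neg = [s for flag, s in zip(is_positive, scores) if not flag]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def one_vs_rest_auc(
    y_true: Sequence[Hashable],
    class_scores: Dict[Hashable, Sequence[float]],
) -> Dict[Hashable, float]:
    """Per-class AUC: each class in turn is 'positive', all others 'negative'."""
    return {
        label: binary_auc([t == label for t in y_true], scores)
        for label, scores in class_scores.items()
    }


# Hypothetical toy data: six students, three assumed GPA categories.
y_true = ["high", "high", "medium", "medium", "low", "low"]

# Hypothetical per-class scores from some classifier (one value per student).
class_scores = {
    "high":   [0.80, 0.70, 0.20, 0.10, 0.10, 0.05],
    "medium": [0.15, 0.20, 0.60, 0.30, 0.40, 0.15],
    "low":    [0.05, 0.10, 0.20, 0.60, 0.50, 0.80],
}

# Predicted label for each student = class with the highest score.
y_pred = [
    max(class_scores, key=lambda label: class_scores[label][i])
    for i in range(len(y_true))
]

print("accuracy:", round(accuracy(y_true, y_pred), 3))
print("per-class AUC:", one_vs_rest_auc(y_true, class_scores))
```

In this toy data one "medium" student is misclassified as "low", so accuracy falls below 1 while every per-class AUC stays above 0.5. The two metrics answer different questions: accuracy counts correct labels, whereas AUC measures how well the scores rank members of a class above non-members.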