skip to main content

ANALISIS KLASIFIKASI MENGGUNAKAN REGRESI LOGISTIK BINER DAN K-NEAREST NEIGHBOR PADA DATA IMBALANCE

*Eva Fitriyani  -  Departemen Statistika, Fakultas Sains dan Matematika, Universitas Diponegoro, Indonesia
Tatik Widiharih  -  Departemen Statistika, Fakultas Sains dan Matematika, Universitas Diponegoro, Indonesia
Bagus Arya Saputra  -  Departemen Statistika, Fakultas Sains dan Matematika, Universitas Diponegoro, Indonesia
Open Access Copyright 2026 Jurnal Gaussian under http://creativecommons.org/licenses/by-nc-sa/4.0.

Citation Format:
Abstract

Savings and Loan Cooperative or (KSP) is a cooperative that conducts its business activities only saving and borrowing. KSP members come from various different backgrounds so that they can affect their behavior in carrying out their obligations. To find out the status of current or bad customer payments, a classification process is carried out. The division of KSP customer data is carried out in the classification process into two, namely training data and test data. In the classification process, there are often cases of data imbalance, so it is necessary to handle data imbalance in training data with SMOTE and ADASYN. SMOTE and ADASYN were chosen because these methods handle imbalance data by generating data from minor classes so as not to eliminate important parts of the data. Classification was performed with Binary Logistic Regression and K-Nearest Neighbor. Binary Logistic Regression is a regression where the dependent variable is binary. While K-Nearest Neighbor is a grouping method based on the closeness of the distance of a data with other data as many as k nearest neighbors. The results of this study indicate that the ADASYN Binary Logistic Regression method is the best method that can classify and predict the payment status of KSP customers because it produces the highest accuracy and G-mean, namely the accuracy value of 70.67% and G-Mean 67.63%.

Keywords: KSP; SMOTE; ADASYN; Binary Logistic Regression; K-Nearest Neighbor; Accuracy

Article Metrics:

Article Info
Section: Articles
Language : EN
  1. Agresti, A. 2007. An Introduction to Categorical Data Analysis. New York: John Wiley and Sons
  2. Bagaskoro, G., N., Fauzi, M.A., dan Adikara, P.P. 2018. Penerapan Klasifikasi Tweets pada Berita Twitter Menggunakan Metode K-Nearest Neighbor dan Query Expansion Berbasis Distributional Semantic. Jurnal Pengembangan Teknologi Informasi dan Ilmu Komputer Vol. 2, No. 10 Hal. 3849-3855
  3. Bhatia, M., Vandana., 2010. Survey of Nearest Neighbor Techniques. International Journal of Computer Science and Information Security 8, 1947-5500
  4. Chawla, N. V., Bowyer, K. W., Hall, L. O., dan Kegelmeyer, W. P. 2002. SMOTE: Synthetic Minority Over-Sampling Technique. Journal of Artificial Intelligence Research, 16,321-357
  5. Choi, J. M. 2010. A Selective Sampling Method for Imbalanced Data Learning on Support Vector Machines. Graduate Theses and Dissertations, Paper 11529
  6. Han J, Kamber M, J. P. 2011. Data Mining Concept and Techniques Third Edition
  7. He, H., Bai, Y., Garcia, E. A., dan Li, S. 2008. ADASYN: Adaptive Syntethic Sampling Approach for Imbalanced Learning. In 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence) (pp. 1322-1328). IEEE
  8. He, H., dan Garcia, E. A. 2009. Learning from Imbalanced Data, IEEE Trans. Knowl. Discov. 21(9) 1263-1284
  9. Hosmer, D. W., dan Lemeshow, S. 2000. Apllied Logistic Regression. New York: John Wiley & Sons
  10. Kubat, M., Holte, R., dan Matwin, S. 1997. Learning When Negative Examples Abound. In European conference on machine learning (pp. 146-153). Springer, Berlin, Heidelberg
  11. Prasetyo, E. 2012. Data Mining Konsep dan Aplikasi Menggunakan MATLAB. Yogyakarta: ANDI Yogyakarta
  12. Republik Indonesia. 2012. Undang-Undang Republik Indonesia Nomor 17 Tahun 2012. Tentang Perkoperasian. Pemerintah Pusat
  13. Sreemathy, J., dan Balamurugan, P. S. 2012. An Efficient Text Classification using KNN and Nai ̈ve Bayes. International Journal on Computer Science and Engineering, 4(3), 392

Last update:

No citation recorded.

Last update:

No citation recorded.