BibTex Citation Data :
@article{J.Gauss40168, author = {Hagi Afdal Fatan and Tatik Widiharih and Sudarno Sudarno}, title = {KLASIFIKASI KUALITAS KOPI ARABIKA DENGAN METODE RANDOM FOREST DAN K-NEAREST NEIGHBOR PADA IMBALANCED DATASET}, journal = {Jurnal Gaussian}, volume = {14}, number = {1}, year = {2025}, keywords = {Classification, Arabica Coffee, SMOTE, K-Nearest Neighbor, Random Forest}, abstract = { Coffee is a superior plantation commodity in the export sector with high economic value. Coffee quality is the most important factor affecting the selling price, so coffee quality assessment is the main key in setting market prices and determining the export potential of coffee-producing countries. Coffee quality is divided into specialty, premium and regular based on bean defects and taste test values. Coffee quality prediction is needed to find out which coffee has the best quality. This study compares the Random Forest and K-Nearest Neighbor (KNN) methods to find out which algorithm is most effective in predicting coffee quality. The working principle of Random Forest is to build more than one decision tree and then determine the estimated value based on majority voting. KNN classifies data based on the distance between the data and other data. The coffee dataset used is sourced from the Coffee Quality Institute (CQI) Database. The data has problems to match resulting in a small recall value in the minority class, the SMOTE oversampling algorithm is used to improve classification performance. The advantage of oversampling compared to undersampling is that it does not lose data information. The results showed that the Random Forest method after SMOTE produced the best classification performance with accuracy and memory values of 80.26% and 80.59%, respectively. }, issn = {2339-2541}, pages = {107--117} doi = {10.14710/j.gauss.14.1.107-117}, url = {https://ejournal3.undip.ac.id/index.php/gaussian/article/view/40168} }
Refworks Citation Data :
Coffee is a superior plantation commodity in the export sector with high economic value. Coffee quality is the most important factor affecting the selling price, so coffee quality assessment is the main key in setting market prices and determining the export potential of coffee-producing countries. Coffee quality is divided into specialty, premium and regular based on bean defects and taste test values. Coffee quality prediction is needed to find out which coffee has the best quality. This study compares the Random Forest and K-Nearest Neighbor (KNN) methods to find out which algorithm is most effective in predicting coffee quality. The working principle of Random Forest is to build more than one decision tree and then determine the estimated value based on majority voting. KNN classifies data based on the distance between the data and other data. The coffee dataset used is sourced from the Coffee Quality Institute (CQI) Database. The data has problems to match resulting in a small recall value in the minority class, the SMOTE oversampling algorithm is used to improve classification performance. The advantage of oversampling compared to undersampling is that it does not lose data information. The results showed that the Random Forest method after SMOTE produced the best classification performance with accuracy and memory values of 80.26% and 80.59%, respectively.
Note: This article has supplementary file(s).
Article Metrics:
Last update:
The Authors submitting a manuscript do so on the understanding that if accepted for publication, copyright of the article shall be assigned to Media Statistika journal and Department of Statistics, Universitas Diponegoro as the publisher of the journal. Copyright encompasses the rights to reproduce and deliver the article in all form and media, including reprints, photographs, microfilms, and any other similar reproductions, as well as translations.
Jurnal Gaussian and Department of Statistics, Universitas Diponegoro and the Editors make every effort to ensure that no wrong or misleading data, opinions or statements be published in the journal. In any way, the contents of the articles and advertisements published in Jurnal Gaussian journal are the sole and exclusive responsibility of their respective authors and advertisers.
The Copyright Transfer Form can be downloaded here: [Copyright Transfer Form Jurnal Gaussian]. The copyright form should be signed originally and send to the Editorial Office in the form of original mail, scanned document or fax :
Dr. Rukun Santoso (Editor-in-Chief) Editorial Office of Jurnal GaussianDepartment of Statistics, Universitas DiponegoroJl. Prof. Soedarto, Kampus Undip Tembalang, Semarang, Central Java, Indonesia 50275Telp./Fax: +62-24-7474754Email: jurnalgaussian@gmail.com
Jurnal Gaussian by Departemen Statistika Undip is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Visitor Number:
View statistics