BibTex Citation Data :
@article{J.Gauss28915, author = {Muhamad Syukron and Rukun Santoso and Tatik Widiharih}, title = {PERBANDINGAN METODE SMOTE RANDOM FOREST DAN SMOTE XGBOOST UNTUK KLASIFIKASI TINGKAT PENYAKIT HEPATITIS C PADA IMBALANCE CLASS DATA}, journal = {Jurnal Gaussian}, volume = {9}, number = {3}, year = {2020}, keywords = {Fibrosis, Cirrhosis, Random Forest, SMOTE, XGboost}, abstract = { Hepatitis causes around 1.4 million people die every year. This number makes hepatitis to be the largest contagious disease in the number of deaths after tuberculosis. Liver biopsy is still the best method for diagnosing the stage of hepatitis C, but this method is an invasive, painful, expensive, and can cause complications. Non-invasively method needs to be developed, one of non-invasif method is machine learning. Random Forest and XGboost are classification methods that are often used, since they have many advantages over classical classification methods. The SMOTE algorithm can be used to improve the accuracy of predictions from imbalanced data. the data in this study have 24 independent variables in the form of patients self-data, hepatitis C symptoms, and laboratory test results. The dependent variable in this study is a binary category, namely the level of hepatitis C disease (fibrosis and cirrhosis). The results showed that the random forest and XGboost had an accuracy of around 74% but the recall value was less than 2%. SMOTE random forest dan SMOTE XGboost have an accuracy & recall value more than 75%. SMOTE random forest has a higher accuracy for predicting fibrosis class while SMOTE XGboost is better in cirrhosis class. Variables that are more influental in determining hepatitis C stage are variables from laboratory test. Keyword : Fibrosis, Cirrhosis, Random Forest, SMOTE, XGboost }, issn = {2339-2541}, pages = {227--236} doi = {10.14710/j.gauss.9.3.227-236}, url = {https://ejournal3.undip.ac.id/index.php/gaussian/article/view/28915} }
Refworks Citation Data :
Hepatitis causes around 1.4 million people die every year. This number makes hepatitis to be the largest contagious disease in the number of deaths after tuberculosis. Liver biopsy is still the best method for diagnosing the stage of hepatitis C, but this method is an invasive, painful, expensive, and can cause complications. Non-invasively method needs to be developed, one of non-invasif method is machine learning. Random Forest and XGboost are classification methods that are often used, since they have many advantages over classical classification methods. The SMOTE algorithm can be used to improve the accuracy of predictions from imbalanced data. the data in this study have 24 independent variables in the form of patients self-data, hepatitis C symptoms, and laboratory test results. The dependent variable in this study is a binary category, namely the level of hepatitis C disease (fibrosis and cirrhosis). The results showed that the random forest and XGboost had an accuracy of around 74% but the recall value was less than 2%. SMOTE random forest dan SMOTE XGboost have an accuracy & recall value more than 75%. SMOTE random forest has a higher accuracy for predicting fibrosis class while SMOTE XGboost is better in cirrhosis class. Variables that are more influental in determining hepatitis C stage are variables from laboratory test.
Keyword : Fibrosis, Cirrhosis, Random Forest, SMOTE, XGboost
Article Metrics:
Last update:
The Authors submitting a manuscript do so on the understanding that if accepted for publication, copyright of the article shall be assigned to Media Statistika journal and Department of Statistics, Universitas Diponegoro as the publisher of the journal. Copyright encompasses the rights to reproduce and deliver the article in all form and media, including reprints, photographs, microfilms, and any other similar reproductions, as well as translations.
Jurnal Gaussian and Department of Statistics, Universitas Diponegoro and the Editors make every effort to ensure that no wrong or misleading data, opinions or statements be published in the journal. In any way, the contents of the articles and advertisements published in Jurnal Gaussian journal are the sole and exclusive responsibility of their respective authors and advertisers.
The Copyright Transfer Form can be downloaded here: [Copyright Transfer Form Jurnal Gaussian]. The copyright form should be signed originally and send to the Editorial Office in the form of original mail, scanned document or fax :
Dr. Rukun Santoso (Editor-in-Chief) Editorial Office of Jurnal GaussianDepartment of Statistics, Universitas DiponegoroJl. Prof. Soedarto, Kampus Undip Tembalang, Semarang, Central Java, Indonesia 50275Telp./Fax: +62-24-7474754Email: jurnalgaussian@gmail.com
Jurnal Gaussian by Departemen Statistika Undip is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Visitor Number:
View statistics