Implementasi Klasifikasi Penyakit Liver dengan Teknik Penanganan Data Tidak Seimbang Synthetic Minority Oversampling Technique – Edited Nearest Neighbor
| dc.contributor.author | Nurifatul Laily | |
| dc.date.accessioned | 2026-06-23T08:05:57Z | |
| dc.date.issued | 2026-04-27 | |
| dc.description | Approved by Teddy | |
| dc.description.abstract | Liver disease is a general term for disorders or abnormalities affecting the liver, including fatty liver disease, cirrhosis, hepatitis, liver cancer, and liver tumors. Because its symptoms are non-specific, liver disease is difficult to diagnose at an early stage, yet early diagnosis enables timely treatment and helps prevent progression to more severe stages. Machine learning classification methods can support early diagnosis, but imbalanced training data can bias the classification model. In this research, a liver disease classification model is built using the k-Nearest Neighbor (k-NN), Random Forest (RF), and Extreme Gradient Boosting (XGBoost) algorithms, with Synthetic Minority Oversampling Technique - Edited Nearest Neighbors (SMOTE-ENN) applied to handle the imbalanced data. SMOTE-ENN was applied with several values of the SMOTE sampling_strategy hyperparameter, and hyperparameter tuning was performed on both SMOTE-ENN and the classification algorithms using Grid Search. Based on accuracy, precision, recall, and F1-score, the RF model with a 90:10 data split performed best under two SMOTE sampling_strategy settings, the default and {0:475}; both configurations achieved accuracy, precision, recall, and F1-score of 88%, 93%, 90%, and 92%, respectively. After further evaluation using learning curves, cross-validation, a confusion matrix heatmap, the Receiver Operating Characteristic (ROC) curve, and the Precision-Recall (PR) curve, the RF model with SMOTE sampling_strategy = {0:475} was selected for implementation in the web application. | |
| dc.description.sponsorship | Muhammad ‘Ariful Furqon, S.Pd., M.Kom.; Gama Wisnu Fajarianto, S.Kom., M.Kom. | |
| dc.identifier.uri | https://repository.unej.ac.id/handle/123456789/9932 | |
| dc.publisher | Fakultas Ilmu Komputer | |
| dc.subject | Liver Disease | |
| dc.subject | SMOTE-ENN | |
| dc.subject | k-Nearest Neighbor | |
| dc.subject | Random Forest | |
| dc.subject | Extreme Gradient Boosting | |
| dc.title | Implementasi Klasifikasi Penyakit Liver dengan Teknik Penanganan Data Tidak Seimbang Synthetic Minority Oversampling Technique – Edited Nearest Neighbor | |
| dc.type | Thesis | |
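
The abstract describes SMOTE-ENN as oversampling the minority class and then cleaning the result with Edited Nearest Neighbors. The following is a minimal pure-Python sketch of that two-step idea, not the thesis code: the record does not name the library that was used, the toy data is made up, and the function names, the `n_target` parameter (loosely analogous to a `sampling_strategy` dictionary value such as `{0:475}`), and the majority-vote rule in the ENN step are all assumptions chosen for illustration.

```python
import math
import random
from collections import Counter


def k_nearest(idx, X, k, candidates):
    """Indices of the k points among `candidates` closest to X[idx], excluding idx."""
    others = [j for j in candidates if j != idx]
    others.sort(key=lambda j: math.dist(X[idx], X[j]))
    return others[:k]


def smote(X, y, minority, n_target, k=5, seed=0):
    """Grow class `minority` to n_target rows by interpolating between neighbors."""
    rng = random.Random(seed)
    X, y = list(X), list(y)
    minority_idx = [i for i, label in enumerate(y) if label == minority]
    for _ in range(max(0, n_target - len(minority_idx))):
        i = rng.choice(minority_idx)                       # a real minority row
        j = rng.choice(k_nearest(i, X, k, minority_idx))   # one of its minority neighbors
        gap = rng.random()                                 # position along the segment i -> j
        X.append(tuple(a + gap * (b - a) for a, b in zip(X[i], X[j])))
        y.append(minority)
    return X, y


def edited_nearest_neighbours(X, y, k=3):
    """Drop rows whose label disagrees with the majority of their k nearest neighbors.

    Assumption: a majority-vote rule; library implementations may offer other rules.
    """
    everyone = range(len(X))
    keep = []
    for i in everyone:
        votes = Counter(y[j] for j in k_nearest(i, X, k, everyone))
        if votes.most_common(1)[0][0] == y[i]:
            keep.append(i)
    return [X[i] for i in keep], [y[i] for i in keep]


# Made-up 2-D toy data: 40 rows of class 1 (majority), 10 rows of class 0 (minority).
data_rng = random.Random(42)
majority_pts = [(data_rng.gauss(0, 1), data_rng.gauss(0, 1)) for _ in range(40)]
minority_pts = [(data_rng.gauss(2.5, 0.5), data_rng.gauss(2.5, 0.5)) for _ in range(10)]
X = majority_pts + minority_pts
y = [1] * 40 + [0] * 10

# Step 1: oversample class 0 up to 40 rows. Step 2: clean both classes with ENN.
X_sm, y_sm = smote(X, y, minority=0, n_target=40, k=5)
X_res, y_res = edited_nearest_neighbours(X_sm, y_sm, k=3)

print("after SMOTE:", Counter(y_sm))
print("after ENN:  ", Counter(y_res))
```

In a workflow like the one the abstract outlines, resampling of this kind would normally be fitted on the training portion of the split only, so that synthetic rows do not leak into the evaluation data.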
