Implementasi Klasifikasi Penyakit Liver dengan Teknik Penanganan Data Tidak Seimbang Synthetic Minority Oversampling Technique – Edited Nearest Neighbor

dc.contributor.authorNurifatul Laily
dc.date.accessioned2026-06-23T08:05:57Z
dc.date.issued2026-04-27
dc.descriptionApproved by Teddy
dc.description.abstractLiver disease is a general term that refers to various disorders or abnormalities affecting the liver, including fatty liver disease, cirrhosis, hepatitis, liver cancer, liver tumors, and others. Due to non-specific symptoms, liver disease is challenging to diagnose at an early stage. However, early diagnosis is important to enable timely treatment, thereby preventing disease progression to more severe stages. Machine learning can be used for the early diagnosis of liver disease using classification methods. However, if the data used is imbalanced, it can bias the classification model. In this research, a liver disease classification model is built using the k-Nearest Neighbor (k-NN), Random Forest (RF), and Extreme Gradient Boosting (XGBoost) algorithms by applying Synthetic Minority Oversampling Technique - Edited Nearest Neighbors (SMOTE-ENN) to handle imbalanced data. SMOTE-ENN was applied by experimenting with several values of the sampling_strategy (SMOTE) hyperparameter. Hyperparameter tuning was also performed on both SMOTE-ENN and the classification algorithm using Grid Search. Based on accuracy, precision, recall, and F1-score, the RF model with sampling_strategy (SMOTE) = default and {0:475} using a 90:10 data split achieved the best performance. Both configurations achieved accuracy, precision, recall, and F1-score of 88%, 93%, 90%, and 92%, respectively. Based on further evaluation using learning curve, cross-validation, confusion matrix heatmap, Receiver Operating Characteristic (ROC) curve, and Precision-Recall (PR) curve, the selected model for implementation in the web application was the RF model with sampling_strategy (SMOTE) = {0:475}.
dc.description.sponsorshipMuhammad ‘Ariful Furqon, S.Pd., M.Kom. Gama Wisnu Fajarianto S.Kom., M.Kom.
dc.identifier.urihttps://repository.unej.ac.id/handle/123456789/9932
dc.publisherFakultas Ilmu Komputer
dc.subjectLiver Disease
dc.subjectSMOTE-ENN
dc.subjectk-Nearest Neighbor
dc.subjectRandom Forest
dc.subjectExtreme Gradient Boosting
dc.titleImplementasi Klasifikasi Penyakit Liver dengan Teknik Penanganan Data Tidak Seimbang Synthetic Minority Oversampling Technique – Edited Nearest Neighbor
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Skripsi Repository.pdf
Size:
1.42 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed to upon submission
Description: