Analisis Pengaruh SMOTE terhadap Kinerja Model KNN untuk Prediksi Risiko Stroke
Abstract
Penelitian ini membahas masalah ketidakseimbangan data dalam klasifikasi risiko stroke, di mana kasus non-stroke secara signifikan lebih rendah daripada kasus stroke. Ketidakseimbangan kelas cenderung menimbulkan bias terhadap kelas mayoritas, yang menyebabkan berkurangnya efektivitas klasifikasi. Untuk mengatasi hal ini, SMOTE (Synthetic Minority Over-sampling Technique) digunakan untuk mengatasi ketidakseimbangan kelas dalam dataset dan algoritma K-Nearest Neighbor (KNN) digunakan untuk klasifikasi. Dataset mengalami preprocessing, aplikasi SMOTE, dan algoritma KNN dilatih dan dievaluasi menggunakan metrik standar termasuk akurasi, presisi, recall, dan F1-score. Penerapan SMOTE bersama dengan KNN menghasilkan peningkatan yang signifikan dalam hasil klasifikasi, mencapai akurasi 91,87%, presisi 94,27%, recall 89,20%, dan F1-score 91,66%. Temuan ini menegaskan bahwa pendekatan yang diimplementasikan berkinerja baik dalam mendeteksi risiko stroke meskipun ada set data yang tidak seimbang. Tujuan dari penelitian ini adalah untuk menginformasikan kemajuan teknologi deteksi dini stroke yang lebih kuat dan mendukung peningkatan dalam penyediaan layanan kesehatan.
Keywords
References
F. N. Syahreza, Puspita Nurul Sabrina, and Edvin Ramadhan, “PREDIKSI PENYAKIT STROKE MENGGUNAKAN METODE K-NEAREST NEIGHBOUR DAN INFORMATION GAIN,” JATI (Jurnal Mahasiswa Teknik Informatika), vol. 8, no. 6, pp. 11354–11359, Nov. 2024, doi: https://doi.org/10.36040/jati.v8i6.11427.
P. N. Srinivasu, U. Sirisha, K. Sandeep, S. P. Praveen, L. P. Maguluri, and T. Bikku, “An Interpretable Approach with Explainable AI for Heart Stroke Prediction,” Diagnostics, vol. 14, no. 2, p. 128, Jan. 2024, doi: https://doi.org/10.3390/diagnostics14020128.
D. U. maula Rachmad, H. Oktavianto, and M. Rahman, “Perbandingan Metode K-Nearest Neighbors dan Gaussian Naive Bayes untuk Klasifikasi Penyakit Stroke,” Jurna Smart Teknologi, vol. 3, no. 4, pp. 405–412, May 2022, Accessed: Apr. 30, 2025. [Online]. Available: https://jurnal.unmuhjember.ac.id/index.php/JST/article/view/7601
A. Byna and M. Basit, “PENERAPAN METODE ADABOOST UNTUK MENGOPTIMASI PREDIKSI PENYAKIT STROKE DENGAN ALGORITMA NAÏVE BAYES,” Jurnal Sisfokom (Sistem Informasi dan Komputer), vol. 9, no. 3, Aug. 2020, doi: https://doi.org/10.32736/sisfokom.v9i3.1023.
M. N. Maskuri, K. Sukerti, and R. M. H. Bhakti, “Penerapan Algoritma K-Nearest Neighbor untuk Memprediksi Penyakit Stroke,” Jurnal Ilmiah Intech : Information Technology Journal of UMUS, vol. 4, no. 1, pp. 130–140, May 2022, Accessed: Apr. 26, 2025. [Online]. Available: https://jurnal.umus.ac.id/index.php/intech/article/view/751
P. W. S. Aji, S. Supriyanto, and R. Dijaya, “Prediksi Penyakit Stroke Menggunakan Metode Random Forest,” KESATRIA Jurnal Penerapan Sistem Informasi (Komputer & Manajemen), vol. 4, no. 4, pp. 916–924, Oct. 2023, Accessed: Apr. 20, 2025. [Online]. Available: https://tunasbangsa.ac.id/pkm/index.php/kesatria/article/view/242
Novianti Puspitasari, Anindita Septiarini, and Abdul Razak Aliudin, “METODE K-NEAREST NEIGHBOR DAN FITUR WARNA UNTUK KLASIFIKASI DAUN SIRIH BERDASARKAN CITRA DIGITAL,” Prosisko/Prosisko: jurnal pengembangan riset dan observasi sistem komputer, vol. 10, no. 2, pp. 165–172, Aug. 2023, doi: https://doi.org/10.30656/prosisko.v10i2.6924.
Z. Umar, None Dityo Kreshna Argeshwara, None Aji Prasetya Wibawa, A. Nur, and S. Hadi, “Pemodelan Sistem Deteksi Kadar Unsur Hara Tanah Berdasarkan Nilai NPK Menggunakan Metode Fuzzy Mamdani,” Jurnal sains dan informatika, pp. 77–88, Aug. 2023, doi: https://doi.org/10.34128/jsi.v9i1.523.
V. Saini, L. Guada, and D. R. Yavagal, “Global Epidemiology of Stroke and Access to Acute Ischemic Stroke Interventions,” Neurology, vol. 97, no. 20 Supplement 2, pp. S6–S16, Nov. 2021, doi: https://doi.org/10.1212/WNL.0000000000012781.
Tanapol Kosolwattana, C. Liu, R. Hu, S. Han, H. Chen, and Y. Lin, “A self-inspected adaptive SMOTE algorithm (SASMOTE) for highly imbalanced data classification in healthcare,” vol. 16, no. 1, Apr. 2023, doi: https://doi.org/10.1186/s13040-023-00330-4.
Amir Reza Salehi and Majid Khedmati, “A cluster-based SMOTE both-sampling (CSBBoost) ensemble algorithm for classifying imbalanced data,” Scientific reports (Nature Publishing Group), vol. 14, no. 1, Mar. 2024, doi: https://doi.org/10.1038/s41598-024-55598-1.
Z. Zhou, C. Xu, Y. Qiao, J. Xiong, and J. Yu, “Enhancing Equipment Health Prediction with Enchanced SMOTE-KNN,” JIEAS Journal of Industrial Engineering and Applied Science, vol. 2, no. 2, Apr. 2024, Accessed: Mar. 25, 2025. [Online]. Available: https://www.suaspress.org/ojs/index.php/JIEAS/article/view/v2n2a03
M. Khushi et al., “A Comparative Performance Analysis of Data Resampling Methods on Imbalance Medical Data,” IEEE Access, vol. 9, pp. 109960–109975, 2021, doi: https://doi.org/10.1109/access.2021.3102399.
Oladunjoye John Abiodun and A. I. Wreford, “Stroke Prediction Using Smote for Data Balancing, XGBoost and KNN Ensemble Algorithms,” Deleted Journal, pp. 42–53, Aug. 2023, doi: https://doi.org/10.56557/japsi/2023/v15i18349.
H. M. Merdas, “Elastic Net – MLP – SMOTE (EMS)-Based Model for Enhancing Stroke Prediction,” Medinformatics, pp. 73–78, Apr. 2024, doi: https://doi.org/10.47852/bonviewmedin42022470.
F. Yagin, I. Cicek, and Z. Kucukakcali, “Classification of stroke with gradient boosting tree using smote-based oversampling method,” Medicine Science | International Medical Journal, vol. 10, no. 4, p. 1510, 2021, doi: https://doi.org/10.5455/medscience.2021.09.322.
M. A. Aish, F. Nasim, K. I. Ali, S. Akhter, and S. Azeem, “Improving Stroke Prediction Accuracy through Machine Learning and Synthetic Minority Over-sampling,” Journal of Computing & Biomedical Informatics, vol. 7, no. 02, Sep. 2024, Accessed: Apr. 26, 2025. [Online]. Available: https://jcbi.org/index.php/Main/article/view/566/469
K. Swain et al., “Enhancing Stroke Prediction Using LightGBM With SMOTE-ENN and Fine-Tuning: A Comprehensive Analysis,” Cureus Journals, Dec. 2024, doi: https://doi.org/10.7759/s44389-024-02268-y.
Fitri Handayani and Reny Medikawati Taufiq, “Komparasi Algoritma Menggunakan Teknik Smote Dalam Melakukan Klasifikasi Penyakit Stroke Otak,” Jurnal CoSciTech (Computer Science and Information Technology), vol. 5, no. 2, pp. 367–372, Aug. 2024, doi: https://doi.org/10.37859/coscitech.v5i2.7439.
F. Fadmadika, H. H. Handayani, T. A. Mudzakir, and J. Indra, “PENGARUH SMOTE TERHADAP PERFORMA ALGORITMA RANDOM FOREST DAN ALGORITMA GRADIENT BOOSTING DALAM MEMPREDIKSI PENYAKIT STROKE,” Jurnal Teknik Informasi dan Komputer (Tekinkom), vol. 7, no. 2, p. 837, Dec. 2024, doi: https://doi.org/10.37600/tekinkom.v7i2.1575.
L. Pasiolo, I. Afrianty, E. Budianita, and R. Abdillah, “PENERAPAN TEKNIK SMOTE PADA KLASIFIKASI PENYAKIT STROKE DENGAN ALGORITMA SUPPORT VECTOR MACHINE,” ZONAsi: Jurnal Sistem Informasi, vol. 7, no. 1, Jan. 2025, Accessed: Apr. 23, 2025. [Online]. Available: https://journal.unilak.ac.id/index.php/zn/article/view/24731
Desti Mualfah, Wahyu Fadila, and Rahmad Firdaus, “Teknik SMOTE untuk Mengatasi Imbalance Data pada Deteksi Penyakit Stroke Menggunakan Algoritma Random Forest,” Jurnal CoSciTech (Computer Science and Information Technology), vol. 3, no. 2, pp. 107–113, Aug. 2022, doi: https://doi.org/10.37859/coscitech.v3i2.3912.
Oladunjoye John Abiodun and A. I. Wreford, “Stroke Prediction Using Smote for Data Balancing, XGBoost and KNN Ensemble Algorithms,” Deleted Journal, pp. 42–53, Aug. 2023, doi: https://doi.org/10.56557/japsi/2023/v15i18349.
K. Akmal, A. Faqih, and Fatihanursari Dikananda, “PERBANDINGAN METODE ALGORITMA NAÏVE BAYES DAN K-NEAREST NEIGHBORS UNTUK KLASIFIKASI PENYAKIT STROKE,” JATI (Jurnal Mahasiswa Teknik Informatika), vol. 7, no. 1, pp. 470–477, Mar. 2023, doi: https://doi.org/10.36040/jati.v7i1.6367.
M. Hafidz Ariansyah, Sri Winarno, Esmi Nur Fitri, and Retha, “Multi-Layer Perceptron For Diagnosing Stroke With The SMOTE Method In Overcoming Data Imbalances,” Innovation in Research of Informatics (INNOVATICS), vol. 5, no. 1, Mar. 2023, doi: https://doi.org/10.37058/innovatics.v5i1.6565.
H. Hairani, K. E. Saputro, and S. Fadli, “K-means-SMOTE for handling class imbalance in the classification of diabetes with C4.5, SVM, and naive Bayes,” Jurnal Teknologi dan Sistem Komputer, vol. 8, no. 2, pp. 89–93, Feb. 2020, doi: https://doi.org/10.14710/jtsiskom.8.2.2020.89-93.
N. Y. Paramitha, A. Nuryaman, A. Faisol, E. Setiawan, and D. E. Nurvazly, “Klasifikasi Penyakit Stroke Menggunakan Metode Naive Bayes,” Jurnal Siger Matematika, vol. 04, no. 01, Mar. 2023, Accessed: Apr. 19, 2025. [Online]. Available: https://jsm.fmipa.unila.ac.id/index.php/jsm/article/view/33
H. Siregar, A. Tumanggor, and None Akhwan Rahmadani, “Penerapan K-Nearest Neighbors (KNN) dalam Memprediksi dan Menghitung Akurasi Data Penyakit Stroke,” Jurnal Penelitian Rumpun Ilmu Teknik, vol. 2, no. 4, pp. 146–154, Nov. 2023, doi: https://doi.org/10.55606/juprit.v2i4.3040.
“Stroke pada Lansia di Indonesia: Gambaran Faktor Risiko Berdasarkan Gender (SKI 2023),” Jurnal Biostatistik, Kependudukan, dan Informatika Kesehatan, vol. 5, no. 1, Dec. 2024, doi: https://doi.org/10.7454/bikfokes.v5i1.1092.
Siti Retno Wulandari, “Pembiayaan Penyakit Stroke Masih Tinggi Hingga Rp5,2 Triliun,” Mediaindonesia.com, Oct. 25, 2024. https://mediaindonesia.com/humaniora/712147/pembiayaan-penyakit-stroke-masih-tinggi-hingga-rp52-triliun (accessed Apr. 30, 2025).J. Padhye, V. Firoiu, &D. Towsley, “A stochastic model of TCP Reno congestion avoidance and control,” Univ. of Massachusetts, Amherst, MA, CMPSCI Tech. Rep. 99-02, 199
DOI: https://doi.org/10.30591/jpit.v10i4.8809
Refbacks
- There are currently no refbacks.

This work is licensed under a Creative Commons Attribution 4.0 International License.
JPIT INDEXED BY
![]() | ![]() | ![]() | ![]() |
![]() | ![]() | ![]() | |

This work is licensed under a Creative Commons Attribution 4.0 International License.









