Flood Risk Prediction Using the K-Nearest Neighbors (KNN) Method with Machine Learning Optimization in Jakarta Province
Keywords:
Jakarta, K-Nearest Neighbors, Machine Learning, Flood PredictionAbstract
This study aims to predict flood risk in the Jakarta Province using the K-Nearest Neighbors (KNN) algorithm. The model is designed to classify regions into three risk levels—low, moderate, and high—based on historical features such as water level and annual flood frequency. Data from 2013 to 2024 provided by the Jakarta Regional Disaster Management Agency (BPBD) served as the basis for model training. The normalization process using Z-Score and Min-Max Scaling proved essential in enhancing KNN performance. Cross-validation (5-Fold Cross Validation) revealed that K=3 yielded the best results. The model achieved accuracy rates between 96% and 99%, with precision, recall, and F1-score exceeding 95%. These results demonstrate that KNN can reliably map flood risks based on numerical data trends. Predictions for the years 2026 to 2028 indicate an increased flood risk in several areas, particularly Central, North, and West Jakarta, which fall into the high-risk category. East and South Jakarta are classified as moderate to high risk, while the Thousand Islands region is expected to remain in the low-risk category. This model can be used as a foundation for developing data-driven mitigation strategies and early warning systems
Downloads
References
Angreni, D. S., & Pusadan, M. Y. (2024). Klasifikasi curah hujan menggunakan algoritma K-Nearest Neighbor (KNN) di Sulawesi Tengah. Jurnal Teknologi Sistem Informasi dan Aplikasi, 9(4), 2316–2324.
Cumel, D. Z., & Rahmaddeni, S. (2022). Perbandingan metode data mining untuk prediksi banjir dengan algoritma Naïve Bayes dan KNN. SENTIMAS: Seminar Nasional Penelitian dan Pengabdian kepada Masyarakat, 40–48. https://journal.irpi.or.id/index.php/sentimas/article/view/353
https://journal.irpi.or.id/index.php/sentimas/article/download/353/132
Hermawan, E., Panjaitan, S. D., & Ripanti, E. F. (2024). Sistem prediksi banjir rob Kota Pontianak berbasis machine learning menggunakan framework Streamlit. Jurnal Teknologi Sistem Informasi dan Aplikasi, 10(3), 351–361.
Rafi Nahjan, M., Heryana, N., & Voutama, A. (2023). Implementasi RapidMiner dengan metode clustering K-Means untuk analisa penjualan pada Toko Oj Cell. JATI (Jurnal Mahasiswa Teknik Informatika), 7(1), 101–104. https://doi.org/10.36040/jati.v7i1.6094
Restu, A. K. A., Siswa, T. A. Y., & Pranoto, W. J. (2024). Model optimasi KNN-PSORF dalam menangani high dimensional data banjir Kota Samarinda. Jurnal Teknologi Sistem Informasi dan Aplikasi, 7(3), 1289–1299. https://doi.org/10.32493/jtsi.v7i3.41587
Roihan, A., Sunarya, P. A., & Rafika, A. S. (2020). Pemanfaatan machine learning dalam berbagai bidang: Review paper. IJCIT (Indonesian Journal on Computer and Information Technology), 5(1), 75–82. https://doi.org/10.31294/ijcit.v5i1.7951
Sandiwarno, S. (2024). Penerapan machine learning untuk prediksi bencana banjir. Jurnal Sistem Informasi Bisnis, 14(1), 62–76. https://doi.org/10.21456/vol14iss1pp62-76
Wardani, S. D. K., Ariyanto, A. S., Umroh, M., & Rolliawati, D. (2023). Perbandingan hasil metode clustering K-Means, DB Scanner & Hierarchical untuk analisa segmentasi pasar. JIKO (Jurnal Informatika dan Komputer), 7(2), 191. https://doi.org/10.26798/jiko.v7i2.796
Whendasmoro, R. G., & Joseph, J. (2022). Analisis penerapan normalisasi data dengan menggunakan Z-Score pada kinerja algoritma K-NN. JURIKOM (Jurnal Riset Komputer), 9(4), 872. https://doi.org/10.30865/jurikom.v9i4.4526
Yudono, M. A. S., & Akbar, J. (2024). Water level classification for early flood detection using KNN method. Fidelity: Jurnal Teknik Elektro, 6(2), 49–57. https://doi.org/10.52005/fidelity.v6i2.227











