Classification of Traffic Accident Levels in West Java Using the K-Means Algorithm
Keywords:
Data Mining, K-Means Clustering, Traffic Accidents, CRISP-D, Risk MappingAbstract
Traffic accidents are an important factor in improving public safety in West Java Province, because the population movement rate there is very high. The high number of accidents is directly related to increased deaths and material losses, but the use of historical data is still limited to administrative archiving tasks without any process of identifying regional vulnerability patterns. This study aims to classify accident-prone areas using Data Mining techniques and the K-Means Clustering algorithm, as well as applying the CRISP-DM framework approach. The analyzed dataset comes from the Indonesian National Police Traffic Accident Database (Pusiknas Polri) for the period 2020 to 2025, consisting of 552 observations with indicator variables covering the number of fatalities, serious injuries, and minor injuries. The determination of the most appropriate number of clusters was tested using the Silhouette method to ensure more accurate and objective modeling results. The analysis shows that the number of clusters (k=3) is the most appropriate, with a Silhouette metric value of 0.398. The application of the model produces three levels of risk: the red zone, which indicates high risk with 51 cases and the highest mortality rate; the yellow zone, which indicates moderate risk with 189 cases; and the green zone, which indicates low risk with 312 cases. The visualization of these mapping results is expected to be an important tool for the police and local governments in formulating mitigation policies, improving patrol efficiency, and accelerating infrastructure improvements in high-risk areas, thereby reducing the number of accidents in the future.
References
M. Afdal and R. Prana Disastra, “Analisis Pola Kecelakaan Lalu Lintas Menggunakan Algoritma K-Means dan FP-Growth Studi Kasus: Polres Solok,” Jurnal Ilmiah Rekayasa dan Manajemen Sistem Informasi, vol. 8, no. 1, pp. 31–40, 2022.
A. G. Ramadhan, “Data Mining untuk Segmentasi Pelanggan dengan Algoritma K-Means: Studi Kasus pada Data Pelanggan di Toko Retail,” Syntax Literate, vol. 8, no. 10, 2023, doi: 10.36418/syntax-literate.v6i6.
S. S. Hilabi and B. Huda, “Technoxplore Jurnal Ilmu Komputer dan Teknologi Informasi,” vol. 4, 2019.
F. R. Sari, Tukino, S. S. Hilabi, and B. Priyatna, “Klasifikasi Text Ulasan Pengguna Aplikasi Wondr by BNI Menggunakan Algoritma Naïve Bayes,” Jurnal Teknologi Informasi, vol. 6, no. 2, 2025, doi: 10.46576/djtechno.
S. Setyaningtyas, B. I. Nugroho, and Z. Arif, “Tinjauan Pustaka Sistematis: Penerapan Data Mining Teknik Clustering Algoritma K-Means,” Jurnal Teknoif Teknik Informatika Institut Teknologi Padang, vol. 10(2), pp. 52–61, 2022, doi: Https://Doi.Org/10.21063/Jtif.2022.V10.2.52-61.
I. Ferdiansyah, B. Huda, and A. Hananto, “Analisis Clustering Menggunakan Metode K-Means Pada Kemiskinan Di Jawa Timur Tahun 2020,” INNOVATIVE: Journal Of Social Science Research, vol. 4, pp. 858–869, 2024.
Tukino, R. A. Nanda, Gunawan, S. Wijono, S. Y. J. Prasetyo, and S. Trihandaru, “Analysis Transfer Data Image Processing and Face Recognition Using Camera ESP32CAM Web Browser IoT,” ICIC Express Letters, vol. 17, no. 6, pp. 717–724, 2023, doi: 10.24507/icicel.17.06.717.
I. F. Anshori and Y. Nuraini, “Pengelompokan Data Kecelakaan Lalu Lintas di Kota Tasikmalaya Menggunakan Algoritma K-Means,” Jurnal Responsif, vol. 2, no. 1, pp. 118–127, 2020.
T. A. Permana, O. S. Bachri, and R. M. H. Bhakti, “Pemetaan Wilayah Rawan Kecelakaan Lalu Lintas di Kabupaten Brebes Menggunakan Algoritma K-Means,” ELKOM: Jurnal Elektronika dan Komputer, vol. 18, no. 1, 2025.
D. Safitri, S. S. Hilabi, and F. Nurapriani, “Analisis Penggunaan Algoritma Klasifikasi dalam Prediksi Kelulusan Menggunakan Orange Data Mining,” Rabit: Jurnal Teknologi dan Sistem Informasi Univrab, vol. 8, no. 1, pp. 75–81, 2023, doi: 10.36341/rabit.v8i1.3009.
Y. Aprianti, Tukino, A. L. Hananto, and S. S. Hilabi, “Klasifikasi Sentimen Komentar Pengguna pada Aplikasi Ruangguru Menggunakan Algoritma Naive Bayes Ruangguru menunjukkan inovasi dalam pendekatan Naïve Bayes menjadi alat klasifikasi teks yang populer . Penerapan Penelitian lain oleh Artanti Inez TF-IDF pen,” METIK JURNAL, pp. 101–110, 2025, doi: 10.47002/metik.v9i1.1023.
F. N. Dhewayani, D. Amelia, D. N. Alifah, B. N. Sari, and Jajuli, “Implementasi K-Means Clustering untuk Pengelompokkan Daerah Rawan Bencana Kebakaran Menggunakan Model Crisp-Dm,” Jurnal Teknologi Dan Informasi, vol. 12, pp. 64–77, 2022, doi: 10.34010/jati.v12i1.
A. Saptiani, B. Huda, E. Novalia, and A. B. Purba, “Pengelompokkan Data Obat-Obatan Pada Pelayanan Kesehatan Menggunakan Algoritma K-Means Clustering,” Jurnal Sistem Informasi dan Manajemen, vol. 10(3), 2022.
H. Raslin, “Evaluasi Kualitas Aplikasi Digital Korlantas Polri guna Mendukung Tugas Kepolisian dalam Rangka Penguatan Pelayanan Publik,” Jurnal Litbang Polri, vol. 27, no. 3, pp. 191–215, 2024, doi: 10.46976/litbangpolri.v27i3.245.
M. D. R. P. Dio, B. P. Priyatna, A. L. Hananto, and S. S. Hilabi, “Deteksi Objek Kecelakaan pada Kendaraan Roda Empat Menggunakan Algoritma YOLOv5,” Teknologi, vol. 12, no. 2, pp. 15–26, 2022, doi: 10.26594/teknologi.v12i2.3260.











