Comparison and Evaluation of Euclidean Distance and Dice Distance in the K-Means Adaptive Algorithm for Clustering Composite Indexes of Food Security and Vulnerability Maps

Authors

  • Emma Romasta Naulina Nainggolan, Faculty of Computer Science, University Katolik Santo Thomas, Medan, Indonesia
  • Paska Marto Hasugian, Faculty of Computer Science, University Katolik Santo Thomas, Medan, Indonesia

DOI:

https://doi.org/10.58471/jds.v3i2.6941

Keywords:

Adaptive K-Means, Euclidean Distance, Dice Distance, Silhouette Score, Clustering, Food Resilience and Vulnerability Index

Abstract

This study compares and evaluates two distance measures, Euclidean Distance and Dice Distance, in the Adaptive K-Means algorithm for clustering Food Security and Vulnerability Composite Index data. The dataset covers index data from 2022 to 2024 and comprises 305 entries, reduced to 298 entries after cleaning. The evaluation was carried out manually on a sample of the data and automatically on the entire dataset using Python in Google Colab. Cluster quality was assessed with the Silhouette Score. The Euclidean method produced an average Silhouette Score of 0.3082, indicating a suboptimal cluster structure. The study concludes that the choice of distance measure significantly influences clustering results and should be tailored to the characteristics of the data.
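
The comparison described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not state which formulation of Dice Distance was used, so the sketch assumes the common continuous generalization, 1 - 2(x·y) / (|x|² + |y|²). The function names, the toy points, and the cluster labels are hypothetical and are not taken from the study's dataset. The Silhouette Score follows the standard definition: for each point, a is the mean distance to the other members of its own cluster, b is the smallest mean distance to any other cluster, and the point's score is (b - a) / max(a, b).

```python
import math


def euclidean_distance(x, y):
    """Straight-line distance between two numeric vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))


def dice_distance(x, y):
    """Dice distance for real-valued vectors.

    Assumption: the continuous generalization of the Dice coefficient,
    1 - 2 * <x, y> / (|x|^2 + |y|^2). The paper may use another variant.
    """
    denom = sum(a * a for a in x) + sum(b * b for b in y)
    if denom == 0:
        return 0.0  # two all-zero vectors are treated as identical
    return 1.0 - 2.0 * sum(a * b for a, b in zip(x, y)) / denom


def silhouette_score(points, labels, distance):
    """Mean silhouette coefficient over all points for a given distance."""
    clusters = sorted(set(labels))
    if len(clusters) < 2:
        raise ValueError("silhouette score needs at least two clusters")
    scores = []
    for i, p in enumerate(points):
        # Mean distance from p to every cluster, excluding p itself.
        mean_dist = {}
        for c in clusters:
            members = [q for j, q in enumerate(points)
                       if labels[j] == c and j != i]
            if members:
                mean_dist[c] = sum(distance(p, q) for q in members) / len(members)
        own = labels[i]
        if own not in mean_dist:
            scores.append(0.0)  # singleton cluster: score is 0 by convention
            continue
        a = mean_dist[own]
        b = min(d for c, d in mean_dist.items() if c != own)
        denom = max(a, b)
        scores.append((b - a) / denom if denom > 0 else 0.0)
    return sum(scores) / len(scores)


# Hypothetical toy data: four 2-D points already assigned to two clusters.
toy_points = [(0.1, 0.2), (0.2, 0.1), (0.9, 0.8), (0.8, 0.9)]
toy_labels = [0, 0, 1, 1]

for name, fn in [("Euclidean", euclidean_distance), ("Dice", dice_distance)]:
    print(name, round(silhouette_score(toy_points, toy_labels, fn), 4))
```

Passing the distance as a function makes it possible to score the same cluster assignment under both measures, which mirrors the kind of comparison the study reports. Scores near 1 indicate well-separated clusters, while scores near 0 indicate overlapping clusters.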

Published

2025-09-03

How to Cite

Emma Romasta Naulina Nainggolan, & Paska Marto Hasugian. (2025). Comparison and Evaluation of Euclidean Distance and Dice Distance in the K-Means Adaptive Algorithm for Clustering Composite Indexes of Food Security and Vulnerability Maps. Journal Of Data Science, 3(02), 106–118. https://doi.org/10.58471/jds.v3i2.6941