Analysis of the K-Means Algorithm on Clean Water Customers Based on the Province

research
  • 13 May
  • 2020

Analysis of the K-Means Algorithm on Clean Water Customers Based on the Province

One of the important needs of environmental health is clean water. Clean water is the most important necessity of living beings in supporting survival. The study aimed to cluster the number of cleaned water customers by province (1995-2015). The method used is data mining clustering using k-means. The sample data used 34 provinces with attribute assessment of the number of cleaned water customers by province. The clustering process is done with 3 clusters, namely (C1) Cluster High, (C2) Cluster Normal and (C3) Cluster Low, for the number of cleaned water customers who are low on the need of clean water. The results showed, C1: 6 provinces, C2: 4 provinces and C3: 24 provinces. The end centroid values used are: C1 (296587.22), C2 (995898.56) and C3 (70832.29). The results obtained on the Davies-Bouldin index for "the number of cleaned water consumers" are -0.470. based on performance results, it can be concluded that k-means algorithm is best because it has the smallest Davies-Bouldin index value. Based on research results, 70% of Indonesian people are still low awareness of the need for clean water

Unduhan

 

REFERENSI

[1]    Kepala Badan Pusat Statistik Republik Indonesia, Water Supply Statistics. BPS-Statistics Indonesia.
[2]    O. J. Oyelade, O. O. Oladipupo, and I. C. Obagbuwa, “Application of k Means Clustering algorithm for prediction of Students Academic Performance,” Int. J. Comput. Sci. Inf. Secur., vol. 7, no. 1, pp. 292– 295, 2010.
[3]    S. Kumar and S. K. Rathi, “Performance Evaluation of K-Means Algorithm and Enhanced Mid-point based K-Means Algorithm on Mining Frequent Patterns,” Int. J. Adv. Res. Comput. Sci. Softw. Eng., vol. 4, no. 10, pp. 545–548, 2014.
[4]    A. Yadav and S. Dhingra, “An Enhanced K-Means Clustering Algorithm to Remove Empty Clusters,”
IJEDR, vol. 4, no. 4, pp. 901–907, 2016.
[5]    N. Aggarwal, K. Aggarwal, and K. Gupta, “Comparative Analysis of K-means and Enhanced K-means Clustering Algorithm for Data Mining,” Int. Jouranl Sci. Eng. Res., vol. 3, no. 3, 2012.
[6]    T. M. Kodinariya and P. R. Makwana, “Review on determining number of Cluster in K-Means Clustering,” International Journal of Advance Research in Computer Science and Management Studies, 2013. .
[7]    U. R. Raval and C. Jani, “Implementing and Improvisation of K-means Clustering,” Int. J. Comput. Sci. Mob. Comput., vol. 5, no. 5, pp. 72–76, 2016.
[8]    M. K. Arzoo, A. Prof, and K. Rathod, “K-Means algorithm with different distance metrics in spatial data mining with uses of NetBeans IDE 8 . 2,” Int. Res. J. Eng. Technol., vol. 4, no. 4, pp. 2363–2368, 2017.
[9]    A. P. Windarto, “Implementation of Data Mining on Rice Imports by Major Country of Origin Using Algorithm Using K-Means Clustering Method,” Int. J. Artif. Intell. Res., vol. 1, no. 2, pp. 26–33, 2017.
[10]    B. Supriyadi, A. P. Windarto, T. Soemartono, and Mungad, “Classification of natural disaster prone areas in Indonesia using K-means,” Int. J. Grid Distrib. Comput., vol. 11, no. 8, pp. 87–98, 2018.