Prosiding

research
  • 24 Feb
  • 2023

Prosiding

The current development of technology is quite rapidly not disengaged in a large
data processor covering of all areas such as information technology, computer science, medicine,
finance and other. This brings a large computing effect in identifying the processing of data. In
data analysis for very large data, data processing is very much needed, in this study the authors
propose data mining method as a solution to a large data processing problem, data mining is
divided into several techniques including classification method techniques that aims to classify
large amounts of data to be relevant data information. In this study the authors compared
5 algorithms in the classification method to get better performance in classification problems.
Researchers analyze and test 5 Algorithm classifications with 4 different datasets as a tool in the
problem of large data classification. .The results of this research show the method SVM is much
better to be used 4 comparison methods in calculating the value of AUC by using 4 datasets of
UCI Repository. The LSVT Dataset shows the highest AUC value with 0973, Ionsphere 0887,
Sonar 0897, Heartstatlog 0868.

Unduhan

  • Artikel.pdf

    Prosiding-Performance Comparison and Optimized Algorithm Classification

    •   diunduh 163x | Ukuran 689 KB

 

REFERENSI

[1] H. Jiawei, M. Kamber, J. Han, M. Kamber, and J. Pei, Data Mining: Concepts and Techniques. 2012.
[2] E. Turban, J. E. Aronson, and T. Liang, “Decision Support Systems and Intelligent Systems,” 2007.
[3] D. T. Larose, Data Mining Methods and Models. 2006.
[4] S. Dewi, “KOMPARASI 5 METODE ALGORITMA KLASIFIKASI DATA MINING PADA PREDIKSI
KEBERHASILAN PEMASARAN PRODUK LAYANAN PERBANKAN,” vol. XIII, no. 1, pp. 60–66, 2016.
[5] W. J. Frawley, G. Piatetsky-shapiro, and C. J. Matheus, “Knowledge Discovery in Databases: An Overview,”
vol. 13, no. 3, pp. 57–70, 1992
[6] F. Gorunescu, Data mining: concepts and techniques. 2011.
[7] J. Wu, Advances in K-means Clustering, vol. 53, no. 9. 2013.
[8] H. B. Zheng, R. J. Liao, S. Grzybowski, and L. J. Yang, “Fault diagnosis of power transformers using multiclass least square support vector machines classifiers with particle swarm optimisation,” IET Electr. Power
Appl., vol. 5, no. 9, p. 691, 2011.
[9] N. Deng, Y. Tian, and C. Zhang, Support Vector Machines. 2013.
[10] Y. Liu, X. Yu, J. X. Huang, and A. An, “Combining integrated sampling with SVM ensembles for learning
from imbalanced datasets,” Inf. Process. Manag., vol. 47, no. 4, pp. 617–631, 2011.
[11] J. Han, M. Kamber, and J. Pei, Data Mining Concepts and Techniques. 2012.
[12] I. H. Witten, E. Frank, and M. a. Hall, Data Mining Practical Machine Learning Tools and Techniques Third
Edition, vol. 277, no. Tentang Data Mining. 2011.
[13] C. J. Mantas and J. Abell´an, “Credal-C4.5: Decision tree based on imprecise probabilities to classify noisy
data,” Expert Syst. Appl., vol. 41, no. 10, pp. 4625–4637, 2014