KOMBINASI TOMEK-LINK DAN SMOTE UNTUK MENGATASI KETIDAKSEIMBANGAN KELAS PADA CREDIT CARD FRAUD

research
  • 17 Mar
  • 2023

KOMBINASI TOMEK-LINK DAN SMOTE UNTUK MENGATASI KETIDAKSEIMBANGAN KELAS PADA CREDIT CARD FRAUD

Meningkatnya aktivitas perdagangan secara online atau e-commerce telah menjadi trend saat ini. Akibatnya kejahatan yang paling sering terjadi adalah penipuan kartu kredit (credit card fraud) atau carding. Kurang lebih terdapat 1.000 kasus penipuan dalam satu juta transaksi sehingga data tersebut dikumpulkan dalam bentuk dataset
credit card fraud risk. Pada beberapa kasus, kelas minoritas justru lebih penting untuk diidentifikasi daripada kelas mayoritas seperti pada kasus transaksi credit card. Pada penelitian ini untuk menangani masalah ketidakseimbangan kelas pada dataset credit card fraud risk maka diusulkan metode resampling yaitu pendekatan
level data Tomek-Link dan SMOTE dengan model klasifikasi C5.0. Penelitian ini dilakukan untuk meningkatkan nilai akurasi AUC pada model algoritma klasifikasi C5.0. Hasil penelitian menunjukkan bahwa metode usulan mampu meningkatkan nilai AUC sebesar 0,134 dibandingkan tanpa metode resampling.

Unduhan

  • Tesis_Deni Risdiansyah_14001911.pdf

    KOMBINASI TOMEK-LINK DAN SMOTE UNTUK MENGATASI KETIDAKSEIMBANGAN KELAS PADA CREDIT CARD FRAUD

    •   diunduh 425x | Ukuran 1,304 KB

 

REFERENSI

Akbar, Muhammad Sonhaji, and Siti Rochimah. 2017. “Prediksi Cacat Perangkat Lunak Dengan Optimasi Naive Bayes Menggunakan Gain Ratio.” Jurnal Sistem dan Informatika: 147–55.


Arora, Ishani, Vivek Tetarwal, and Anju Saha. 2015. “Open Issues in Software Defect Prediction.” Procedia Computer Science 46(Icict 2014): 906–12. Attenberg, Josh, and S Ertekin. 2013. “Class Imbalance and Active Learning.” Imbalanced Learning: Foundations, Algorithms, and Applications (iii): 101–49.


Batuwita, R, and V Palade. 2010. “Efficient Resampling Methods for Training Support Vector Machines with Imbalanced Datasets.” Neural Networks (IJCNN), The 2010 Bekkar, Mohamed, Hassiba Kheliouane

Djemaa, and Taklit Akrouf Alitouche. 2013. “Evaluation Measures for Models Assessment over Imbalanced Data Sets.” Journal of Information Engineering and Applications 3(10): 27–38.

Chawla, N, K W Bowyer, L O Hall, and W P Kegelmeyer. 2002. “SMOTE:Synthetic Minority over-Sampling Technique.” Journal of Artificial Intelligence Research 16: 321–57.

Dawson, Christian W. 2009. 2 Information Systems Journal Projects in Computing and Information Systems.

Dewi, Sari. 2016. “Komparasi 5 Metode E Algoritma Klasifikasi Data Mining Pada Prediksi Keberhasilan Pemasaran Produk Layanan Perbankan.” Jurnal Techno Nusa Mandiri XIII(1): 60–66.

Elhassan, T, M Aljurf, F Al-Mohanna, and M Shoukri. 2016. “Classification of Imbalance Data Using Tomek Link(T-Link) Combined with Random UnderSampling (RUS) as a Data Reduction Method.” Journal of Informatics and Data Mining 1(2): 1–12.

Fitkov, Elena, and Sakinat Oluwabukonla. 2013. “Engineering Applications of Neural Networks: 14th International Conference, EANN 2013 Halkidiki, Greece, September 13-16, 2013 Proceedings, Part I.” Communications in Computer and Information Science 383 CCIS(PART 1). Gorunescu, Florin. 2011. Springer-Verlag Data Mining: Concepts and Techniques. Berlin.

Japkowi, Nathalie, and Shaju Stephen. 2002. “The Class Imbalance Problem: A Systemati Study.” Intelligent Data Analysis 6(5): 203–31. 57

Kuhn, Max. 2008. “Caret Package.” Journal Of Statistical Software 28(5): 1–26. Kuhn, Max, and Kjell Johnson. 2013. Applied Predictive Modeling [Hardcover]. Lin, Wei Chao, Chih Fong Tsai, Ya Han Hu, and

Jing Shang Jhang. 2017. “Clustering-Based Undersampling in Class-Imbalanced Data.” Information Sciences 409–410: 17–26.

Rahman, M Mostafizur, and D N Davis. 2013. “Cluster Based Under-Sampling for Unbalanced Cardiovascular Data.” Proceedings of the World Congress on Engineering 2013 III: 1–6.


Sabaruddin, Raja. 2018. “KETIDAKSEIMBANGAN KELAS PADA PREDIKSI.”

Shirali, Rohan. 2016. “Classification Trees and Rule-Based Modeling Using the C5. 0 Algorithm for Self-Image Across Sex and Race in St . Louis.”

Sun, Zhongbin et al. 2015. “A Novel Ensemble Method for Classifying Imbalanced Data.” Pattern Recognition 48(5): 1623–37.
Vercellis, Carlo. 2009. Business Intelligence : Data Mining and Optimization for Decision Making. John Wiley & Sons, Ltd.


Wahono, Romi Satria, Nanna Suryana, and Sabrina Ahmad. 2014. “Metaheuristic Optimization Based Feature Selection for Software Defect Prediction.” Journal of Software 9(5): 1324–33.
Zhang, D., Liu, W., Gong, X., & Jin, H. (2011). A Novel Improved SMOTE Resampling Algorithm Based on Fractal. Computational Information Systems, 2204-2211