The dataset of software metrics, in general, are not balanced (Imbalanced). Class imbalance in Dataset can reduce the performance of software defect prediction models, because it tends to produce majority class predictions from minority classes, the dataset used in this study uses the National Aeronautics and Space Administration (NASA) Metrics Data Program (MDP), dataset From Stages Pre-processing proposed the Particle Swarm Optimization (PSO). method to overcome the problem of attributes in the training data and the Random Over Sampling (ROS) Resampling method. to deal with class imbalances. This study proposes that the Random Forest method combined with Adaboost can estimate the level of disability of software through training data. The results of this study indicate that the Resampling + Adaboost + Random Forest algorithm can be used to predict software defects with an average accuracy of 94.70% and a value of AUC 0.939. While the PSO + Random Forest algorithm only has an average accuracy of 89.60% and AUC 0.636 the difference in the accuracy of the two models is 5.10% and AUC 0.303. Statistical tests show that there is a significant influence between the proposed model and the Random Forest model with a p-value (0.036) smaller than the alpha value (0.05), which means there is a significant difference between the two models.
Penerapan PSO Over Sampling Dan Adaboost Random Forest Untuk Memprediksi Cacat Software
Penerapan PSO Over Sampling Dan Adaboost Random Forest Untuk Memprediksi Cacat Software
Adegoke, V. F., Chen, D., Banissi, S., & Banissi, E. (2017). Predictive Ensemble Modelling -Experimental Comparison of Boosting Implementation Methods. Proceedings - UKSim-AMSS 11th European Modelling Symposium on Computer Modelling and Simulation, EMS 2017, 11–16. https://doi.org/10.1109/EMS.2017.13
Amalia, H., Lestari, A. F., & Puspita, A. (2017). Penerapan Metode Svm Berbasis Pso Untuk Penentuan Kebangkrutan Perusahaan. Jurnal Techno Nusa Mandiri, 14(2), 131–136.
Biau, G., & Scornet, E. (2016). A random forest guided tour. Test, 25(2), 197–227. https://doi.org/10.1007/s11749-016-0481-7
Chopard, B., & Tomassini, M. (2018). Particle Swarm Optimization. In: An Introduction to Metaheuristics for Optimization. Natural Computing Series. Springer, Cham, 23(2), 145–156. https://doi.org/10.1007/978-3-319-93073-2
Hofman, R. (2011). Behavioral economics in software quality engineering. Empirical Software Engineering, 16(2), 278–293. https://doi.org/10.1007/s10664-010-9140-x
Laradji, I. H., Alshayeb, M., & Ghouti, L. (2014). Software defect prediction using ensemble learning on selected features. INFORMATION AND SOFTWARE TECHNOLOGY. https://doi.org/10.1016/j.infsof.2014.07.005
Li, R. H., Yu, J. X., Qin, L., Mao, R., & Jin, T. (2015). On random walk based graph sampling. Proceedings - International Conference on Data Engineering, 2015-May, 927–938. https://doi.org/10.1109/ICDE.2015.7113345
Muslikh, A. R., Santoso, H. A., Marjuni, A., Teknik, P., Universitas, I., & Nuswantoro, D. (2018). Klasifikasi Data Time Series Arus Lalu Lintas. 14, 24–38.
Onan, A., Korukoǧlu, S., & Bulut, H. (2016). Ensemble of keyword extraction methods and classifiers in text classification. Expert Systems with Applications, 57, 232–247. https://doi.org/10.1016/j.eswa.2016.03.045
Rais, A. N., & Subekti, A. (2019). Integrasi SMOTE Dan Ensemble AdaBoost Untuk Mengatasi Imbalance Class Pada Data Bank Direct Marketing. Jurnal Informatika, 6(2), 278–285. https://doi.org/10.31311/ji.v6i2.6186
Saifudin, A., Teknik, F., Pamulang, U., Wahono, R. S., Komputer, F. I., & Nuswantoro, U. D. (2015). Penerapan Teknik Ensemble untuk Menangani Ketidakseimbangan Kelas pada Prediksi Cacat Software. 1(1).
Singleton, A. (2016). The Economics of Microservices. IEEE Cloud Computing, 3(5), 16–20. https://doi.org/10.1109/MCC.2016.109
Tang, X., & Chen, L. (2019). Artificial bee colony optimization-based weighted extreme learning machine for imbalanced data learning. Cluster Computing, 22, 6937–6952. https://doi.org/10.1007/s10586-018-1808-9
Wahono, R. S. (2007). A Systematic Literature Review of Software Defect Prediction: Research Trends, Datasets, Methods and Frameworks. Journal of Software Engineering, 1(1), 1–16. https://doi.org/10.3923/jse.2007.1.12
Wahono, R. S., Suryana, N., & Ahmad, S. (2014). Metaheuristic Optimization based Feature Selection for Software Defect Prediction. 9(5), 1324–1333. https://doi.org/10.4304/jsw.9.5.1324-1333