Improving The Effectiveness of Classication Using The Data Level Approach and Feature Selection Techniques in Online Shoppers Purchasing Intention Prediction

research
  • 25 Dec
  • 2020

Improving The Effectiveness of Classication Using The Data Level Approach and Feature Selection Techniques in Online Shoppers Purchasing Intention Prediction

Online shopping is a form of trading using electronic devices that allows consumers to buy goods or services from sellers via the internet. Other names for these activities are: e-web-shop, e-shop, e-shop, internet shop, web-shop, web-store, online shop, and virtual shop. An online store generates purchases of products or services at retailers or shopping centers, which are referred to as business-to-consumer (B2C) online shopping. n another process where a business buys from another business, it is called business-to-business (B2B). Nowadays online shopping has become more sophisticated with trading via mobile phones (m-commerce). Cellular phones have been optimized with an application to buy from online sites. In this study, we proposed a data level approach and feature selection techniques as a solution for the classification of imbalanced data. The imbalance class classification is one of the classic problems in the field of artificial intelligence, especially for classification in machine learning. Imbalanced data have been proven to reduce the performance of machine learning algorithms, where imbalance data means that the total data from each class is significantly different. The proposed method is evaluated using a dataset from the UCI repository and area under the curve (AUC) as the main evaluation. The results have shown that the proposed method produces good performance. (AUC¿ 0.8). Overall the second experiment outperformed and was better than the first and third experiments because the main evaluation in the unbalanced class classification is AUC. Therefore, it can be concluded that the proposed method produces optimal performance both for large scale data sets. Overall the second experiment outperformed and better than the first and third experiments, because the main evaluation in the unbalanced class classification was AUC

Unduhan

 

REFERENSI

[1] S. J. Kim, R. J. H. Wang, and E. C. Malthouse, 2015, \The E ects of Adopting and Using a Brand's Mobile

Application on Customers' Subsequent Purchase Behavior," J. Interact. Mark., vol. 31, no. 2015, p. 28{41.

[2] J. Martins, C. Costa, T. Oliveira, R. Goncalves, and F. Branco, 2019, \How smartphone advertising in

uences

consumers' purchase intention," J. Bus. Res., vol. 94, no. December 2017, p. 378{387.

[3] B. Ramkumar and B. Ellie Jin, 2019, \Examining pre-purchase intention and post-purchase consequences

of international online outshopping (IOO): The moderating e ect of E-tailer's country image," J. Retail.

Consum. Serv., vol. 49, no. March, p. 186{197.

[4] TimeTrade, \The State of Retail Report 2017," 30 March, 2017.

[5] Fanny and T. W. Cenggoro, 2018, \Deep Learning for Imbalance Data Classi cation using Class Expert Generative Adversarial Network".

[6] A. Wijaya dan R. S.Wahono, 2017, \Tackling Imbalanced Class In Software Defect Prediction Using Two-Step

Cluster Based Random Undersampling And Stacking Technique," J. Teknol., no. November, p. 45{50.

[7] C. Pradhan and A. Gupta, 2017, \Ship detection using Neyman-Pearson criterion in marine environment,"

Ocean Eng., vol. 143, no. March 2016, p. 106{112.

[8] X. Tao et al., 2019, Self-adaptive cost weights-based support vector machine cost-sensitive ensemble for

imbalanced data classi cation, vol. 487. Elsevier Inc.

[9] T. Zhu, Y. Lin, Y. Liu, W. Zhang, and J. Zhang, 2018, \Minority oversampling for imbalanced ordinal

regression," Knowledge-Based Syst.

[10] C. Tsai, W. Lin, Y. Hu, and G. Yao, 2018, \Under-Sampling Class Imbalanced Datasets by Combining

Clustering Analysis and Instance Selection Chih-Fong," Inf. Sci. (Ny).

[11] D. Z. Li, W. Wang, dan F. Ismail, 2015, \Neurocomputing A selective boosting technique for pattern classi

cation," vol. 156, hal. 186{192.

[12] W. W. Y. Ng, X. Zhou, X. Tian, X. Wang, and D. S. Yeung, 2017, \Bagging-boosting-based semi-supervised

multi-hashing with query-adaptive re-ranking," Neurocomputing, vol. 0, hal. p.

[13] A. R. Hassan and A. Haque, 2016, \An expert system for automated identi cation of obstructive sleep apnea

from single-lead ECG using random under sampling boosting," Neurocomputing.

[14] B. S. Raghuwanshi dan S. Shukla, 2019, \Neurocomputing Class imbalance learning using UnderBagging

based kernelized extreme learning machine," Neurocomputing, vol. 329, hal. 172{187.

[15] I. H. Laradji, M. Alshayeb, and L. Ghouti, 2015, \Software defect prediction using ensemble learning on

selected features," Inf. Softw. Technol., vol. 58, hal. 388{402.

[16] Z. A. Rana, M. A. Mian, and S. Shamail, 2015, \Improving Recall of software defect prediction models using

association mining," Knowledge-Based Syst., vol. 90, p. 1{13.

[17] G. Czibula, Z. Marian, and I. G. Czibula, 2014, \Software defect prediction using relational association rule

mining," Inf. Sci. (Ny)., vol. 264, p. 260{278.

[18] R. S. Wahono and N. S. Herman, 2014, \Genetic feature selection for software defect prediction," Adv. Sci.

Lett., vol. 20, no. 1, p. 239{244.

[19] O. F. Arar and K. Ayan, 2015, \Software defect prediction using cost-sensitive neural network," Appl. Soft

Comput. J., vol. 33, p. 263{277.

[20] F. Cheng, X. Zhang, C. Zhang, J. Qiu, and L. Zhang, 2018, \An Adaptive Mini-Batch Stochastic Gradient

Method for AUC Maximization," Neurocomputing, p. 2{25.

[21] C. O. Sakar, S. O. Polat, M. Katircioglu, and Y. Kastro, 2019, \Real-time prediction of online shoppers'

purchasing intention using multilayer perceptron and LSTM recurrent neural networks," Neural Comput. Appl., vol. 31, no. 10, p. 6893{6908.