Classification of Science, Technology and Medicine (STM) Domains with PSO and NBC

Lihat/Buka File Repository

Lihat/Buka File Peer Review

Tanggal

2018-08-31

Abstraksi

Science, Technology, and Medicine (STM) is a field of
research that has a characteristic in each document. These
characteristics are different from most documents that are used as a
corpus in mining text research. Documents derived from Newswire
are more dominant in previous research. However, in this study will
try to classify documents from STM field. Complex technical terms,
symbols, position information, and the number of citations would
be a challenge itself. Previous studies have used the Naive Bayes
Classifier (NBC) classification method. There are also those who
apply Particle Swarm Optimization to assist its classification. From
the Newswire field generated a fairly high accuracy Therefore, it
would be applied to the optimization method with PSO and combine
it with NBC method. This study produced accuracy value in
classification model without using PSO equal to 82,73%. While in
the classification model using PSO, the accuracy value is 87.27%.
This shows that the use of PSO optimization is very influential on
the classification.
  

Kata Kunci: STM, Classification, Naive Bayes Classifier, Particle Swarm Optimization.

URI
http://citsm.id/citsm.id/citsm2018/acceptedpapers.php

Bidang ilmu
Data Mining

Bibliografi

[1] R. Feldman and J. Sanger, “The text mining handbook: advanced
approaches in analyzing unstructured data,” 2007.
A. D. Asy’arie and A. W. Pribadi, “Automatic news articles
[2]

classification in Indonesian language by using Naive Bayes
Classifier method,”
Proc. 11th Int. Conf. Inf. Integr. Web-based
Appl. Serv. - iiWAS ’09
, p. 658, 2009.

[3] I. Technology, T. Mining, and P. Lama, “Clustering System Based
on Text Mining Using the K-Means Algorithm,” 2013.
C. Zhai, A. Velivelli, and B. Yu, “A cross-collection mixture model
for comparative text mining,”
Proc. 2004 ACM SIGKDD Int. Conf.
Knowl. Discov. data Min. - KDD ’04
, p. 743, 2004.
E. Junianto and D. Riana, “Penerapan PSO Untuk Seleksi Fitur Pada
Klasifikasi Dokumen Berita Menggunakan NBC,”
ejournal.bsi.ac.id, vol. 4, no. 1, pp. 38–45, 2017.
H. Kaur, “Online News Classification: A Review,” pp. 7–9, 2013.
V. Ingle and S. Deshmukh, “Predictive mining for stock market
[4]
[5]
[6]
[7]

 based on live news TF-IDF features,” Int. J. Auton. Comput., vol. 2,
no. 4, p. 341, 2017.

[8] B. Baharudin, L. H. Lee, and K. Khan, “A Review of Machine
Learning Algorithms for Text-Documents Classification,”
J. Adv.
Inf. Technol.
, vol. 1, no. 1, pp. 4–20, 2010.
R. Daniel, “New open access resource will support text mining and
[9]

natural language processing,” 2015. [Online]. Available:
https://www.elsevier.com/connect/new-open-access-resource-willsupport-text-mining-and-natural-language-processing.

[10] S. Ghosh, S. Roy, and S. K. Bandyopadhyay, “A tutorial review on
Text Mining Algorithms,”
Int. J. Adv. Res. Comput. Commun. Eng.,
vol. 1, no. 4, pp. 223–233, 2012.
C. Tu, L. Chuang, J. Chang, and C. Yang, “Feature selection using
[11]

PSO-SVM,” IAENG Int. J. Comput. Sci., vol. 33, no. 1, pp. 1–6,
2007.