IMPLEMENTATION OF DECISION TREE AND SUPPORT VECTOR MACHINE ON RAISIN SEED CLASSIFICATION

Wardhani Utami Dewi(1), Khoirin Nisa(2*), Mustofa Usman(3),

(1) Universitas Lampung
(2) Universitas Lampung
(3) Universitas Lampung
(*) Corresponding Author


Abstract


In everyday life there are many complex and global problems, especially in terms of decision making. Machine learning (ML) which is built from the concepts of computer science statistics and mathematics can automatically solve problems without guidance from ordinary users. Decision tree (DT) and support vector machine (SVM) are two supervised learning methods among several classification algorithms in ML. Both algorithms are the most popular classification techniques due to their ability to change a complex decision-making process into a simple process. In this study, the accuracy of the DT and SVM algorithms is studied on classifying raisin seeds into the Besni class and the Kecimen class based on existing features. The raisin data are divided into training and testing data, and the evaluation of the two methods is done using the testing data. The results of the evaluation are compared based on the accuracy, sensitivity, specificity, and kappa levels of the DT and SVM algorithms. The results on classifying raisin seeds data show that the SVM algorithm is superior to DT, therefor the number of positive observations is more precise in the prediction.


Keywords


Data Mining; ML; Supervised Learning

Full Text:

PDF

References


Ashok, P., Jackermeier, M., Jagtap, P., KÅetínský, J., Weininger, M., & Zamani, M. (2020). DtControl: Decision tree learning algorithms for controller representation. HSCC 2020 - Proceedings of the 23rd International Conference on Hybrid Systems: Computation and Control ,Part of CPS-IoT Week. https://doi.org/10.1145/3365365.3382220

Bokaba, T., Doorsamy, W., & Paul, B. S. (2022). Comparative Study of Machine Learning Classifiers for Modelling Road Traffic Accidents. In Applied Sciences (Vol. 12, Issue 2). https://doi.org/10.3390/app12020828

Bzdok, D., Krzywinski, M., & Altman, N. (2018). Machine learning: supervised methods. Nature Methods, 15(1), 5–6. https://doi.org/10.1038/nmeth.4551

Cervantes, J., Garcia-Lamont, F., Rodríguez-Mazahua, L., & Lopez, A. (2020). A comprehensive survey on support vector machine classification: Applications, challenges and trends. Neurocomputing, 408, 189–215. https://doi.org/https://doi.org/10.1016/j.neucom.2019.10.118

Charbuty, B., & Abdulazeez, A. (2021). Classification Based on Decision Tree Algorithm for Machine Learning. Journal of Applied Science and Technology Trends, 2(01 SE-), 20–28. https://doi.org/10.38094/jastt20165

Chen, X., Yang, Z., & Lou, W. (2019). Fault Diagnosis of Rolling Bearing Based on the Permutation Entropy of VMD and Decision Tree. 2019 3rd International Conference on Electronic Information Technology and Computer Engineering (EITCE), 1911–1915. https://doi.org/10.1109/EITCE47263.2019.9095187

Fachrurrozi, S., Muljono, Shidik, G. F., Fanani, A. Z., Purwanto, & Zami, F. A. (2021). Increasing Accuracy of Support Vector Machine (SVM) By Applying N-Gram and Chi-Square Feature Selection for Text Classification. 2021 International Seminar on Application for Technology of Information and Communication (ISemantic), 42–47. https://doi.org/10.1109/iSemantic52711.2021.9573210

Gavankar, S. S., & Sawarkar, S. D. (2017). Eager decision tree. 2017 2nd International Conference for Convergence in Technology (I2CT), 837–840. https://doi.org/10.1109/I2CT.2017.8226246

Gkikas, D. C., Theodoridis, P. K., & Beligiannis, G. N. (2022). Enhanced Marketing Decision Making for Consumer Behaviour Classification Using Binary Decision Trees and a Genetic Algorithm Wrapper. Informatics, 9(2), 45.

Huang, S., Cai, N., Pacheco, P. P., Narrandes, S., Wang, Y., & Xu, W. (2018). Applications of support vector machine (SVM) learning in cancer genomics. Cancer Genomics & Proteomics, 15(1), 41–51.

Ilkay, C., Murat, K., & Sakir, T. (2020). Classification of Raisin Grains Using Machine Vision and Artificial Intelligence Methods. Gazi Muhendislik Bilimleri Dergisi, 6(3), 200–209. https://dergipark.org.tr/tr/download/article-file/1227592

Imran, M., Sattar, M. U., Khan, H. W., Ghaffar, A., & Mushtaq, H. (2022). Selecting a Better Classifier Using Machine Learning for COVID-19. International Journal of Computing and Digital Systems, 11(1), 955–962. https://doi.org/10.12785/ijcds/110178

Lantz, B. (2013). Machine Learning with R: Learn how to use R to apply powerful machine learning methods and gain an insight into real world applications. In Livery Place. Packt Publishing Ltd., Packt Publishing Ltd.

Liu, Y., Hu, L., Yan, F., & Zhang, B. (2013). Information Gain with Weight Based Decision Tree for the Employment Forecasting of Undergraduates. 2013 IEEE International Conference on Green Computing and Communications and IEEE Internet of Things and IEEE Cyber, Physical and Social Computing, 2210–2213. https://doi.org/10.1109/GreenCom-iThings-CPSCom.2013.417

Ma, G., Chao, Z., Zhang, Y., Zhu, Y., & Hu, H. (2018). The application of support vector machine in geotechnical engineering. IOP Conference Series: Earth and Environmental Science, 189(2). https://doi.org/10.1088/1755-1315/189/2/022055

Mienye, I. D., Sun, Y., & Wang, Z. (2019). Prediction performance of improved decision tree-based algorithms: a review. Procedia Manufacturing, 35, 698–703.

Mishra, A., & Dasgupta, A. (2022). Supervised and Unsupervised Machine Learning Algorithms for Forecasting the Fracture Location in Dissimilar Friction-Stir-Welded Joints. In Forecasting (Vol. 4, Issue 4, pp. 787–797). https://doi.org/10.3390/forecast4040043

Nalepa, J., & Kawulok, M. (2019). Selecting training sets for support vector machines: a review. Artificial Intelligence Review, 52(2), 857–900. https://doi.org/10.1007/s10462-017-9611-1

Nedaie, A., & Najafi, A. A. (2018). Support vector machine with Dirichlet feature mapping. Neural Networks, 98, 87–101. https://doi.org/https://doi.org/10.1016/j.neunet.2017.11.006

Nikam, S. S. (2015). A comparative study of classification techniques in data mining algorithms. Oriental Journal of Computer Science and Technology, 8(1), 13–19.

Reddy, M. R. (2021). Implementation of SVM machine learning Algorithm to predict lung And Breast Cancer (Vol. 12, Issue 12, pp. 3050–3060). https://www.turcomat.org/index.php/turkbilmat/article/view/7976

Shang, C., Li, M., Feng, S., Jiang, Q., & Fan, J. (2013). Feature selection via maximizing global information gain for text classification. Knowledge-Based Systems, 54, 298–309. https://doi.org/https://doi.org/10.1016/j.knosys.2013.09.019

Soofi, A. A., & Awan, A. (2017). Classification Techniques in Machine Learning: Applications and Issues. Journal of Basic & Applied Sciences, 13, 459–465.

Su, Q.-H., & Chiang, K.-N. (2022). Predicting Wafer-Level Package Reliability Life Using Mixed Supervised and Unsupervised Machine Learning Algorithms. In Materials (Vol. 15, Issue 11). https://doi.org/10.3390/ma15113897

Taneja, S., Gupta, C., Goyal, K., & Gureja, D. (2014). An Enhanced K-Nearest Neighbor Algorithm Using Information Gain and Clustering. 2014 Fourth International Conference on Advanced Computing & Communication Technologies, 325–329. https://doi.org/10.1109/ACCT.2014.22

Tariq, A., Yan, J., Gagnon, A. S., Riaz Khan, M., & Mumtaz, F. (2022). Mapping of cropland, cropping patterns and crop types by combining optical remote sensing images with decision tree classifier and random forest. Geo-Spatial Information Science, 1–19.




DOI: http://dx.doi.org/10.24127/ajpm.v12i1.6873

Refbacks

  • There are currently no refbacks.