An Analysis of The C4.5 Decision Tree Algorithm Method Applied to The Play Tennis Dataset and Manual Calculation Approach
DOI:
https://doi.org/10.64021/ijmst.1.2.65-70.2025Keywords:
Analysis, Decision Tree C4.5, Machine LearningAbstract
This study explores the use of the C4.5 decision tree algorithm on the Play Tennis dataset through two approaches: manual calculations and a Python-based program. As an improved version of the ID3 algorithm, C4.5 is capable of managing both categorical and numerical inputs, dealing with missing data, and utilizing entropy and information gain to determine the most important features. The dataset contains 14 entries with attributes such as Outlook, Temperature, Humidity, Windy, and the target variable PlayTennis. Entropy and information gain were calculated manually to construct the decision tree in a step-by-step manner. The resulting tree was then compared with one generated using Python tools like Pandas, NumPy, and Scikit-learn. Both trees were identical, confirming the accuracy of the method. A comparison with previous research highlights the flexibility and clarity of decision tree algorithms, making them suitable for various fields such as healthcare, finance, privacy-conscious machine learning, and materials science. These findings support the real-world usefulness of such algorithms. Overall, the study finds that C4.5 is highly effective for small classification problems and shows promise for use in larger, more complex datasets. Additionally, this research supports deeper learning of how decision tree algorithms work, making it a helpful reference for both educational and applied data science contexts.
References
N. A. Aziz, A. Manzoor, M. Deedahwar Mazhar Qureshi, M. Atif Qureshi, and W. Rashwan, “Explainable AI in Healthcare: Systematic Review of Clinical Decision Support Systems,” 2024, [Online]. Available: https://doi.org/10.1101/2024.08.10.24311735
M. W. Ali, M. Q. Memon, and M. F. Hyder, “Optimization of Fraud Detection Models for Safeguarding Customer Transactions,” vol. 5, no. 2, pp. 86–100, 2025.
Y. Mao, B. Legesse, and T. Belsty, “Current Problems in Cardiology Machine learning algorithms for heart disease diagnosis : A systematic review,” Curr. Probl. Cardiol., vol. 50, no. 8, p. 103082, 2025, doi: 10.1016/j.cpcardiol.2025.103082.
S. Sreekumari, R. Bhalla, and G. Singh, “Feature Selection and Model Evaluation for Heart Disease Prediction Using Ensemble Methods,” Procedia Comput. Sci., vol. 259, pp. 1282–1295, 2025, doi: 10.1016/j.procs.2025.04.083.
G. Dharmarathne, M. Bogahawaththa, U. Rathnayake, and D. P. P. Meddage, “Integrating explainable machine learning and user-centric model for diagnosing cardiovascular disease: A novel approach,” Intell. Syst. with Appl., vol. 23, no. August, p. 200428, 2024, doi: 10.1016/j.iswa.2024.200428.
Y. Efe and L. Demir, “The impact of feature selection models on the accuracy of tree-based classification algorithms: Heart disease case,” Procedia Comput. Sci., vol. 253, no. 2024, pp. 757–764, 2025, doi: 10.1016/j.procs.2025.01.137.
G. Manikandan, B. Pragadeesh, V. Manojkumar, A. L. Karthikeyan, R. Manikandan, and A. H. Gandomi, “Classification models combined with Boruta feature selection for heart disease prediction,” Informatics Med. Unlocked, vol. 44, no. December 2023, p. 101442, 2024, doi: 10.1016/j.imu.2023.101442.
M. Ozcan and S. Peker, “A classification and regression tree algorithm for heart disease modeling and prediction,” Healthc. Anal., vol. 3, no. July 2022, p. 100130, 2023, doi: 10.1016/j.health.2022.100130.
A. Maulana et al., “Classification of Stunting in Toddlers using Naive Bayes Method and Decision Tree,” vol. 1, no. 1, pp. 28–33, 2025.
D. A. Anggoro and D. Permatasari, “Performance Comparison of the Kernels of Support Vector Machine Algorithm for Diabetes Mellitus Classification,” Int. J. Adv. Comput. Sci. Appl., vol. 14, no. 1, pp. 580–585, 2023, doi: 10.14569/IJACSA.2023.0140163.
M. Alizade-Harakiyan, A. Khodaei, A. Yousefi, H. Zamani, and A. Mesbahi, “Decision tree-based machine learning algorithm for prediction of acute radiation esophagitis,” Biochem. Biophys. Reports, vol. 42, no. March, 2025, doi: 10.1016/j.bbrep.2025.101991.
S. Kiran, G. R. Reddy, S. P. Girija, S. Venkatramulu, K. Dorthi, and V. Chandra Shekhar Rao, “A Gradient Boosted Decision Tree with Binary Spotted Hyena Optimizer for cardiovascular disease detection and classification,” Healthc. Anal., vol. 3, no. March, p. 100173, 2023, doi: 10.1016/j.health.2023.100173.
D. Sun, “ScienceDirect Research on Privacy Protection Technology Based on Decision Tree Algorithm,” Procedia Comput. Sci., vol. 262, pp. 1309–1315, 2025, doi: 10.1016/j.procs.2025.05.175.
J. Wu, G. Zhao, M. Wang, Y. Xu, and N. Wang, “Concrete carbonation depth prediction model based on a gradient-boosting decision tree and different metaheuristic algorithms,” Case Stud. Constr. Mater., vol. 21, no. September, p. e03864, 2024, doi: 10.1016/j.cscm.2024.e03864.
M. Qiu, “Path Planning Algorithm and ID3 Decision Tree Model Application of Scenic Intelligent Navigation System,” Procedia Comput. Sci., vol. 247, no. C, pp. 1187–1196, 2023, doi: 10.1016/j.procs.2024.10.143.
A. A. Firdaus, R. A. Faresta, and M. Yunus, “The Role of Sentiment Analysis in Election Predictions Compared to Electability Surveys,” vol. 1, no. 1, pp. 1–8, 2025.
A. Rizky, N. Habibi, I. Sufiyandi, A. K. M. Jayed, and A. M. Nakib, “Diabetes Mellitus Disease Analysis using Support Vector Machines and K-Nearest Neighbor Methods,” vol. 1, no. 1, pp. 22–27, 2025.
R. Hasanah, Z. Hasanah, and S. A. Wijaya, “Play Store Data Scrapping and Preprocessing done as Sentiment Analysis Material,” vol. 1, no. 1, pp. 16–21, 2025.
L. Ji and S. Li, “A dynamic financial risk prediction system for enterprises based on gradient boosting decision tree algorithm,” Syst. Soft Comput., vol. 7, no. December 2024, p. 200189, 2025, doi: 10.1016/j.sasc.2025.200189.
K. Khosravi, A. A. Faroouqe, A. R. Shahvaran, P. Daggupati, S. Heddam, and J. Hatamiafkoueieh, “Beyond conventional modeling: A cutting-edge hybrid IAER-AMT decision-tree-based algorithm for high-resolution river turbidity prediction,” Ain Shams Eng. J., vol. 16, no. 9, p. 103511, 2025, doi: 10.1016/j.asej.2025.103511.
I. Kayan and N. Ayman Oz, “Integrating response surface methodology and decision tree algorithms for valorization of cheese whey wastewater,” Desalin. Water Treat., vol. 322, no. March, p. 101129, 2025, doi: 10.1016/j.dwt.2025.101129.
M. Abidin and M. Muzir, “CLASSIFICATION OF HEART ( CARDIOVASCULAR ) DISEASE USING THE SVM METHOD,” vol. 1, no. 1, pp. 1–7, 2025.
Z. Zhang and D. Xu, “Enterprise marketing data mining method based on decision tree algorithm,” Procedia Comput. Sci., vol. 261, pp. 1172–1178, 2025, doi: 10.1016/j.procs.2025.04.701.
A. Elhazmi et al., “Machine learning decision tree algorithm role for predicting mortality in critically ill adult COVID-19 patients admitted to the ICU,” J. Infect. Public Health, vol. 15, no. 7, pp. 826–834, 2022, doi: 10.1016/j.jiph.2022.06.008.
B. H. De Figueiredo, M. Dos Santos, L. P. L. Fávero, M. Â. L. Moreira, and I. P. De Araújo Costa, “Analysis of maintenance activities in Urban Pavement Management Systems based on Decision Tree Algorithm,” Procedia Comput. Sci., vol. 214, no. C, pp. 712–719, 2022, doi: 10.1016/j.procs.2022.11.233.
A. R. Panhalkar and D. D. Doye, “Optimization of decision trees using modified African buffalo algorithm,” J. King Saud Univ. - Comput. Inf. Sci., vol. 34, no. 8, pp. 4763–4772, 2022, doi: 10.1016/j.jksuci.2021.01.011
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Minhajul Abidin, M. Hikari Aufa, M. Ilham Cahyo Saputra, Babatunde Bamidele Oyeyemi, Nuraqilla Waidha Bintang Grendis (Author)

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
