Enhancing Image Classification of Cabbage Plant Diseases Using a Hybrid Model Convolutional Neural Network and XGBoost
Abstract
Classifying imbalanced datasets presents significant challenges, often leading to biased model performance, particularly in multiclass classification. This study addresses these issues by integrating Convolutional Neural Networks (CNN) and XGBoost, leveraging CNN’s exceptional feature extraction capabilities and XGBoost's robust handling of imbalanced data. The Hybrid CNN-XGBoost model was applied to classify cabbage plants affected by pests and diseases, which are categorized into five classes, with a significant imbalance between healthy and affected plants. The dataset, characterized by severe class imbalance, was effectively handled by the proposed model. A comparative analysis demonstrated that the CNN-XGBoost approach, with a Balanced Accuracy of 0.93 compared to 0.53 for the standalone CNN, significantly outperformed the standalone model, particularly for minority class predictions. This approach not only enhances the accuracy of plant disease and pest diagnosis but also provides a practical solution for farmers to efficiently identify and classify cabbage plants, contributing to more effective agricultural management.
Keywords
Full Text:
PDFReferences
Prabaningrum, L., & Moekasan, T. K. (2020). Incidence and diversity of insect pests and their natural enemies in control threshold-based cabbage cultivation. AAB Bioflux, 12(1), 12–21. http://www.aab.bioflux.com.ro [2] Wardhani, N. W. S., Lestantyo, P., & Rahmi, N. S. (2023). Decision support system as an element of webbased integrated pest control on cabbage plants. E3S Web of Conferences, 450. https://doi.org/10.1051/e3sconf/202345002001 [3] Reya, S. S., Malek, M. D. A., & Debnath, A. (2022). Deep Learning Approaches for Cabbage Disease Classification. 2022 International Conference on Recent Progresses in Science, Engineering and Technology (ICRPSET), 1–5. https://doi.org/10.1109/ICRPSET57982.2022.10188553 [4] Myna, A. N., Manasvi, K., Pavan, J. K., Rakshith, H. S., & Yukhta, D. J., (2023). Classification and Detection of Cabbage Leaf Diseases from Images Using Deep Learning Methods. Automation, Control and Intelligent Systems, 11(1), 1–7. https://doi.org/https://doi.org/10.11648/j.acis.20231101.11 [5] Skendžić, S., Zovko, M., Živković, I. P., Lešić, V., & Lemić, D. (2021). The impact of climate change on agricultural insect pests. In Insects (Vol. 12, Issue 5). https://doi.org/10.3390/insects12050440 [6] Dablain, D., Jacobson, K. N., Bellinger, C., Roberts, M., & Chawla, N. V. (2023). Understanding CNN Fragility When Learning with Imbalanced Data. Machine Learning, 1–26. https://doi.org/10.1007/s10994-023-06326-9 [7] Liu, L., Wu, X., Li, S., Li, Y., Tan, S., & Bai, Y. (2022). Solving the class imbalance problem using ensemble algorithm: application of screening for aortic dissection. BMC Medical Informatics and Decision Making, 22(1), 1–16. https://doi.org/10.1186/s12911-022-01821-w [8] Velarde, G., Sudhir, A., Deshmane, S., Deshmunkh, A., Sharma, K., & Joshi, V. (2023). Evaluating XGBoost for Balanced and Imbalanced Data: Application to Fraud Detection. http://arxiv.org/abs/2303.15218 Rahman, M., Prodhan, R., Shishir, Y., & Ripon, S. (2021). Analyzing and Evaluating Boosting-Based CNN Algorithms for Image Classification. 2021 International Conference on Intelligent Technologies (CONIT), 1–6. https://doi.org/10.1109/CONIT51480.2021.9498328 [9] Rahman, M., Prodhan, R., Shishir, Y., & Ripon, S. (2021). Analyzing and Evaluating Boosting-Based CNN Algorithms for Image Classification. 2021 International Conference on Intelligent Technologies (CONIT), 1–6. https://doi.org/10.1109/CONIT51480.2021.9498328 [10] Jiao, W., Hao, X., & Qin, C. (2021). The image classification method with cnn-xgboost model based on adaptive particle swarm optimization. Information (Switzerland), 12(4), 1–22. https://doi.org/10.3390/info12040156 [11] Gao, X., Jamil, N., Ramli, M. I., & Ariffin, S. M. Z. S. Z. (2024). A Comparative Analysis of Combination of CNN-Based Models with Ensemble Learning on Imbalanced Data. International Journal on Informatics Visualization, 8(1), 456–464. https://dx.doi.org/10.62527/joiv.8.1.2194 [12] Fleuret, F. (2023). The Little Book of Deep Learning (Vol. 1). University of Geneva. https://fleuret.org/public/lbdl.pdf [13] James, G., Witten, D., Hastie, T., & Tibshirani, R. (2021). Classification. In An Introduction to Statistical Learning: with Applications in R (pp. 129–195). Springer US. https://doi.org/10.1007/978-1-0716-1418-1_4 [14] Todi, A., Narula, N., Sharma, M., & Gupta, U. (2023). ConvNext: A Contemporary Architecture for Convolutional Neural Networks for Image Classification. Proceedings - 2023 3rd International Conference on Innovative Sustainable Computational Technologies, CISCT 2023. https://doi.org/10.1109/CISCT57197.2023.10351320 [15] Nguyen, A., Pham, K., Ngo, D., Ngo, T., & Pham, L. (2021). An Analysis of State-of-the-art Activation Functions For Supervised Deep Neural Network. 2021 International Conference on System Science and Engineering (ICSSE), 215–220. https://doi.org/10.1109/ICSSE52999.2021.9538437 [16] Kouretas, I., & Paliouras, V. (2019). Simplified Hardware Implementation of the Softmax Activation Function. 2019 8th International Conference on Modern Circuits and Systems Technologies (MOCAST), 1–4. https://doi.org/10.1109/MOCAST.2019.8741677 [17] Fadhlullah Kh.TQ, M., & Wahyono, W. (2024). Classification of Tuberculosis Based on Chest X-Ray Images for Imbalance Data using SMOTE. International Journal of Computing and Digital Systems, 15(1), 981–993. https://doi.org/10.12785/ijcds/160171 [18] De Diego, I. M., Redondo, A. R., Fernández, R. R., Navarro, J., & Moguerza, J. M. (2022). General Performance Score for classification problems. Applied Intelligence, 52(10), 12049–12063. https://doi.org/10.1007/s10489-021-03041-7 [19] Cullerne Bown, W. (2024). Sensitivity and Specificity versus Precision and Recall, and Related Dilemmas. Journal of Classification, 41(2), 402–426. https://doi.org/10.1007/s00357-024-09478-y [20] Vujović, Ž. (2021). Classification Model Evaluation Metrics. International Journal of Advanced Computer Science and Applications, 12(6), 599–606. https://doi.org/10.14569/IJACSA.2021.0120670 [21] Byeon, H. (2021). Comparing the Balanced Accuracy of Deep Neural Network and Machine Learning for Predicting the Depressive Disorder of Multicultural Youth. International Journal of Advanced Computer Science and Applications, 12(6), 584–588. https://doi.org/10.14569/IJACSA.2021.0120668
DOI: https://doi.org/10.18860/cauchy.v10i1.30866
Refbacks
- There are currently no refbacks.
Copyright (c) 2025 Nabila Ayunda Sovia

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Editorial Office
Mathematics Department,
Universitas Islam Negeri Maulana Malik Ibrahim Malang
Gajayana Street 50 Malang, East Java, Indonesia 65144
Faximile (+62) 341 558933
e-mail: cauchy@uin-malang.ac.id
CAUCHY: Jurnal Matematika Murni dan Aplikasi is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.