A Robust Framework for Dissolved Oxygen Forecasting in Precision Aquaculture: A LightGBM Approach with Advanced Feature Engineering

Nyoman Wira Prasetya, Richard Wijaya Harianto

Abstract


Accurate prediction of necessary water quality parameters such as Dissolved Oxygen (DO) is very critical in precision aquaculture and is essential for performance-based decision-making. This thesis fills the gap between reactive monitoring and predictive intelligence through the construction of a solid machine learning infrastructure. We convert high frequency multivariate time series data into a supervised learning problem by an advanced feature engineering process that generates temporal predictions including lag features and rolling window statistics. A Light Gradient Boosting machine (LightGBM) algorithm trained using the above-mentioned engineered dataset has an extreme predictive power. Results of single-variable interpretation analysis showed that short term data, especially the 5-minute rolling statistics of DO and turbidity variability, are the main driving factors for the model prediction. This research confirms that a feature-engineered LightGBM approach is a computationally efficient, but highly accurate approach to supporting the development of early warning systems in modern aquaculture as a computationally scalable approach.

Full Text:

PDF

References


[1] Gatta, P. P. (2022). The State of World Fisheries and Aquaculture 2022. In FAO eBooks. https://doi.org/10.4060/cc0461en.

[2] Kumar, A., & Saini, M. (2023). A Comprehensive Review of Sustainable Aquaculture Practices. Advanced Research in Agriculture Science and Technology, 6(2), 1-10.

[3] Føre, M., Frank, K., Norton, T., Svendsen, E., Alfredsen, J. A., Dempster, T., Eguiraun, H., Watson, W., Stahl, A., Sunde, L. M., Schellewald, C., Skøien, K. R., Alver, M. O., & Berckmans, D. (2017). Precision fish farming: A new framework to improve production in aquaculture. Biosystems Engineering, 173, 176–193. https://doi.org/10.1016/j.biosystemseng.2017.10.014

[4] Narsale, S. A., Prakash, P., Mohale, H. P., Baraiya, R., Sheikh, S., Kirtikumar, P. B., Mansukhbhai, C. R., Kadam, R. V., & Tekam, I. (2024). Precision Aquaculture: A way forward for Sustainable agriculture. Journal of Experimental Agriculture International, 46(5), 83–97. https://doi.org/10.9734/jeai/2024/v46i52360.

[5] O'Donncha, Fearghal & Grant, Jon. (2019). Precision Aquaculture. IEEE Internet of Things Magazine. 2. 26-30. 10.1109/IOTM.0001.1900033.

[6] Mohammed, Imran & Blesslene, Hepzibah & Faizullah, Mohamed & Nicy, Brita & devi, Vimala & Pandiyan, Praveenkumar. (2025). Revolution of AI in Aquaculture and Fish Processing: A Review https://ojs.pphouse.org/index.php/IJBSM Natural Resource Management International Journal of Bio-resource and Stress Management. International Journal of Bio-resource and Stress Management. 16. 1-14. 10.23910/1.2025.6194.

[7] Isla, Mercedes. (2008). Water quality in recirculating aquaculture systems (ras) for arctic charr (salvelinus alpinus L.) culture..

[8] Ali, Bulbul & Anushka & Mishra, Abha. (2022). Effects of dissolved oxygen concentration on freshwater fish: A review. International Journal of Fisheries and Aquatic Studies. 10. 113-127. 10.22271/fish.2022.v10.i4b.2693.

[9] Ali, Bulbul & ., Anushka & Mishra, Abha. (2022). Effects of dissolved oxygen concentration on freshwater fish: A review. International Journal of Fisheries and Aquatic Studies. 10. 113-127. 10.22271/fish.2022.v10.i4b.2693.

[10] Syauqy, Dahnial & Hanggara, Buce & Purnomo, Welly & Putra, Widhy Hayuhardhika Nugraha & Prasetya, Nyoman. (2022). Automated Continuous IoT-based Monitoring System for Vaname Shrimp Cultivation Management. Computer Engineering and Applications Journal. 11. 89-100. 10.18495/comengapp.v11i2.402.

[11] Prasetya, Nyoman & Harahap, Arya & Aulady, Fadhli & Wulandari, Inayah. (2023). Design of Water Monitoring System in Aquaponics Based on Arduino Nano and Raspberry Pi. MATICS: Jurnal Ilmu Komputer dan Teknologi Informasi (Journal of Computer Science and Information Technology). 15. 29-42. 10.18860/mat.v15i1.23005.

[12] Rodríguez, M. C. B., González, C. E. M., Almanza, E. D. C., Rodríguez, C. G., Regino-Vergara, J. Á., & López-Padilla, A. (2025). Benefits and challenges of the internet of things in aquaculture production: a literature review. Frontiers in Sustainable Food Systems, 9. https://doi.org/10.3389/fsufs.2025.1590153.

[13] Papolonias, Juffil & Lavilles, Rabby & Miano, Joel. (2025). Development of water quality monitoring system for fish farming. Bulletin of Electrical Engineering and Informatics. 14. 2962-2974. 10.11591/eei.v14i4.7673.

[14] Saha, Sajal & Rajib, Rakibul & Kabir, Sumaiya. (2018). IoT Based Automated Fish Farm Aquaculture Monitoring System. 201-206. 10.1109/ICISET.2018.8745543.

[15] Yang, H., Sun, M., & Liu, S. (2023). A hybrid intelligence model for predicting dissolved oxygen in aquaculture water. Frontiers in Marine Science, 10. https://doi.org/10.3389/fmars.2023.1126556

[16] Zhou, S., Song, C., Zhang, J., Chang, W., Hou, W., & Yang, L. (2022). A Hybrid Prediction Framework for Water Quality with Integrated W-ARIMA-GRU and LightGBM Methods. Water, 14(9), 1322. https://doi.org/10.3390/w14091322.

[17] Hartanto, Anggit & Kholik, Yanuar & Pristyanto, Yoga. (2023). Stock Price Time Series Data Forecasting Using the Light Gradient Boosting Machine (LightGBM) Model. JOIV : International Journal on Informatics Visualization. 7. 2270. 10.30630/joiv.7.4.01740.

[18] Porto, B. M., & Fogliatto, F. S. (2024). Enhanced forecasting of emergency department patient arrivals using feature engineering approach and machine learning. BMC Medical Informatics and Decision Making, 24(1). https://doi.org/10.1186/s12911-024-02788-6.

[19] Shukla, M. (2025). Interpreting Time Series Forecasts with LIME and SHAP:A Case Study on the Air Passengers Dataset. Research Square (Research Square). https://doi.org/10.21203/rs.3.rs-7358158/v1

[20] Lee, A. H. S., Shankararaman, V., & Ouh, E. L. (2023). Extending the Horizon by Empowering Government Customer Service Officers with ACQAR for Enhanced Citizen Service Delivery. 2021 IEEE International Conference on Big Data (Big Data), 1952–1958. https://doi.org/10.1109/bigdata59044.2023.10386189

[21] Petkovski, Aleksandar & Shehu, Visar. (2023). Anomaly Detection on Univariate Sensing Time Series Data for Smart Aquaculture Using Deep Learning. SEEU Review. 18. 1-16. 10.2478/seeur-2023-0030.

[22] Al-Kassab-Córdova, A., & Soto-Becerra, P. (2025). Causal Inference in Public Health: A Call to Stop Causal Fishing Expeditions. Public health reports (Washington, D.C. : 1974), 333549251342034. Advance online publication. https://doi.org/10.1177/00333549251342034

[23] Li, Mingjie & Li, Zeyan & Yin, Kanglin & Nie, Xiaohui & Zhang, Wenchi & Sui, Kaixin & Pei, Dan. (2022). Causal Inference-Based Root Cause Analysis for Online Service Systems with Intervention Recognition. 10.1145/3534678.3539041.

[24] Villarroel, D. & Mayer, Miguel Angel & Leis, Angela & Karkaletsis, Vangelis & Stamatakis, Konstantinos & Metsis, Vangelis & Nasikas, P. & Labsky, Martin & Ruzicka, M. & Svátek, Vojtěch & López-Ostenero, F. & Peinado, V.. (2010). Assisting Quality Assessment (AQUA) – a system based on semantic web and information extraction technologies to support medical quality labelling.

[25] ProRCA: a causal python package for actionable root cause analysis in real-world business scenarios. (n.d.). https://arxiv.org/html/2503.01475v1

[26] Su, L., Zuo, X., Li, R., Wang, X., Zhao, H., & Huang, B. (2023, October 31). A Systematic review for transformer-based long-term series forecasting. arXiv.org. https://arxiv.org/abs/2310.20218

[27] Paialunga, P. (2025, July 15). Hands-On Attention Mechanism for Time Series Classification, with Python. Towards Data Science. https://towardsdatascience.com/hands-on-attention-mechanism-for-time-series-classification-with-python/

[28] Nuanmeesri, S., Poomhiran, L., Kadmateekarun, P., & Tarasawatpipat, C. (2023). Optimizing Predictive Modeling for Water Quality in Farms with Blended Artificial Neural Network. Journal of System and Management Sciences, 14(1). https://doi.org/10.33168/jsms.2024.0118

[29] Mohan, S., Kumar, B., & Nejadhashemi, A. P. (2025). Integration of Machine Learning and Remote Sensing for Water Quality Monitoring and Prediction: A Review. Sustainability, 17(3), 998. https://doi.org/10.3390/su17030998

[30] Zambrano, A. F., Giraldo, L. F., Quimbayo, J., Medina, B., & Castillo, E. (2021). Machine learning for manually-measured water quality prediction in fish farming. PLoS ONE, 16(8), e0256380. https://doi.org/10.1371/journal.pone.0256380

[31] Ma, Y., Fang, Q., Xia, S., & Zhou, Y. (2024). Prediction of the Dissolved Oxygen Content in Aquaculture Based on the CNN-GRU Hybrid Neural Network. Water, 16(24), 3547. https://doi.org/10.3390/w16243547

Eze, E., & Ajmal, T. (2020). Dissolved Oxygen Forecasting in Aquaculture: A Hybrid Model Approach. Applied Sciences, 10(20), 7079. https://doi.org/10.3390/app10207079




DOI: https://doi.org/10.18860/mat.v18i1.37617

Refbacks

  • There are currently no refbacks.




Copyright (c) 2026 Nyoman Wira Prasetya, Richard Wijaya Harianto

License URL: http://creativecommons.org/licenses/by-sa/4.0/

The journal is indexed by :

Dimensions Sinta CrossRef GoogleScholar
Index Copernicus Moraref Portal Garuda

 

_______________________________________________________________________________________________________________

Editorial Office:
Informatics Engineering Department
Faculty of Science and Technology
Universitas Islam Negeri Maulana Malik Ibrahim Malang
Jalan Gajayana 50 Malang, Jawa Timur, Indonesia 65144
Email: matics@uin-malang.ac.id
_______________________________________________________________________________________________________________

Creative Commons License
This work is licensed under a CC-BY-SA 4.0.
© All rights reserved 2015. MATICS , ISSN : 1978-161X | e-ISSN :  2477-2550