Binary Logistic Regression Modeling using Bayesian method: Analysis and Simulation on the Poverty Percentage of Districts in East Java

Achmad Efendi, Restilia Anggita Sari, Samingun Handoyo, Nur Silviyah Rahmi, Friansyah Gani

Abstract


In conducting logistic regression modeling, parameter estimation is considered an important stage. Determination of parameter estimates is often influenced by sample size and data characteristics. To cope with this issue, the Bayesian method is used as it is expected to be more robust, for instance to small sample size. In this method, MCMC is used to determine parameter values that are difficult to solve analytically. The study aims at determining the binary logistic regression model and its application to determine the factors that influence the percentage of city/district poverty rates in East Java in 2023. East Java was chosen because it has the highest percentage of poverty rates in Indonesia. This study uses informative and non-conjugate priors which is normal distribution in this case. Based on the results of the MCMC simulation with the Gibbs Sampling Algorithm, the random sample of study converged at the 266, 000th iteration with a burn-in of 60, 000 and a thin of 10. The results of this study indicate that the variables influencing the percentage of the poverty rate of cities/regencies in East Java are the Human Development Index (HDI), Life Expectancy (LE), and Gini Ratio (GR) which have significant effects. The residual deviance value shows a number that is smaller than the chi-square value. This means that the resulting model is appropriate. The model can predict data correctly by 84.2%. Simple simulations are carried out with different observed sample sizes. The simulation results show that the Bayesian method is somewhat better than likelihood estimation, particularly for data with small samples. Furthermore, we suggest the government of East Java would have more concern on HDI, LE, and GR as well as other factors related to them for poverty reduction policies.


Keywords


Bayesian; likelihood; logistic regression; poverty level; simulation

Full Text:

PDF

References


[1] Wayne Wobcke and Siti Mariyah. “Machine Learning and Data Augmentation in the Proxy Means Test for Poverty Targeting”. In: Statistical Journal of the IAOS 39.4 (2023), pp. 961–977. doi: 10.3233/SJI-230033.

[2] David W. Hosmer and Stanley Lemeshow. Applied Logistic Regression. 2nd ed. New York: John Wiley & Sons, 2000. doi: 10.1002/0471722146.

[3] Syarifah Nur Aini et al. “Deteksi Nefropati Diabetik pada Pasien Diabetes Melitus Menggunakan Regresi Logistik”. In: Jurnal Pengembangan Teknologi Informasi dan Ilmu Komputer 9.2 (2025), pp. 1–10.

[4] William M. Bolstad. Introduction to Bayesian Statistics. 2nd ed. Hoboken, NJ: John Wiley & Sons, 2007. doi: 10.1002/0471654213.

[5] A. Safitri, Sudarmin, and M. Nusrang. “Model Regresi Logistik Biner pada Tingkat Pengangguran Terbuka di Provinsi Sulawesi Barat Tahun 2017”. In: VARIANSI: Journal of Statistics and Its Application on Teaching and Research 1.2 (2019), pp. 1–6. doi: 10.35580/variansiunm10620.

[6] David B. Dunson. “Commentary: Practical Advantages of Bayesian Analysis of Epidemiologic Data”. In: American Journal of Epidemiology 153.12 (2001), pp. 1222–1226. doi: 10.1093/aje/153.12.1222.

[7] Desi Kurniawati and Hery T. Sutanto. “Faktor-Faktor yang Mempengaruhi Anemia Remaja Putri dengan Menggunakan Bayesian Regresi Logistik dan Algoritma Metropolis–Hastings”. In: MATHunesa: Jurnal Ilmiah Matematika 7.1 (2019), pp. 1–6. doi: 10.26740/mathunesa.v7n1.p1-6.

[8] P. A. Lukman, S. Abdullah, and A. Rachman. “Bayesian Logistic Regression and Its Application for Hypothyroid Prediction in Post-Radiation Nasopharyngeal Cancer Patients”. In: Journal of Physics: Conference Series 1725.1 (2021), p. 012010. doi: 10.1088/1742-6596/1725/1/012010.

[9] A. I. Nurrizqi, Erfiani, Indahwati, A. Fitrianto, and R. Amelia. “Pemodelan Regresi Logistik Berbasis Backward Elimination untuk Mengetahui Faktor-Faktor yang Memengaruhi Tingkat Kemiskinan di Indonesia Tahun 2021”. In: Jurnal Statistika dan Aplikasinya 6.2 (2022), pp. 160–170. doi: 10.21009/JSA.06206.

[10] Friansyah Gani, Henny Pramoedyo, and Achmad Efendi. “Spatial Variation of HDI in East Java: A Tricube-Based Geographically Weighted Regression–Flower Pollination Algorithm Modeling Approach”. In: CAUCHY – Jurnal Matematika Murni dan Aplikasi 11.1 (2026), pp. 239–254. doi: 10.18860/ca.v11i1.38007.

[11] S. James Press. Bayesian Statistics: Principles, Models, and Applications. New York: John Wiley & Sons, 1989. doi: 10.1002/9780470316696.

[12] José M. Bernardo and Adrian F. M. Smith. Bayesian Theory. Chichester: John Wiley & Sons, 2000. doi: 10.1002/9780470316870.

[13] Michael D. Lee. “A Bayesian Analysis of Retention Functions”. In: Journal of Mathematical Psychology 48.5 (2004), pp. 310–321. doi: 10.1016/j.jmp.2004.06.002.

[14] Edward Greenberg. Introduction to Bayesian Econometrics. 2nd ed. Cambridge: Cambridge University Press, 2012. doi: 10.1017/CBO9781139150736.

[15] Zening Zhao, Wei Duan, Guojun Cai, Meng Wu, and Songyu Liu. “CPT-Based Fully Probabilistic Seismic Liquefaction Potential Assessment to Reduce Uncertainty: Integrating XGBoost Algorithm with Bayesian Theorem”. In: Computers and Geotechnics 149 (2022), p. 104868. doi: 10.1016/j.compgeo.2022.104868.

[16] Ioannis Ntzoufras. Bayesian Modeling Using WinBUGS. Hoboken, NJ: John Wiley & Sons, 2009. doi: 10.1002/9780470434550.

[17] Peyman Amirafshari and Athanasios Kolios. “Estimation of Weld Defects Size Distributions, Rates and Probability of Detection in Fabrication Yards Using a Bayesian Theorem Approach”. In: International Journal of Fatigue 159 (2022), p. 106763. doi: 10.1016/j.ijfatigue.2022.106763.

[18] Muhammad Yarahmadi and Amin Salehi. “A Comparative Bayesian PINN–MCMC Analysis of Barrow–Tsallis Holographic Dark Energy with Neutrinos: Toward Resolving the Hubble Tension”. In: Journal of High Energy Astrophysics 50 (2026), p. 100498. doi: 10.1016/j.jheap.2025.100498.

[19] Cagatay Catal. “Performance Evaluation Metrics for Software Fault Prediction Studies”. In: Acta Polytechnica Hungarica 9.4 (2012), pp. 211–225. doi: 10.12700/APH.9.4.2012.4.13.

[20] Badan Pusat Statistik Provinsi Jawa Timur. Statistik Sektoral Jawa Timur 2023. Surabaya: BPS Provinsi Jawa Timur, 2023.

[21] Alan Agresti. Categorical Data Analysis. 2nd ed. Hoboken, NJ: John Wiley & Sons, 2002. doi: 10.1002/0471249688.

[22] Kevin Ross. An Introduction to Bayesian Reasoning and Methods. 2022. Available at: https://bookdown.org/kevin_davisross/bayesian-reasoning-and-methods/.

[23] D. A. M. Rohmah, A. B. Astuti, and A. Efendi. “A Statistical Analytics of Migration Using Binary Bayesian Logistic Regression”. In: BAREKENG: Jurnal Ilmu Matematika dan Terapan 17.3 (2023), pp. 1725–1738. doi: 10.30598/barekengvol17iss3pp1725-1738.




DOI: https://doi.org/10.18860/cauchy.v11i1.41326

Refbacks

  • There are currently no refbacks.


Copyright (c) 2026 Friansyah Gani, Achmad Efendi, Restilia Anggita Sari, Samingun Handoyo, Nur Silviyah Rahmi

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Editorial Office
Mathematics Department,
Universitas Islam Negeri Maulana Malik Ibrahim Malang
Gajayana Street 50 Malang, East Java, Indonesia 65144
Faximile (+62) 341 558933
e-mail: cauchy@uin-malang.ac.id

Creative Commons License
CAUCHY: Jurnal Matematika Murni dan Aplikasi is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.