Binary Logistic Regression Modeling using Bayesian method: Analysis and Simulation on the Poverty Percentage of Districts in East Java
Abstract
In conducting logistic regression modeling, parameter estimation is considered an important stage. Determination of parameter estimates is often influenced by sample size and data characteristics. To cope with this issue, the Bayesian method is used as it is expected to be more robust, for instance to small sample size. In this method, MCMC is used to determine parameter values that are difficult to solve analytically. The study aims at determining the binary logistic regression model and its application to determine the factors that influence the percentage of city/district poverty rates in East Java in 2023. East Java was chosen because it has the highest percentage of poverty rates in Indonesia. This study uses informative and non-conjugate priors which is normal distribution in this case. Based on the results of the MCMC simulation with the Gibbs Sampling Algorithm, the random sample of study converged at the 266, 000th iteration with a burn-in of 60, 000 and a thin of 10. The results of this study indicate that the variables influencing the percentage of the poverty rate of cities/regencies in East Java are the Human Development Index (HDI), Life Expectancy (LE), and Gini Ratio (GR) which have significant effects. The residual deviance value shows a number that is smaller than the chi-square value. This means that the resulting model is appropriate. The model can predict data correctly by 84.2%. Simple simulations are carried out with different observed sample sizes. The simulation results show that the Bayesian method is somewhat better than likelihood estimation, particularly for data with small samples. Furthermore, we suggest the government of East Java would have more concern on HDI, LE, and GR as well as other factors related to them for poverty reduction policies.
Keywords
Full Text:
PDFReferences
[1] W. Wobcke and S. Mariyah, “Machine learning and data augmentation in the proxy means test for poverty targeting,” Statistical Journal of the IAOS, vol. 39, no. 4, pp. 961–977, 2023. doi: 10.3233/SJI-230033.
[2] D. W. Hosmer and S. Lemeshow, Applied Logistic Regression, 2nd ed. New York: John Wiley & Sons, Inc., 2000. doi: 10.1002/0471722146.
[3] S. N. Aini et al., “Deteksi nefropati diabetik pada pasien diabetes melitus menggunakan regresi logistik,” Jurnal Pengembangan Teknologi Informasi dan Ilmu Komputer, vol. 9, no. 2, pp. 1–10, 2025.
[4] W. M. Bolstad, Introduction to Bayesian Statistics. New Jersey, USA: John Wiley & Sons, 2004. doi: 10.1002/0471654213.
[5] A. Safitri, Sudarmin, and M. Nusrang, “Model regresi logistik biner pada tingkat pengangguran terbuka di Provinsi Sulawesi Barat tahun 2017,” VARIANSI: Journal of Statistics and Its Application on Teaching and Research, vol. 1, no. 2, pp. 1–6, 2019. doi: 10.35580/variansiunm10620.
[6] D. B. Dunson, “Commentary: Practical advantages of Bayesian analysis of epidemiologic data,” American Journal of Epidemiology, vol. 153, no. 12, pp. 1222–1226, 2001. doi: 10.1093/aje/153.12.1222.
[7] D. Kurniawati and H. T. Sutanto, “Faktor-faktor yang mempengaruhi anemia remaja putri dengan menggunakan Bayesian regresi logistik dan algoritma Metropolis-Hastings,” Jurnal Ilmiah Matematika, vol. 7, no. 1, pp. 1–6, 2019. doi: 10.26740/mathunesa.v7n1.p1-6.
[8] P. A. Lukman, S. Abdullah, and A. Rachman, “Bayesian logistic regression and its application for hypothyroid prediction in post-radiation nasopharyngeal cancer patients,” Journal of Physics: Conference Series, vol. 1725, no. 1, p. 012010, 2021. doi: 10.1088/1742-6596/1725/1/012010.
[9] A. I. Nurrizqi, Erfiani, Indahwati, A. Fitrianto, and R. Amelia, “Pemodelan regresi logistik berbasis backward elimination untuk mengetahui faktor-faktor yang memengaruhi tingkat kemiskinan di Indonesia tahun 2021,” Jurnal Statistika dan Aplikasinya, vol. 6, no. 2, pp. 160–170, 2022. doi: 10.21009/JSA.06206.
[10] F. Gani, H. Pramoedyo, and A. Efendi, “Spatial variation of HDI in East Java: A Tricube-Based Geographically Weighted Regression–Flower Pollination Algorithm modeling approach,” CAUCHY – Jurnal Matematika Murni dan Aplikasi, vol. 11, no. 1, pp. 239–254, 2026. doi: 10.18860/ca.v11i1.XXXXX.
[11] S. J. Press, Bayesian Statistics: Principles, Models and Applications. New York: John Wiley & Sons, 1989. doi: 10.1002/9780470316696.
[12] J. M. Bernardo and A. F. M. Smith, Bayesian Theory. Wiley, 2000. doi: 10.1002/9780470316870.
[13] M. D. Lee, “A Bayesian analysis of retention functions,” Journal of Mathematical Psychology, vol. 48, no. 5, pp. 310–321, 2004. doi: 10.1016/j.jmp.2004.06.002.
[14] E. Greenberg, Introduction to Bayesian Econometrics. Cambridge University Press, 2012. doi: 10.1017/CBO9781139150736.
[15] I. Ntzoufras, Bayesian Modelling Using WinBUGS. New Jersey: John Wiley & Sons, Inc., 2009. doi: 10.1002/9780470434550.
[16] Z. Zhao, W. Duan, G. Cai, M. Wu, and S. Liu, “CPT-based fully probabilistic seismic liquefaction potential assessment to reduce uncertainty: Integrating XGBoost algorithm with Bayesian theorem,” Computers and Geotechnics, vol. 149, p. 104868, 2022. doi: 10.1016/j.compgeo.2022.104868.
[17] P. Amirafshari and A. Kolios, “Estimation of weld defects size distributions, rates and probability of detections in fabrication yards using a Bayesian theorem approach,” International Journal of Fatigue, vol. 159, p. 106763, 2022. doi: 10.1016/j.ijfatigue.2022.106763.
[18] I. Nerín et al., “Factores predictores de éxito a los 6 meses en fumadores tratados en una unidad de tabaquismo,” Archivos de Bronconeumología, vol. 40, no. 12, pp. 558–562, 2004. doi: 10.1016/S0300-2896(04)75592-1.
[19] A. Agresti, Categorical Data Analysis. New York: John Wiley & Sons, Inc., 2002. doi: 10.1002/0471249688.
[20] S. Weisberg, Applied Linear Regression, 3rd ed. New Jersey: John Wiley and Sons, Inc., 2005. doi: 10.1002/0471704091.
[21] M. H. Kutner, J. Neter, C. J. Nachtsheim, and W. Li, Applied Linear Statistical Models, 5th ed. Boston: McGraw-Hill Irwin, 2004.
[22] E. Sumarminingsih and S. Astutik, Pengantar Teori Peluang. UB Press, 2021.
[23] P. Congdon, Applied Bayesian Modelling, 2nd ed. John Wiley & Sons, 2014. doi: 10.1002/9781118895047.
[24] C. Koenig, “Bayesian hierarchical item response theory modeling,” in Reference Module in Social Sciences, Elsevier, 2026. doi: 10.1016/B978-0-443-26629-4.00177-5.
[25] P. Congdon, Bayesian Statistical Modelling, 2nd ed. England: John Wiley & Sons, Ltd., 2006. doi: 10.1002/0470034510.
[26] G. E. P. Box and G. C. Tiao, Bayesian Inference in Statistical Analysis. Philippines: Addison-Wesley Publishing Company, 1973. doi: 10.1002/9781118033197.
[27] F. Syafitri and R. Goejantoro, “Regresi logistik dengan metode Bayes untuk pemodelan indeks pembangunan manusia kabupaten/kota di Pulau Kalimantan,” Jurnal Eksponensial, vol. 12, no. 2, pp. 173–180, 2021. doi: 10.30872/eksponensial.v12i2.646.
[28] L. J. Bain and M. Engelhardt, Introduction to Probability and Mathematical Statistics, 2nd ed. Pacific Grove: Duxbury, 2000.
[29] X. Yu et al., “Markov chain Monte Carlo based adaptive Rauch–Tung–Striebel smoother,” Journal of the Franklin Institute, vol. 359, no. 15, pp. 8355–8376, 2022. doi: 10.1016/j.jfranklin.2022.08.007.
[30] M. Yarahmadi and A. Salehi, “A comparative Bayesian PINN–MCMC analysis of Barrow–Tsallis holographic dark energy with neutrinos: Toward resolving the Hubble tension,” Journal of High Energy Astrophysics, vol. 50, p. 100498, 2026. doi: 10.1016/j.jheap.2025.100498.
[31] S. Brooks, O. Gimenez, R. King, and B. Morgan, Bayesian Analysis for Population Ecology. USA: Chapman & Hall, 2010. doi: 10.1201/b10313.
[32] C. Catal, “Performance evaluation metrics for software fault prediction studies,” Acta Polytechnica Hungarica, vol. 9, no. 4, pp. 211–225, 2012. doi: 10.12700/APH.9.4.2012.4.13.
[33] Badan Pusat Statistik Jawa Timur, “Statistik Sektoral Jawa Timur,” 2023. Diakses pada 10 Desember 2023.
[34] K. Ross, An Introduction to Bayesian Reasoning and Methods. bookdown.org, 2022.
[35] D. A. M. Rohmah, A. B. Astuti, and A. Efendi, “A statistical analytics of migration using binary Bayesian logistic regression,” BAREKENG: Journal of Mathematics and Its Applications, vol. 17, no. 3, pp. 1725–1738, 2023. doi: 10.30598/barekengvol17iss3pp1725-1738.
DOI: https://doi.org/10.18860/cauchy.v11i1.41326
Refbacks
- There are currently no refbacks.
Copyright (c) 2026 Friansyah Gani, Achmad Efendi, Restilia Anggita Sari, Samingun Handoyo, Nur Silviyah Rahmi

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Editorial Office
Mathematics Department,
Universitas Islam Negeri Maulana Malik Ibrahim Malang
Gajayana Street 50 Malang, East Java, Indonesia 65144
Faximile (+62) 341 558933
e-mail: cauchy@uin-malang.ac.id

CAUCHY: Jurnal Matematika Murni dan Aplikasi is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.







