Clustering and Mixture Model Analysis of Human Development Index in Papua: A Study Based on Educational Data (2010–2023)
Abstract
The purpose of this research is to analyze the distribution of the Human Development Index (HDI) in Papua based on the average length of schooling during the period 2010–2023 using the Gaussian Mixture Model (GMM) approach. Data from 27 districts are grouped into three clusters based on the distribution characteristics of each region. Weibull, Nakagami, and Generalized Extreme Value (GEV) distributions were selected to represent Cluster 1, Cluster 2, and Cluster 3, with parameter estimation using Maximum Likelihood Estimation (MLE). The results of the analysis show that Cluster 1 includes areas with low HDI such as Mamberamo Raya and Yahukimo, Cluster 2 reflects moderate HDI in areas such as Nduga and Tolikara, while Cluster 3 describes high HDI in districts such as Jayapura and Mimika. The mixture model that combines these three distributions provides an accurate representation of the HDI distribution pattern in Papua. Policy implications from these results include the development of cluster-based education programs to improve access to education in areas with low HDI, reduce educational disparities in areas with moderate HDI, and maintain sustainable development in areas with high HDI. This approach can be a reference for similar analyses in other regions with high development heterogeneity characteristics
Keywords
Full Text:
PDFReferences
[1] R. Karagiannis and G. Karagiannis, “Constructing composite indicators with Shannon entropy: The case of Human Development Index,” Socioecon. Plann. Sci., 2020, doi: 10.1016/j.seps.2019.03.007.
[2] C. Türe and Y. Türe, “A model for the sustainability assessment based on the human development index in districts of Megacity Istanbul (Turkey),” Environ. Dev. Sustain., 2021, doi: 10.1007/s10668-020-00735-9.
[3] Y. Jiang and C. Shi, “Estimating sustainability and regional inequalities using an enhanced sustainable development index in China,” Sustain. Cities Soc., 2023, doi: 10.1016/j.scs.2023.104555.
[4] S. Kwatra, A. Kumar, and P. Sharma, “A critical review of studies related to construction and computation of Sustainable Development Indices,” Ecological Indicators. 2020. doi: 10.1016/j.ecolind.2019.106061.
[5] W. Afalia, I. Hamda, S. A. Adriana, A. F. Alamsyah, and N. L. Wafiroh, “Determinants Of Human Development Index In Papua Province 2012-2021,” Wiga J. Penelit. Ilmu Ekon., 2023, doi: 10.30741/wiga.v13i2.1076.
[6] D. P. Rahmawati, I. N. Budiantara, D. D. Prastyo, and M. A. D. Octavanny, “Modeling of Human Development Index in Papua Province Using Spline Smoothing Estimator in Nonparametric Regression,” in Journal of Physics: Conference Series, 2021. doi: 10.1088/1742-6596/1752/1/012018.
[7] D. A. N. Sirodj, I. M. Sumertajaya, and A. Kurnia, “Analisis Clustering Time Series untuk Pengelompokan Provinsi di Indonesia Berdasarkan Indeks Pembangunan Manusia Jenis Kelamin Perempuan,” Stat. J. Theor. Stat. Its Appl., 2023, doi: 10.29313/statistika.v23i1.2181.
[8] A. M. Sikana and A. W. Wijayanto, “Analisis Perbandingan Pengelompokan Indeks Pembangunan Manusia Indonesia Tahun 2019 dengan Metode Partitioning dan Hierarchical Clustering,” J. Ilmu Komput., 2021, doi: 10.24843/jik.2021.v14.i02.p01.
[9] E. Luthfi and A. W. Wijayanto, “Analisis perbandingan metode hirearchical, k-means, dan k-medoids clustering dalam pengelompokkan indeks pembangunan manusia Indonesia,” INOVASI, 2021, doi: 10.30872/jinv.v17i4.10106.
[10] A.- Akramunnisa and F. Fajriani, “K-Means Clustering Analysis pada PersebaranTingkat Pengangguran Kabupaten/Kota di Sulawesi Selatan,” J. Varian, 2020, doi: 10.30812/varian.v3i2.652.
[11] A. Budi and Samuel, “Klasterisasi Indeks Pembangunan Manusia (Ipm) Per Kabupaten Di Indonesia Dengan Menggunakan Algoritma K-Means,” J. Inform. dan Bisnis, 2016.
[12] A. O. Lima, G. B. Lyra, M. C. Abreu, J. F. Oliveira-Júnior, M. Zeri, and G. Cunha-Zeri, “Extreme rainfall events over Rio de Janeiro State, Brazil: Characterization using probability distribution functions and clustering analysis,” Atmos. Res., vol. 247, no. August 2020, p. 105221, 2021, doi: 10.1016/j.atmosres.2020.105221.
[13] M. D. Doi, A. Rusgiyono, and T. Wuryandari, “ANALISIS k-MEDOIDS DENGAN VALIDASI INDEKS PADA IPM DAERAH 3T DI INDONESIA,” J. Gaussian, 2023, doi: 10.14710/j.gauss.12.2.178-188.
[14] A. E. Ezugwu et al., “A comprehensive survey of clustering algorithms: State-of-the-art machine learning applications, taxonomy, challenges, and future research prospects,” Engineering Applications of Artificial Intelligence. 2022. doi: 10.1016/j.engappai.2022.104743.
[15] A. Rajabi, M. Eskandari, M. J. Ghadi, L. Li, J. Zhang, and P. Siano, “A comparative study of clustering techniques for electrical load pattern segmentation,” Renew. Sustain. Energy Rev., 2020, doi: 10.1016/j.rser.2019.109628.
[16] J. Li and J. Liu, “A modified extreme value perspective on best-performance life expectancy,” J. Popul. Res., 2020, doi: 10.1007/s12546-020-09248-8.
[17] M. S. Dhanoa et al., “A strategy for modelling heavy-tailed greenhouse gases (GHG) data using the generalised extreme value distribution: Are we overestimating GHG flux using the sample mean?,” Atmos. Environ., 2020, doi: 10.1016/j.atmosenv.2020.117500.
[18] H. Omrani, A. Alizadeh, and M. Amini, “A new approach based on BWM and MULTIMOORA methods for calculating semi-human development index: An application for provinces of Iran,” Socioecon. Plann. Sci., 2020, doi: 10.1016/j.seps.2019.02.004.
[19] B. Giles-Corti, M. Lowe, and J. Arundel, “Achieving the SDGs: Evaluating indicators to be used to benchmark and monitor progress towards creating healthy and sustainable cities,” Health Policy. 2020. doi: 10.1016/j.healthpol.2019.03.001.
[20] S. A. Takyi, O. Amponsah, M. O. Asibey, and R. A. Ayambire, “An overview of Ghana’s educational system and its implication for educational equity,” Int. J. Leadersh. Educ., 2021, doi: 10.1080/13603124.2019.1613565.
[21] A. Sinha, D. Balsalobre-Lorente, M. W. Zafar, and M. M. Saleem, “Analyzing global inequality in access to energy: Developing policy framework by inequality decomposition,” J. Environ. Manage., 2022, doi: 10.1016/j.jenvman.2021.114299.
[22] T. Ladi, A. Mahmoudpour, and A. Sharifi, “Assessing impacts of the water poverty index components on the human development index in Iran,” Habitat Int., 2021, doi: 10.1016/j.habitatint.2021.102375.
[23] G. Resce, “Wealth-adjusted Human Development Index,” J. Clean. Prod., 2021, doi: 10.1016/j.jclepro.2021.128587.
[24] A. Yumashev, B. Ślusarczyk, S. Kondrashev, and A. Mikhaylov, “Global indicators of sustainable development: Evaluation of the influence of the human development index on consumption and quality of energy,” Energies, 2020, doi: 10.3390/en13112768.
[25] M. A. ul Haq, G. S. Rao, M. Albassam, and M. Aslam, “Marshall–Olkin Power Lomax distribution for modeling of wind speed data,” Energy Reports, vol. 6, no. May, pp. 1118–1123, 2020, doi: 10.1016/j.egyr.2020.04.033.
[26] S. Kageyama, N. Mori, S. Mugikura, H. Tokunaga, and K. Takase, “Gaussian mixture model-based cluster analysis of apparent diffusion coefficient values: a novel approach to evaluate uterine endometrioid carcinoma grade,” Eur. Radiol., 2021, doi: 10.1007/s00330-020-07047-6.
[27] M. Krit, O. Gaudoin, M. Xie, and E. Remy, “Simplified likelihood based goodness-of-fit tests for the weibull distribution,” Commun. Stat. Simul. Comput., vol. 45, no. 3, pp. 920–951, 2016, doi: 10.1080/03610918.2013.879889.
[28] M. H. Ouahabi, H. Elkhachine, F. Benabdelouahab, and A. Khamlichi, “Comparative study of five different methods of adjustment by the Weibull model to determine the most accurate method of analyzing annual variations of wind energy in Tetouan - Morocco,” Procedia Manuf., vol. 46, no. 2019, pp. 698–707, 2020, doi: 10.1016/j.promfg.2020.03.099.
[29] P. A. Costa Rocha, R. C. de Sousa, C. F. de Andrade, and M. E. V. da Silva, “Comparison of seven numerical methods for determining Weibull parameters for wind energy generation in the northeast region of Brazil,” Appl. Energy, vol. 89, no. 1, pp. 395–400, 2012, doi: 10.1016/j.apenergy.2011.08.003.
[30] A. K. Azad, M. G. Rasul, M. M. Alam, S. M. Ameer Uddin, and S. K. Mondal, “Analysis of wind energy conversion system using Weibull distribution,” Procedia Eng., vol. 90, pp. 725–732, 2014, doi: 10.1016/j.proeng.2014.11.803.
[31] M. Nassar, A. Alzaatreh, M. Mead, and O. Abo-Kasem, “Alpha power Weibull distribution: Properties and applications,” Commun. Stat. - Theory Methods, vol. 46, no. 20, pp. 10236–10252, 2017, doi: 10.1080/03610926.2016.1231816.
[32] M. Krit, O. Gaudoin, and E. Remy, “Goodness-of-fit tests for the Weibull and extreme value distributions: A review and comparative study,” Commun. Stat. Simul. Comput., vol. 50, no. 7, pp. 1888–1911, 2021, doi: 10.1080/03610918.2019.1594292.
[33] P. K. Chaurasiya, S. Ahmed, and V. Warudkar, “Study of different parameters estimation methods of Weibull distribution to determine wind power density using ground based Doppler SODAR instrument,” Alexandria Eng. J., vol. 57, no. 4, pp. 2299–2311, 2018, doi: 10.1016/j.aej.2017.08.008.
[34] A. M. Basheer, “Alpha power inverse Weibull distribution with reliability application,” J. Taibah Univ. Sci., vol. 13, no. 1, pp. 423–432, 2019, doi: 10.1080/16583655.2019.1588488.
[35] A. O. Lima, G. B. Lyra, M. C. Abreu, J. F. Oliveira-Júnior, M. Zeri, and G. Cunha-Zeri, “Extreme rainfall events over Rio de Janeiro State, Brazil: Characterization using probability distribution functions and clustering analysis,” Atmos. Res., vol. 247, no. March 2020, p. 105221, 2021, doi: 10.1016/j.atmosres.2020.105221.
[36] A. Sulis, R. Cozza, and A. Annis, “Extreme wave analysis methods in the gulf of Cagliari (South Sardinia, Italy),” Ocean Coast. Manag., vol. 140, pp. 79–87, 2017, doi: 10.1016/j.ocecoaman.2017.02.023.
[37] A. O. Lima, G. B. Lyra, M. C. Abreu, J. F. Oliveira-Júnior, M. Zeri, and G. Cunha-Zeri, “Extreme rainfall events over Rio de Janeiro State, Brazil: Characterization using probability distribution functions and clustering analysis,” Atmos. Res., vol. 247, no. July 2020, p. 105221, 2021, doi: 10.1016/j.atmosres.2020.105221.
[38] Q. Han, S. Ma, T. Wang, and F. Chu, “Kernel density estimation model for wind speed probability distribution with applicability to wind energy assessment in China,” Renew. Sustain. Energy Rev., vol. 115, no. September, p. 109387, 2019, doi: 10.1016/j.rser.2019.109387.
[39] M. H. Samuh and A. M. Salhab, “Distribution of squared sum of products of independent Nakagami-m random variables,” Commun. Stat. Simul. Comput., 2023, doi: 10.1080/03610918.2023.2234668.
[40] K. Mohammadi, O. Alavi, and J. G. McGowan, “Use of Birnbaum-Saunders distribution for estimating wind speed and wind power probability distributions: A review,” Energy Convers. Manag., vol. 143, pp. 109–122, 2017, doi: 10.1016/j.enconman.2017.03.083.
[41] K. S. Guedes, C. F. de Andrade, P. A. C. Rocha, R. dos S. Mangueira, and E. P. de Moura, “Performance analysis of metaheuristic optimization algorithms in estimating the parameters of several wind speed distributions,” Appl. Energy, vol. 268, no. March, p. 114952, 2020, doi: 10.1016/j.apenergy.2020.114952.
[42] C. Jung and D. Schindler, “Global comparison of the goodness-of-fit of wind speed distributions,” Energy Convers. Manag., vol. 133, pp. 216–234, 2017, doi: 10.1016/j.enconman.2016.12.006.
[43] Y. M. Kantar and I. Usta, “Analysis of the upper-truncated Weibull distribution for wind speed,” Energy Convers. Manag., vol. 96, pp. 81–88, 2015, doi: 10.1016/j.enconman.2015.02.063.
[44] K. Mohammadi, O. Alavi, A. Mostafaeipour, N. Goudarzi, and M. Jalilvand, “Assessing different parameters estimation methods of Weibull distribution to compute wind power density,” Energy Convers. Manag., vol. 108, pp. 322–335, 2016, doi: 10.1016/j.enconman.2015.11.015.
[45] T. Arslan, Y. M. Bulut, and A. Altin Yavuz, “Comparative study of numerical methods for determining Weibull parameters for wind energy potential,” Renew. Sustain. Energy Rev., vol. 40, pp. 820–825, 2014, doi: 10.1016/j.rser.2014.08.009.
[46] A. K. Mbah and A. Paothong, “Shapiro–Francia test compared to other normality test using expected p-value,” J. Stat. Comput. Simul., vol. 85, no. 15, pp. 3002–3016, 2015, doi: 10.1080/00949655.2014.947986.
[47] M. M. Badr, “Goodness-of-fit tests for the Compound Rayleigh distribution with application to real data,” Heliyon, vol. 5, no. 8, p. e02225, 2019, doi: 10.1016/j.heliyon.2019.e02225.
[48] B. Yazici and S. Yolacan, “A comparison of various tests of normality,” J. Stat. Comput. Simul., vol. 77, no. 2, pp. 175–183, 2007, doi: 10.1080/10629360600678310.
[49] H. A. Bayoud, “Tests of normality: new test and comparative study,” Commun. Stat. Simul. Comput., vol. 50, no. 12, pp. 4442–4463, 2021, doi: 10.1080/03610918.2019.1643883.
[50] S. Dey, D. Kumar, P. L. Ramos, and F. Louzada, “Exponentiated Chen distribution: Properties and estimation,” Commun. Stat. Simul. Comput., vol. 46, no. 10, pp. 8118–8139, 2017, doi: 10.1080/03610918.2016.1267752.
[51] I. Pobočíková, Z. Sedliačková, and M. Michalková, “Application of Four Probability Distributions for Wind Speed Modeling,” Procedia Eng., vol. 192, pp. 713–718, 2017, doi: 10.1016/j.proeng.2017.06.123.
[52] K. Aprianto, “Optimasi Kernel K-Means dalam Pengelompokan Kabupaten/Kota Berdasarkan Indeks Pembangunan Manusia di Indonesia,” Limits J. Math. Its Appl., 2018, doi: 10.12962/limits.v15i1.3408.
[53] K. D. R. Sianipar and I. Gunawan, “Algoritma K-Means Dalam Pengelompokan Kabupaten/Kota Berdasarkan Indeks Pembengunan Manusia Di Sumatera Utara,” J. Infomedia, 2021, doi: 10.30811/jim.v6i2.2426.
[54] H. E. Prastyo and F. Ilfana, “Pengelompokan kabupaten dan kota di jawa timur berdasarkan indeks pembangunan manusia dengan menggunakan metode k-means tahun 2020-2021,” J. Ilm. Komputasi dan Stat., 2022.
DOI: https://doi.org/10.18860/cauchy.v10i2.32988
Refbacks
- There are currently no refbacks.
Copyright (c) 2025 Alvian Sroyer, Henderina Morin, Felix Reba, Jonathan Wororomi, Agustinus Languwuyo

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Editorial Office
Mathematics Department,
Universitas Islam Negeri Maulana Malik Ibrahim Malang
Gajayana Street 50 Malang, East Java, Indonesia 65144
Faximile (+62) 341 558933
e-mail: cauchy@uin-malang.ac.id

CAUCHY: Jurnal Matematika Murni dan Aplikasi is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.






