Data Analysis of Student Monitoring Using the K-Means Clustering Method
DOI:
https://doi.org/10.64021/ijmst.1.2.50-57.2025Keywords:
Data Mining, Pre-processing, Student Monitoring, K-Means ClusteringAbstract
This study aims to group student monitoring data by focusing on two main variables, namely anxiety level and mood score, using the K-Means Clustering method. The research data was obtained from the Kaggle platform, which contains 1000 rows of data with nine attributes, including Student ID, Date, Class Time, Attendance Status, Stress Level, Sleep Hours, Anxiety Level, Mood Score, and Risk Level. The research process involved several stages, from problem identification, data collection, data cleaning and preprocessing, to the application of the K-Means algorithm. The analysis results showed that the data could be divided into two main groups: Cluster 1 consists of students with low to moderate anxiety levels and high mood scores, while Cluster 2 includes students with high anxiety and low mood scores. These findings provide relevant information for schools or campuses to design more effective psychological support and emotional monitoring programs. Additionally, this clustering method can serve as a foundation for developing an early detection system for psychological issues among students.
References
A. Koyalil and S. Rajalingam, “Enhanced Multi-level K-means Clustering and Cluster Head Selection Using a Modernized Pufferfish Optimization Algorithm for Lifetime Maximization in Wireless Sensor Networks,” Results in Engineering, p. 105836, Jun. 2025, doi: 10.1016/j.rineng.2025.105836.
R. K. Dinata, S. Safwandi, N. Hasdyna, and N. Azizah, “Analisis K-Means Clustering pada Data Sepeda Motor,” INFORMAL: Informatics Journal, vol. 5, no. 1, p. 10, Apr. 2020, doi: 10.19184/isj.v5i1.17071.
A. Al Fahrozi, F. Insani, E. Budianita, and I. Afrianty, “Implementasi Algoritma K-Means Clustering dalam Menentukan di Badan Pelatihan KesehatanPekanbaru,” vol. 1, pp. 474–492, 2023.
M. Sadeghi, P. Casey, E. J. M. Carranza, and E. P. Lynch, “Principal components analysis and K-means clustering of till geochemical data: Mapping and targeting of prospective areas for lithium exploration in Västernorrland Region, Sweden,” Ore Geol Rev, vol. 167, p. 106002, Apr. 2024, doi: 10.1016/j.oregeorev.2024.106002.
I. Nuryani and D. Darwis, “ANALISIS CLUSTERING PADA PENGGUNA BRAND HP MENGGUNAKAN METODE K-MEANS,” 2021.
W. A. Prastyabudi, A. N. Alifah, and A. Nurdin, “Segmenting the Higher Education Market: An Analysis of Admissions Data Using K-Means Clustering,” Procedia Comput Sci, vol. 234, pp. 96–105, 2024, doi: 10.1016/j.procs.2024.02.156.
M. Balcilar, A. H. Elsayed, and S. Hammoudeh, “Financial connectedness and risk transmission among MENA countries: Evidence from connectedness network and clustering analysis,” Journal of International Financial Markets, Institutions and Money, vol. 82, p. 101656, Jan. 2023, doi: 10.1016/j.intfin.2022.101656.
T. Liddell, A. S. Boser, S. Orofino, T. Mangin, and T. Carleton, “stagg:: A data pre-processing R package for climate impacts analysis,” Environmental Modelling & Software, vol. 183, p. 106202, Jan. 2025, doi: 10.1016/j.envsoft.2024.106202.
S. Wang et al., “Advances in Data Preprocessing for Biomedical Data Fusion: An Overview of the Methods, Challenges, and Prospects,” Information Fusion, vol. 76, pp. 376–421, Dec. 2021, doi: 10.1016/j.inffus.2021.07.001.
S. Khalighi, L. Ma, S. Ren, and A. Varveri, “Evaluating the impact of data pre-processing methods on classification of ATR-FTIR spectra of bituminous binders,” Fuel, vol. 376, p. 132701, Nov. 2024, doi: 10.1016/j.fuel.2024.132701.
W. Cardoso, J. V. Roque, J. J. Jansen, S. Y. Teng, and R. F. Teófilo, “Combinatorial Order Pre-processing Search (COPS): A new pre-processing strategy for large-scale interpretable data analysis in process analytical technologies,” Comput Chem Eng, vol. 192, p. 108892, Jan. 2025, doi: 10.1016/j.compchemeng.2024.108892.
R. Hasanah et al., “Play Store Data Scrapping and Preprocessing done as Sentiment Analysis Material,” Indonesian Journal of Modern Science and Technology, vol. 1, no. 1, pp. 16–21, Jan. 2025, doi: 10.64021/ijmst.1.1.16-21.2025.
S. Selvakumaran et al., “Improving operational use of post-disaster damage assessment for Urban Search and Rescue by integrated graph-based multimodal remote sensing data analysis,” Progress in Disaster Science, vol. 25, p. 100404, Jan. 2025, doi: 10.1016/j.pdisas.2025.100404.
F. Liu et al., “Use of latent profile analysis and k-means clustering to identify student anxiety profiles,” BMC Psychiatry, vol. 22, no. 1, Dec. 2022, doi: 10.1186/s12888-021-03648-7.
K. P. Sinaga and M.-S. Yang, “Unsupervised K-Means Clustering Algorithm,” IEEE Access, vol. 8, pp. 80716–80727, 2020, doi: 10.1109/ACCESS.2020.2988796.
E.-M. Papia and A. Kondi, “Quantifying subtle color transitions in Mark Rothko’s abstract paintings through K-means clustering and Delta E analysis,” J Cult Herit, vol. 72, pp. 194–204, Mar. 2025, doi: 10.1016/j.culher.2025.02.005.
W. Song, M. Zhao, and J. Yu, “Price distortion on market resource allocation efficiency: A DID analysis based on national-level big data comprehensive pilot zones,” International Review of Economics & Finance, p. 104128, Apr. 2025, doi: 10.1016/j.iref.2025.104128.
C. Gilga et al., “Legal and ethical considerations for demand-driven data collection and AI-based analysis in flood response,” International Journal of Disaster Risk Reduction, vol. 122, p. 105441, May 2025, doi: 10.1016/j.ijdrr.2025.105441.
E. A. Saputra and Y. Nataliani, “Analisis Pengelompokan Data Nilai Siswa untuk Menentukan Siswa Berprestasi Menggunakan Metode Clustering K-Means,” Journal of Information Systems and Informatics, vol. 3, no. 3, 2021, [Online]. Available: http://journal-isi.org/index.php/isi
A. L. Maukar, F. Marisa, A. A. Widodo, N. Kamilaningtyas, D. Novian, and D. Nugraha, “ANALISIS DATA PENERIMAAN MAHASISWA BARU BERBASIS K-MEANS,” Jurnal Informatika dan Komputer), vol. 6, no. 2, pp. 142–147, 2022.
M. Abidin et al., “Classification of Heart (Cardiovascular) Disease using the SVM Method,” Indonesian Journal of Modern Science and Technology, vol. 1, no. 1, pp. 9–15, Jan. 2025, doi: 10.64021/ijmst.1.1.9-15.2025.
L. Sans, I. Vallvé, J. Teixidó, J. M. Picas, J. Martínez-Roldán, and J. Pascual, “La era del big data: análisis del lenguaje natural mediante la aplicación de folksonomía,” Nefrología, vol. 42, no. 6, pp. 680–687, Nov. 2022, doi: 10.1016/j.nefro.2021.09.006.
A. Restrepo Román, D. J. Villegas, C. Rodriguez, A. Cogollo, I. D. Bedoya, and A. A. Amell Arrieta, “Implementation of a hierarchical cluster model to analyze wind and solar availability in the department of Antioquia, Colombia,” Case Studies in Chemical and Environmental Engineering, vol. 10, p. 101006, Dec. 2024, doi: 10.1016/j.cscee.2024.101006.
A. R. Nusantara Habibi et al., “Diabetes Mellitus Disease Analysis using Support Vector Machines and K-Nearest Neighbor Methods,” Indonesian Journal of Modern Science and Technology, vol. 1, no. 1, pp. 22–27, Jan. 2025, doi: 10.64021/ijmst.1.1.22-27.2025.
A. Maulana et al., “Classification of Stunting in Toddlers using Naive Bayes Method and Decision Tree,” Indonesian Journal of Modern Science and Technology, vol. 1, no. 1, pp. 28–33, Jan. 2025, doi: 10.64021/ijmst.1.1.28-33.2025.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Sulistiani, Ahmad Rizky Nusantara Habibi , Adrian Maulana , Hidear Talirongan , Anrom G. Abao , Ahmed Mahmoud Zaki Elmalky , Asno Azzawagama Firdaus (Author)

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
