Machine Learning Classification of Insomnia Using Multidimensional Features and SMOTE
Abstract
Sleep disorders, especially insomnia, are common among adolescents and negatively affect health. Early detection is crucial for appropriate treatment. This study aims to classify insomnia severity in adolescents using a Machine Learning (ML) model and multidimensional features derived from 19 questionnaire instruments. The dataset consists of 95 adolescents aged 16–19 years, categorized into Insomnia, Subclinical Insomnia, and Control classes. The modeling process includes reducing multicollinearity, class balancing with SMOTE, and hyperparameter optimization using GridSearchCV and StratifiedKFold. Feature importance analysis was conducted using decision tree-based methods and permutation importance. The results show that SMOTE improves SVM performance from 0.690 to 0.793 and positively affects Random Forest. Logistic Regression performs best without SMOTE (accuracy 0.759), while XGBoost shows the lowest accuracy (0.614) even with SMOTE. A total of 11 features consistently contribute to all models. In conclusion, ML models, particularly SVM, are effective for classifying insomnia severity in adolescents.
Keywords
References
P. Liu et al., “Automatic sleep stage classification using deep learning: signals, data representation, and neural networks,” Artificial Intelligence Review, vol. 57, no. 11, p. 301, 2024.
M. Zhao, H. Tuo, S. Wang, and L. Zhao, “The effects of dietary nutrition on sleep and sleep disorders,” Mediators of Inflammation, vol. 2020, no. 1, p. 3142874, 2020.
Y. J. Lee, J. Y. Lee, J. H. Cho, and J. H. Choi, “Interrater reliability of sleep stage scoring: a meta-analysis,” Journal of Clinical Sleep Medicine, vol. 18, no. 1, pp. 193–202, 2022.
S. Khanmohmmadi et al., “Revolutionizing sleep disorder diagnosis: a multi-task learning approach optimized with genetic and Q-learning techniques,” Scientific Reports, vol. 15, no. 1, p. 16603, 2025.
J. F. Gaultney, “The prevalence of sleep disorders in college students: impact on academic performance,” Journal of American College Health, vol. 59, no. 2, pp. 91–97, 2010.
T. S. Alshammari, “Applying machine learning algorithms for the classification of sleep disorders,” IEEE Access, vol. 12, pp. 36110–36121, 2024.
L. Frase, C. Nissen, K. Spiegelhalder, and B. Feige, “The importance and limitations of polysomnography in insomnia disorder—a critical appraisal,” Journal of Sleep Research, vol. 32, no. 6, p. e14036, 2023.
W. Lee et al., “The simplification of the Insomnia Severity Index and Epworth Sleepiness Scale using machine learning models,” Scientific Reports, vol. 13, no. 1, p. 6214, 2023.
S. Chung et al., “Psychometric properties of the Insomnia Severity Index and its comparison with the shortened versions among the general population,” Psychiatry Investigation, vol. 21, no. 1, p. 9, 2024.
K. Sadeghniiat-Haghighi, Z. Yazdi, and M. Firoozeh, “Comparison of two assessment tools that measure insomnia: the Insomnia Severity Index and polysomnography,” Indian Journal of Psychological Medicine, vol. 36, no. 1, pp. 54–57, 2014.
D. J. Buysse et al., “Relationships between the Pittsburgh Sleep Quality Index (PSQI), Epworth Sleepiness Scale (ESS), and clinical/polysomnographic measures in a community sample,” Journal of Clinical Sleep Medicine, vol. 4, no. 6, pp. 563–571, 2008.
S. Ha et al., “Predicting the risk of sleep disorders using a machine learning–based simple questionnaire: development and validation study,” Journal of Medical Internet Research, vol. 25, p. e46520, 2023.
A. R. Schwartz et al., “Brief digital sleep questionnaire powered by machine learning prediction models identifies common sleep disorders,” Sleep Medicine, vol. 71, pp. 66–76, 2020.
M. Sharma, J. Tiwari, V. Patel, and U. R. Acharya, “Automated identification of sleep disorder types using triplet half-band filter and ensemble machine learning techniques with EEG signals,” Electronics, vol. 10, no. 13, p. 1531, 2021.
H. Yeo et al., “Exploring predictors of insomnia severity in shift workers using machine learning model,” Frontiers in Public Health, vol. 13, p. 1494583, 2025.
O. Kiss, D. Yuksel, D. E. Prouty, F. C. Baker, and M. de Zambotti, “A dataset reflecting the multidimensionality of insomnia symptomatology in adolescence using standardized questionnaires,” Data in Brief, vol. 44, p. 108523, 2022.
Y. A. Akın and B. S. Arıca Polat, “Sleep disorders in multiple sclerosis,” Gulhane Medical Journal, vol. 63, no. 2, pp. 141–146, 2021.
L. Fang et al., “Machine learning models to identify clinically significant anxiety in short-term insomnia using accelerometers,” Depression and Anxiety, vol. 2025, no. 1, p. 3082856, 2025.
L. Simon, Y. Terhorst, C. Cohrdes, R. Pryss, L. Steinmetz, J. D. Elhai, and H. Baumeister, “The predictive value of supervised machine learning models for insomnia symptoms through smartphone usage behavior,” Sleep Medicine: X, vol. 7, p. 100114, 2024, doi: 10.1016/j.sleepx.2024.100114.
N. Nuraeni and M. Faisal, “Classification of sleep disorders using support vector machine,” Kinetik: Game Technology, Information System, Computer Network, Computing, Electronics and Control, 2025. doi: 10.22219/kinetik.v10i1.2054.
T. R. Putri, P. N. Utamy, M. Wahyudi, Sumanto, and A. S. Budiman, “Application of support vector machine algorithm for classification of sleep disorders,” Journal of Artificial Intelligence and Engineering Applications, vol. 5, no. 1, pp. 83–89, 2025.
J. H. Kim, “Multicollinearity and misleading statistical results,” Korean Journal of Anesthesiology, vol. 72, no. 6, pp. 558–569, 2019.
J. Y.-L. Chan et al., “Mitigating the multicollinearity problem and its machine learning approach: a review,” Mathematics, vol. 10, no. 8, p. 1283, 2022.
X. Li, Y. Wang, S. Basu, K. Kumbier, and B. Yu, “A debiased MDI feature importance measure for random forests,” Advances in Neural Information Processing Systems, vol. 32, 2019.
B. D. Topçuoğlu, N. A. Lesniak, M. T. Ruffin IV, J. Wiens, and P. D. Schloss, “A framework for effective application of machine learning to microbiome-based classification problems,” mBio, vol. 11, no. 3, pp. 10–1128, 2020.
How To Cite This :
Refbacks
- There are currently no refbacks.










