Title | Cross Dataset Analysis for Generalizability of HRV-Based Stress Detection Models. | ||
Author | Benchekroun, Mouna; Velmovitsky, Pedro Elkind; Istrate, Dan; Zalc, Vincent; Morita, Plinio Pelegrini; Lenne, Dominique | ||
Journal | Sensors (Basel) | Publication Year/Month | 2023-Feb |
PMID | 36850407 | PMCID | PMC9960690 |
Affiliation | 1. Biomechanics and Bioengineering Lab, University of Technology of Compiegne (UMR CNRS 7338), 60200 Compiegne, France. |
Stress is an increasingly prevalent mental health condition across the world. In Europe, for example, stress is considered one of the most common health problems, and over USD 300 billion is spent on stress treatments annually. Monitoring, identifying and preventing stress are therefore of the utmost importance. While most stress monitoring is carried out through self-reporting, several studies now address stress detection from physiological signals using Artificial Intelligence algorithms. However, the generalizability of these models is only rarely discussed. The main goal of this work is to provide a proof-of-concept monitoring tool that explores the generalization capabilities of Heart Rate Variability (HRV)-based machine learning models. To this end, two machine learning models, logistic regression and random forest, are used to analyze and classify stress in two datasets that differ in protocol, stressors and recording devices. First, the models are evaluated using leave-one-subject-out cross-validation with train and test samples drawn from the same dataset. Next, a cross-dataset validation is performed: leave-one-subject-out models are trained on the Multi-modal Dataset for Real-time, Continuous Stress Detection from Physiological Signals and validated on the University of Waterloo stress dataset. While both the logistic regression and random forest models achieve good classification results in the independent dataset analysis, the random forest model demonstrates better generalization capabilities, with a stable F1 score of 61%. This indicates that the random forest can be used to generalize HRV-based stress detection models, which can lead to better analyses in mental health and medical research through training and integrating different models.
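The two evaluation schemes described in the abstract can be illustrated with a minimal sketch in Python using scikit-learn. Everything below is an assumption made for illustration and is not the authors' pipeline: the data are synthetic random numbers standing in for HRV feature tables, the helper names (`make_synthetic_hrv`, `loso_f1`, `cross_dataset_f1`) are hypothetical, the hyperparameters are arbitrary, and the cross-dataset step is simplified to a single model trained on all subjects of one dataset and scored on the other.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import f1_score
from sklearn.model_selection import LeaveOneGroupOut
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler


def make_synthetic_hrv(n_subjects, windows_per_subject, shift, seed):
    """Hypothetical stand-in for an HRV feature table (synthetic, not real data)."""
    rng = np.random.default_rng(seed)
    n = n_subjects * windows_per_subject
    y = rng.integers(0, 2, size=n)             # 0 = baseline, 1 = stress
    X = rng.normal(size=(n, 4)) + shift         # shift mimics a device/protocol offset
    X[:, 0] -= 1.5 * y                          # assumed effect: stress lowers feature 0
    groups = np.repeat(np.arange(n_subjects), windows_per_subject)  # subject IDs
    return X, y, groups


def loso_f1(model_factory, X, y, groups):
    """Leave-one-subject-out CV within one dataset; returns the mean F1 over folds."""
    scores = []
    for train_idx, test_idx in LeaveOneGroupOut().split(X, y, groups):
        model = model_factory()
        model.fit(X[train_idx], y[train_idx])
        pred = model.predict(X[test_idx])
        scores.append(f1_score(y[test_idx], pred, zero_division=0))
    return float(np.mean(scores))


def cross_dataset_f1(model_factory, X_train, y_train, X_test, y_test):
    """Simplified cross-dataset check: fit on dataset A, score on dataset B."""
    model = model_factory()
    model.fit(X_train, y_train)
    return float(f1_score(y_test, model.predict(X_test), zero_division=0))


# Model settings are illustrative assumptions, not taken from the paper.
models = {
    "logistic_regression": lambda: make_pipeline(
        StandardScaler(), LogisticRegression(max_iter=1000)
    ),
    "random_forest": lambda: RandomForestClassifier(n_estimators=100, random_state=0),
}

# Dataset A and dataset B differ by a constant feature offset (assumed).
X_a, y_a, g_a = make_synthetic_hrv(n_subjects=8, windows_per_subject=20, shift=0.0, seed=0)
X_b, y_b, g_b = make_synthetic_hrv(n_subjects=6, windows_per_subject=20, shift=0.5, seed=1)

results = {}
for name, factory in models.items():
    results[name] = {
        "loso_f1_A": loso_f1(factory, X_a, y_a, g_a),
        "cross_f1_A_to_B": cross_dataset_f1(factory, X_a, y_a, X_b, y_b),
    }
    print(name, results[name])
```

Grouping the folds by subject ID keeps all windows from one person on the same side of the split, which is the point of leave-one-subject-out evaluation. The scores printed here reflect only the synthetic data and say nothing about the results reported in the paper.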