Title | Gaussian process-based kernel as a diagnostic model for prediction of type 2 diabetes mellitus risk using non-linear heart rate variability features. | ||
Author | Shashikant, R; Chaskar, Uttam; Phadke, Leena; Patil, Chetankumar | ||
Journal | Biomed Eng Lett | Publication Year/Month | 2021-Aug |
PMID | 34350053 | PMCID | PMC8316508 |
Affiliation | 1.Department of Instrumentation and Control, College of Engineering, Pune, India. GRID: grid.32056.32. ISNI: 0000 0001 2190 9326;Department of Instrumentation and Control, College of Engineering, Pune, India. GRID: grid.32056.32. ISNI: 0000 0001 2190 9326;Department of Physiology, Smt. Kashibai Navale Medical College and General Hospital, Pune, India. GRID: grid.459956.2. ISNI: 0000 0004 1804 2495;Department of Instrumentation and Control, College of Engineering, Pune, India. GRID: grid.32056.32. ISNI: 0000 0001 2190 9326. |
The main objective of the study was to develop a low-cost, non-invasive diagnostic model for the early prediction of T2DM risk and validation of this model on patients. The model was designed based on the machine learning classification technique using non-linear Heart rate variability (HRV) features. The electrocardiogram of the healthy subjects (n = 35) and T2DM subjects (n = 100) were recorded in the supine position for 15 min, and HRV features were extracted. The significant non-linear HRV features were identified through statistical analysis. It was found that Poincare plot features (SD1 and SD2) can differentiate the T2DM subject data from healthy subject data. Several machine learning classifiers, such as Linear Discriminant Analysis (LDA), Quadratic Discriminant Analysis, Naive Bayes, and Gaussian Process Classifier (GPC), have classified the data based on the cross-validation approach. A GP classifier was implemented using three kernels, namely radial basis, linear, and polynomial kernel, considering the ability to handle the non-linear data. The classifier performance was evaluated and compared using performance metrics such as accuracy(AC), sensitivity(SN), specificity(SP), precision(PR), F1 score, and area under the receiver operating characteristic curve(AUC). Initially, all non-linear HRV features were selected for classification, but the specificity of the model was the limitation. Thus, only two Poincare plot features were used to design the diagnostic model. Our diagnostic model shows the performance using GPC based linear kernel as AC of 92.59%, SN of 96.07%, SP of 81.81%, PR of 94.23%, F1 score of 0.95, and AUC of 0.89, which are more extensive compared to other classification models. Further, the diagnostic model was deployed on the hardware module. Its performance on unknown/test data was validated on 65 subjects (healthy n = 15 and T2DM n = 50). Considering the desirable performance of the diagnostic model, it can be used as an initial screening test tool for a healthcare practitioner to predict T2DM risk.