TY - GEN
T1 - Transformers for Detection of Distressed Cardiac Patients with an ICD Based on Danish Text Messages
AU - Dittmann Weimar Andersen, Julie
AU - Lomstein Jensen, Marcus
AU - Wiil, Uffe Kock
AU - Skovbakke, Søren
AU - Skov, Ole
AU - Pedersen, Susanne S.
AU - Peimankar, Abdolrahman
AU - Ebrahimi, Ali
PY - 2023
Y1 - 2023
N2 - Cardiac patients with implantable cardioverter defibrillator devices frequently exhibit signs of anxiety and depression (termed 'Distressed'). Early detection of these patients is vital for evaluation, intervention, and prevention against relapse. Considering the growing datasets relevant to distress, coupled with the evolution of machine learning methodologies, there exists a promising prospect to develop intelligent systems for the detection of distressed cardiac patients through written materials. In this context, data from two sources were collected: a questionnaire and text communication messages, acquired through the randomized ACQUIRE-ICD study of 168 participants. These textual messages were labelled as either Distressed or Non-Distressed based on questionnaire responses. Following preprocessing, the dataset facilitated the development of transformer-based classification models, including mBERT, XLM-RoBERTa, ÆLÆCTRA, and RoBERTa, as well as a hard voting ensemble method to classify patients into Distressed and Non-Distressed categories. To address imbalances in class distribution and dataset scarcity, a data augmentation method was employed. Results indicated the superior performance of the proposed hard voting ensemble, recording weighted metrics of 80% precision, 67% recall, 73% F1-score, and 75% accuracy. Notably, this ensemble correctly identified 67% of Distressed samples, while the most efficient base transformer, mBERT, identified 63% of Distressed samples.
AB - Cardiac patients with implantable cardioverter defibrillator devices frequently exhibit signs of anxiety and depression (termed 'Distressed'). Early detection of these patients is vital for evaluation, intervention, and prevention against relapse. Considering the growing datasets relevant to distress, coupled with the evolution of machine learning methodologies, there exists a promising prospect to develop intelligent systems for the detection of distressed cardiac patients through written materials. In this context, data from two sources were collected: a questionnaire and text communication messages, acquired through the randomized ACQUIRE-ICD study of 168 participants. These textual messages were labelled as either Distressed or Non-Distressed based on questionnaire responses. Following preprocessing, the dataset facilitated the development of transformer-based classification models, including mBERT, XLM-RoBERTa, ÆLÆCTRA, and RoBERTa, as well as a hard voting ensemble method to classify patients into Distressed and Non-Distressed categories. To address imbalances in class distribution and dataset scarcity, a data augmentation method was employed. Results indicated the superior performance of the proposed hard voting ensemble, recording weighted metrics of 80% precision, 67% recall, 73% F1-score, and 75% accuracy. Notably, this ensemble correctly identified 67% of Distressed samples, while the most efficient base transformer, mBERT, identified 63% of Distressed samples.
KW - Distress
KW - Ensemble
KW - Machine Learning
KW - Mental Heath
KW - Transformers
UR - https://ieeexplore.ieee.org/document/10385964
U2 - 10.1109/BIBM58861.2023.10385964
DO - 10.1109/BIBM58861.2023.10385964
M3 - Article in proceedings
T3 - Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
SP - 4275
EP - 4281
BT - 2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
A2 - Jiang, Xingpeng
A2 - Wang, Haiying
A2 - Alhajj, Reda
A2 - Hu, Xiaohua
A2 - Engel, Felix
A2 - Mahmud, Mufti
A2 - Pisanti, Nadia
A2 - Cui, Xuefeng
A2 - Song, Hong
PB - IEEE
T2 - 2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
Y2 - 5 December 2023 through 8 December 2023
ER -