TY - JOUR
T1 - Domain-guided Self-supervision of EEG Data Improves Downstream Classification Performance and Generalizability
AU - Wagh, Neeraj
AU - Wei, Jionghao
AU - Rawal, Samarth
AU - Berry, Brent
AU - Barnard, Leland
AU - Brinkmann, Benjamin
AU - Worrell, Gregory
AU - Jones, David
AU - Varatharajah, Yogatheesan
N1 - Publisher Copyright:
© 2021 N. Wagh, J. Wei, S. Rawal, B. Berry, L. Barnard, B. Brinkmann, G. Worrell, D. Jones & Y. Varatharajah.
PY - 2021
Y1 - 2021
N2 - This paper presents a domain-guided approach for learning representations of scalp-electroencephalograms (EEGs) without relying on expert annotations. Expert labeling of EEGs has proven to be an unscalable process with low inter-reviewer agreement because of the complex and lengthy nature of EEG recordings. Hence, there is a need for machine learning (ML) approaches that can leverage expert domain knowledge without incurring the cost of labor-intensive annotations. Self-supervised learning (SSL) has shown promise in such settings, although existing SSL efforts on EEG data do not fully exploit EEG domain knowledge. Furthermore, it is unclear to what extent SSL models generalize to unseen tasks and datasets. Here we explore whether SSL tasks derived in a domain-guided fashion can learn generalizable EEG representations. Our contributions are three-fold: 1) we propose novel SSL tasks for EEG based on the spatial similarity of brain activity, underlying behavioral states, and age-related differences; 2) we present evidence that an encoder pretrained using the proposed SSL tasks shows strong predictive performance on multiple downstream classifications; and 3) using two large EEG datasets, we show that our encoder generalizes well to multiple EEG datasets during downstream evaluations.
AB - This paper presents a domain-guided approach for learning representations of scalp-electroencephalograms (EEGs) without relying on expert annotations. Expert labeling of EEGs has proven to be an unscalable process with low inter-reviewer agreement because of the complex and lengthy nature of EEG recordings. Hence, there is a need for machine learning (ML) approaches that can leverage expert domain knowledge without incurring the cost of labor-intensive annotations. Self-supervised learning (SSL) has shown promise in such settings, although existing SSL efforts on EEG data do not fully exploit EEG domain knowledge. Furthermore, it is unclear to what extent SSL models generalize to unseen tasks and datasets. Here we explore whether SSL tasks derived in a domain-guided fashion can learn generalizable EEG representations. Our contributions are three-fold: 1) we propose novel SSL tasks for EEG based on the spatial similarity of brain activity, underlying behavioral states, and age-related differences; 2) we present evidence that an encoder pretrained using the proposed SSL tasks shows strong predictive performance on multiple downstream classifications; and 3) using two large EEG datasets, we show that our encoder generalizes well to multiple EEG datasets during downstream evaluations.
UR - http://www.scopus.com/inward/record.url?scp=85140428370&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85140428370&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85140428370
SN - 2640-3498
VL - 158
SP - 130
EP - 142
JO - Proceedings of Machine Learning Research
JF - Proceedings of Machine Learning Research
T2 - 2021 Symposium on Machine Learning for Health, ML4H 2021
Y2 - 4 December 2021
ER -