TY - JOUR
T1 - Deep active learning for Interictal Ictal Injury Continuum EEG patterns
AU - Ge, Wendong
AU - Jing, Jin
AU - An, Sungtae
AU - Herlopian, Aline
AU - Ng, Marcus
AU - Struck, Aaron F.
AU - Appavu, Brian
AU - Johnson, Emily L.
AU - Osman, Gamaleldin
AU - Haider, Hiba A.
AU - Karakis, Ioannis
AU - Kim, Jennifer A.
AU - Halford, Jonathan J.
AU - Dhakar, Monica B.
AU - Sarkis, Rani A.
AU - Swisher, Christa B.
AU - Schmitt, Sarah
AU - Lee, Jong Woo
AU - Tabaeizadeh, Mohammad
AU - Rodriguez, Andres
AU - Gaspard, Nicolas
AU - Gilmore, Emily
AU - Herman, Susan T.
AU - Kaplan, Peter W.
AU - Pathmanathan, Jay
AU - Hong, Shenda
AU - Rosenthal, Eric S.
AU - Zafar, Sahar
AU - Sun, Jimeng
AU - Westover, M. Brandon
N1 - Funding Information:
MBW was supported by NIH (N IH-NINDS 1K23NS090900 , 1R01NS102190, 1R01NS102574, 1R01NS107291) . JJ, MBW, and MT received research support from SAGE therapeutics. EdA was supported by ZonMw (The Netherlands Organisation for Health Research and Development, 636310010) and SE was supported by the NIH ( F31NS105161, K24NS088568, T32MH020017, T32GM007753) , the Harvard Medical Scientist Training Program, and the Paul & Daisy Soros Fellowship.
Funding Information:
MBW was supported by NIH (<GN1>N</GN1>IH-NINDS 1K23NS090900, 1R01NS102190,1R01NS102574,1R01NS107291). JJ, MBW, and MT received research support from SAGE therapeutics. EdA was supported by ZonMw (The Netherlands Organisation for Health Research and Development, 636310010) and SE was supported by the NIH (F31NS105161,K24NS088568,T32MH020017,T32GM007753), the Harvard Medical Scientist Training Program, and the Paul & Daisy Soros Fellowship.
Publisher Copyright:
© 2020 Elsevier B.V.
PY - 2021/3/1
Y1 - 2021/3/1
N2 - Objectives: Seizures and seizure-like electroencephalography (EEG) patterns, collectively referred to as “ictal interictal injury continuum” (IIIC) patterns, are commonly encountered in critically ill patients. Automated detection is important for patient care and to enable research. However, training accurate detectors requires a large labeled dataset. Active Learning (AL) may help select informative examples to label, but the optimal AL approach remains unclear. Methods: We assembled >200,000 h of EEG from 1,454 hospitalized patients. From these, we collected 9,808 labeled and 120,000 unlabeled 10-second EEG segments. Labels included 6 IIIC patterns. In each AL iteration, a Dense-Net Convolutional Neural Network (CNN) learned vector representations for EEG segments using available labels, which were used to create a 2D embedding map. Nearest-neighbor label spreading within the embedding map was used to create additional pseudo-labeled data. A second Dense-Net was trained using real- and pseudo-labels. We evaluated several strategies for selecting candidate points for experts to label next. Finally, we compared two methods for class balancing within queries: standard balanced-based querying (SBBQ), and high confidence spread-based balanced querying (HCSBBQ). Results: Our results show: 1) Label spreading increased convergence speed for AL. 2) All query criteria produced similar results to random sampling. 3) HCSBBQ query balancing performed best. Using label spreading and HCSBBQ query balancing, we were able to train models approaching expert-level performance across all pattern categories after obtaining ∼7000 expert labels. Conclusion: Our results provide guidance regarding the use of AL to efficiently label large EEG datasets in critically ill patients.
AB - Objectives: Seizures and seizure-like electroencephalography (EEG) patterns, collectively referred to as “ictal interictal injury continuum” (IIIC) patterns, are commonly encountered in critically ill patients. Automated detection is important for patient care and to enable research. However, training accurate detectors requires a large labeled dataset. Active Learning (AL) may help select informative examples to label, but the optimal AL approach remains unclear. Methods: We assembled >200,000 h of EEG from 1,454 hospitalized patients. From these, we collected 9,808 labeled and 120,000 unlabeled 10-second EEG segments. Labels included 6 IIIC patterns. In each AL iteration, a Dense-Net Convolutional Neural Network (CNN) learned vector representations for EEG segments using available labels, which were used to create a 2D embedding map. Nearest-neighbor label spreading within the embedding map was used to create additional pseudo-labeled data. A second Dense-Net was trained using real- and pseudo-labels. We evaluated several strategies for selecting candidate points for experts to label next. Finally, we compared two methods for class balancing within queries: standard balanced-based querying (SBBQ), and high confidence spread-based balanced querying (HCSBBQ). Results: Our results show: 1) Label spreading increased convergence speed for AL. 2) All query criteria produced similar results to random sampling. 3) HCSBBQ query balancing performed best. Using label spreading and HCSBBQ query balancing, we were able to train models approaching expert-level performance across all pattern categories after obtaining ∼7000 expert labels. Conclusion: Our results provide guidance regarding the use of AL to efficiently label large EEG datasets in critically ill patients.
KW - Active learning
KW - Convolutional neural network
KW - Electroencephalography(EEG)
KW - Embedding map
KW - Ictal Interictal Injury Continuum
KW - Machine learning
KW - Seizure
UR - http://www.scopus.com/inward/record.url?scp=85094602767&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85094602767&partnerID=8YFLogxK
U2 - 10.1016/j.jneumeth.2020.108966
DO - 10.1016/j.jneumeth.2020.108966
M3 - Article
C2 - 33099261
AN - SCOPUS:85094602767
SN - 0165-0270
VL - 351
JO - Journal of Neuroscience Methods
JF - Journal of Neuroscience Methods
M1 - 108966
ER -