TY - JOUR
T1 - Scaling convolutional neural networks achieves expert level seizure detection in neonatal EEG
AU - Hogan, Robert
AU - Mathieson, Sean R.
AU - Luca, Aurel
AU - Ventura, Soraia
AU - Griffin, Sean
AU - Boylan, Geraldine B.
AU - O’Toole, John M.
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025/12
Y1 - 2025/12
N2 - Neonatal seizures require urgent treatment, but often go undetected without expert EEG monitoring. We have developed and validated a seizure detection model using retrospective EEG data from 332 neonates. A convolutional neural network was trained and tested on over 50,000 hours (n = 202) of annotated single-channel EEG containing 12,402 seizure events. This model was then validated on two independent multi-reviewer datasets (n = 51 and n = 79). Increasing data and model size improved performance: Matthews correlation coefficient (MCC) and Pearson’s correlation (r) increased by up to 50% (15%) with data (model) scaling. The largest model (21m parameters) achieved state-of-the-art on an open-access dataset (MCC = 0.764, r = 0.824, and AUC = 0.982). This model also attained expert-level performance on both validation sets, a first in this field, with no significant difference in inter-rater agreement when the model replaces an expert (∣Δκ∣ < 0.094, p > 0.05).
AB - Neonatal seizures require urgent treatment, but often go undetected without expert EEG monitoring. We have developed and validated a seizure detection model using retrospective EEG data from 332 neonates. A convolutional neural network was trained and tested on over 50,000 hours (n = 202) of annotated single-channel EEG containing 12,402 seizure events. This model was then validated on two independent multi-reviewer datasets (n = 51 and n = 79). Increasing data and model size improved performance: Matthews correlation coefficient (MCC) and Pearson’s correlation (r) increased by up to 50% (15%) with data (model) scaling. The largest model (21m parameters) achieved state-of-the-art on an open-access dataset (MCC = 0.764, r = 0.824, and AUC = 0.982). This model also attained expert-level performance on both validation sets, a first in this field, with no significant difference in inter-rater agreement when the model replaces an expert (∣Δκ∣ < 0.094, p > 0.05).
UR - https://www.scopus.com/pages/publications/85218168604
U2 - 10.1038/s41746-024-01416-x
DO - 10.1038/s41746-024-01416-x
M3 - Article
AN - SCOPUS:85218168604
SN - 2398-6352
VL - 8
JO - npj Digital Medicine
JF - npj Digital Medicine
IS - 1
M1 - 17
ER -