TY - GEN
T1 - Machine learning without a feature set for detecting bursts in the EEG of preterm infants
AU - O'Toole, John M.
AU - Boylan, Geraldine B.
N1 - Publisher Copyright: © 2019 IEEE.
PY - 2019/7
Y1 - 2019/7
AB - Deep neural networks enable learning directly from the data without the domain knowledge needed to construct a feature set. This approach has been extremely successful in almost all machine learning applications. We propose a new framework that also learns directly from the data, without extracting a feature set. We apply this framework to detecting bursts in the EEG of premature infants. The EEG is recorded within days of birth in a cohort of infants without significant brain injury and born at <30 weeks of gestation. The method first transforms the time-domain signal to the time-frequency domain and then trains a machine learning method, a gradient boosting machine, on each time-slice of the time-frequency distribution. We control for oversampling the time-frequency distribution with a significant reduction (<1%) in memory and computational complexity. The proposed method achieves similar accuracy to an existing multi-feature approach: area under the receiver operating characteristic curve of 0.98 (with 95% confidence interval of 0.96 to 0.99), with a median sensitivity of 95% and median specificity of 94%. The proposed framework presents an accurate, simple, and computationally efficient implementation as an alternative to both the deep learning approach and the manual generation of a feature set.
UR - https://www.scopus.com/pages/publications/85077846398
U2 - 10.1109/EMBC.2019.8856533
DO - 10.1109/EMBC.2019.8856533
M3 - Conference proceeding
C2 - 31947170
AN - SCOPUS:85077846398
T3 - Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS
SP - 5799
EP - 5802
BT - 2019 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2019
Y2 - 23 July 2019 through 27 July 2019
ER -