TY - GEN
T1 - Data-centric Cyber-attack Detection in Community Microgrids Using ML Techniques
AU - Trivedi, Rohit
AU - Patra, Sandipan
AU - Khadem, Shafi
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - This article proposes a data-centric strategy that emphasises data preprocessing, interpretation of machine learning models' performance, improving data quality and modifying models to deal with issues identified during the iterative loop of classification model development. The framework consists of three stages: stage-1 focuses on data collection and pre-processing, followed by data quality improvement and feature extraction in stage-2, and the final stage-3 with model hyper-parameter tuning. The concept of model interpretation is added within the framework that helps to understand the learning behaviour of machine learning (ML) models. This makes the models' performance more explainable and is known as Explainable Artificial Intelligence (XAI). For stage-1, the data is generated from a simulation of cyber-attacks in CIGRE low voltage microgrid network, which is then preprocessed. In stage-2, data is augmented using ensembled Synthetic Minority Over-sampling Technique (SMOTE) and Edited Nearest Neighbour (ENN) methods, followed by feature extraction using the Boruta python package. Finally, the hyper-parameters are tuned through a Tree-structured Parzen Estimator (TPE) algorithm. A time-series transformer model is also presented for cyber-attack detection. The findings from the proposed approach demonstrate that the model's predictive performance increases with subsequent stages.
AB - This article proposes a data-centric strategy that emphasises data preprocessing, interpretation of machine learning models' performance, improving data quality and modifying models to deal with issues identified during the iterative loop of classification model development. The framework consists of three stages: stage-1 focuses on data collection and pre-processing, followed by data quality improvement and feature extraction in stage-2, and the final stage-3 with model hyper-parameter tuning. The concept of model interpretation is added within the framework that helps to understand the learning behaviour of machine learning (ML) models. This makes the models' performance more explainable and is known as Explainable Artificial Intelligence (XAI). For stage-1, the data is generated from a simulation of cyber-attacks in CIGRE low voltage microgrid network, which is then preprocessed. In stage-2, data is augmented using ensembled Synthetic Minority Over-sampling Technique (SMOTE) and Edited Nearest Neighbour (ENN) methods, followed by feature extraction using the Boruta python package. Finally, the hyper-parameters are tuned through a Tree-structured Parzen Estimator (TPE) algorithm. A time-series transformer model is also presented for cyber-attack detection. The findings from the proposed approach demonstrate that the model's predictive performance increases with subsequent stages.
KW - Cyber Security
KW - Data Augmentation
KW - Explainable AI
KW - Feature Engineering
KW - Machine Learning
KW - Time Series Transformer
UR - https://www.scopus.com/pages/publications/85142892622
U2 - 10.1109/GlobConPT57482.2022.9938333
DO - 10.1109/GlobConPT57482.2022.9938333
M3 - Conference proceeding
AN - SCOPUS:85142892622
T3 - 2022 IEEE Global Conference on Computing, Power and Communication Technologies, GlobConPT 2022
BT - 2022 IEEE Global Conference on Computing, Power and Communication Technologies, GlobConPT 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2022 IEEE Global Conference on Computing, Power and Communication Technologies, GlobConPT 2022
Y2 - 23 September 2022 through 25 September 2022
ER -