Clustering-Based Numerosity Reduction for Cloud Workload Forecasting

Research output: Chapter in Book/Report/Conference proceedingsChapterpeer-review

Abstract

Finding smaller versions of large datasets that preserve the same characteristics as the original ones is becoming a central problem in Machine Learning, especially when computational resources are limited, and there is a need to reduce energy consumption. In this paper, we apply clustering techniques for wisely selecting a subset of datasets for training models for time series prediction of future workload in cloud computing. We train Bayesian Neural Networks (BNNs) and state-of-the-art probabilistic models to predict machine-level future resource demand distribution and evaluate them on unseen data from virtual machines in the Google Cloud data centre. Experiments show that selecting the training data via clustering approaches such as Self Organising Maps allows the model to achieve the same accuracy in less than half the time, requiring less than half the datasets rather than selecting more data at random. Moreover, BNNs can capture uncertainty aspects that can better inform scheduling decisions, which state-of-the-art time series forecasting methods cannot do. All the considered models achieve prediction time performance suitable for real-world scenarios.

Original languageEnglish
Title of host publicationAlgorithmic Aspects of Cloud Computing - 8th International Symposium, ALGOCLOUD 2023, Revised Selected Papers
EditorsIoannis Chatzigiannakis, Ioannis Karydis
PublisherSpringer Science and Business Media Deutschland GmbH
Pages115-132
Number of pages18
ISBN (Print)9783031493607
DOIs
Publication statusPublished - 2024
Event8th International Symposium on Algorithmic Aspects of Cloud Computing, ALGOCLOUD 2023 - Amsterdam, Netherlands
Duration: 5 Sep 20235 Sep 2023

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume14053 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference8th International Symposium on Algorithmic Aspects of Cloud Computing, ALGOCLOUD 2023
Country/TerritoryNetherlands
CityAmsterdam
Period5/09/235/09/23

Keywords

  • Bayesian Neural Network
  • Cloud Computing
  • Clustering
  • Deep Learning
  • Workload Prediction

Fingerprint

Dive into the research topics of 'Clustering-Based Numerosity Reduction for Cloud Workload Forecasting'. Together they form a unique fingerprint.

Cite this