Towards Trust-Based Data Weighting in Machine Learning

Research output: Chapter in Book/Report/Conference proceedingsChapterpeer-review

Abstract

In distributed environments, data for Machine Learning (ML) applications may be generated from numerous sources and devices, and traverse a cloud-edge continuum via a variety of protocols, using multiple security schemes and equipment types. While ML models typically benefit from using large training sets, not all data can be equally trusted. In this work, we examine data trust as a factor in creating ML models, and explore an approach using annotated trust metadata to contribute to data weighting in generating ML models. We assess the feasibility of this approach using well-known datasets for both linear regression and classification problems, demonstrating the benefit of including trust as a factor when using heterogeneous datasets. We discuss the potential benefits of this approach, and the opportunity it presents for improved data utilisation and processing.

Original languageEnglish
Title of host publication2023 IEEE 31st International Conference on Network Protocols, ICNP 2023
PublisherIEEE Computer Society
ISBN (Electronic)9798350303223
DOIs
Publication statusPublished - 2023
Event31st IEEE International Conference on Network Protocols, ICNP 2023 - Reykjavik, Iceland
Duration: 10 Oct 202313 Oct 2023

Publication series

NameProceedings - International Conference on Network Protocols, ICNP
ISSN (Print)1092-1648

Conference

Conference31st IEEE International Conference on Network Protocols, ICNP 2023
Country/TerritoryIceland
CityReykjavik
Period10/10/2313/10/23

Keywords

  • clustering
  • data confidence fabric
  • data weighting
  • edge computing
  • linear regression
  • machine learning

Fingerprint

Dive into the research topics of 'Towards Trust-Based Data Weighting in Machine Learning'. Together they form a unique fingerprint.

Cite this