Distributed Machine Learning and the Semblance of Trust

The utilisation of large and diverse datasets for machine learning (ML) at scale is required to promote scientific insight into many meaningful problems. However, due to data governance regulations such as GDPR as well as ethical concerns, the aggregation of personal and sensitive data is problemati...

Full description

Bibliographic Details
Main Authors:	Usynin, Dmitrii, Ziller, Alexander, Rueckert, Daniel, Passerat-Palmbach, Jonathan, Kaissis, Georgios
Format:	Text
Language:	unknown
Published:	2021
Subjects:	Computer Science - Machine Learning Computer Science - Cryptography and Security DML
Online Access:	http://arxiv.org/abs/2112.11040

id	ftarxivpreprints:oai:arXiv.org:2112.11040
record_format	openpolar
spelling	ftarxivpreprints:oai:arXiv.org:2112.11040 2023-09-05T13:19:06+02:00 Distributed Machine Learning and the Semblance of Trust Usynin, Dmitrii Ziller, Alexander Rueckert, Daniel Passerat-Palmbach, Jonathan Kaissis, Georgios 2021-12-21 http://arxiv.org/abs/2112.11040 unknown http://arxiv.org/abs/2112.11040 Computer Science - Machine Learning Computer Science - Cryptography and Security text 2021 ftarxivpreprints 2023-08-16T16:50:54Z The utilisation of large and diverse datasets for machine learning (ML) at scale is required to promote scientific insight into many meaningful problems. However, due to data governance regulations such as GDPR as well as ethical concerns, the aggregation of personal and sensitive data is problematic, which prompted the development of alternative strategies such as distributed ML (DML). Techniques such as Federated Learning (FL) allow the data owner to maintain data governance and perform model training locally without having to share their data. FL and related techniques are often described as privacy-preserving. We explain why this term is not appropriate and outline the risks associated with over-reliance on protocols that were not designed with formal definitions of privacy in mind. We further provide recommendations and examples on how such algorithms can be augmented to provide guarantees of governance, security, privacy and verifiability for a general ML audience without prior exposure to formal privacy techniques. Comment: Accepted at The Third AAAI Workshop on Privacy-Preserving Artificial Intelligence Text DML ArXiv.org (Cornell University Library)
institution	Open Polar
collection	ArXiv.org (Cornell University Library)
op_collection_id	ftarxivpreprints
language	unknown
topic	Computer Science - Machine Learning Computer Science - Cryptography and Security
spellingShingle	Computer Science - Machine Learning Computer Science - Cryptography and Security Usynin, Dmitrii Ziller, Alexander Rueckert, Daniel Passerat-Palmbach, Jonathan Kaissis, Georgios Distributed Machine Learning and the Semblance of Trust
topic_facet	Computer Science - Machine Learning Computer Science - Cryptography and Security
description	The utilisation of large and diverse datasets for machine learning (ML) at scale is required to promote scientific insight into many meaningful problems. However, due to data governance regulations such as GDPR as well as ethical concerns, the aggregation of personal and sensitive data is problematic, which prompted the development of alternative strategies such as distributed ML (DML). Techniques such as Federated Learning (FL) allow the data owner to maintain data governance and perform model training locally without having to share their data. FL and related techniques are often described as privacy-preserving. We explain why this term is not appropriate and outline the risks associated with over-reliance on protocols that were not designed with formal definitions of privacy in mind. We further provide recommendations and examples on how such algorithms can be augmented to provide guarantees of governance, security, privacy and verifiability for a general ML audience without prior exposure to formal privacy techniques. Comment: Accepted at The Third AAAI Workshop on Privacy-Preserving Artificial Intelligence
format	Text
author	Usynin, Dmitrii Ziller, Alexander Rueckert, Daniel Passerat-Palmbach, Jonathan Kaissis, Georgios
author_facet	Usynin, Dmitrii Ziller, Alexander Rueckert, Daniel Passerat-Palmbach, Jonathan Kaissis, Georgios
author_sort	Usynin, Dmitrii
title	Distributed Machine Learning and the Semblance of Trust
title_short	Distributed Machine Learning and the Semblance of Trust
title_full	Distributed Machine Learning and the Semblance of Trust
title_fullStr	Distributed Machine Learning and the Semblance of Trust
title_full_unstemmed	Distributed Machine Learning and the Semblance of Trust
title_sort	distributed machine learning and the semblance of trust
publishDate	2021
url	http://arxiv.org/abs/2112.11040
genre	DML
genre_facet	DML
op_relation	http://arxiv.org/abs/2112.11040
_version_	1776199922707595264

Distributed Machine Learning and the Semblance of Trust

Similar Items