Privacy preserving distributed machine learning with federated learning

Edge computing and distributed machine learning have advanced to a level that can revolutionize a particular organization. Distributed devices such as the Internet of Things (IoT) often produce a large amount of data, eventually resulting in big data that can be vital in uncovering hidden patterns,...

Full description

Bibliographic Details
Main Authors:	Pathum Chamikara, Peter Bertok, Ibrahim Khalil, D. Liu, Seyit Camtepe
Format:	Article in Journal/Newspaper
Language:	unknown
Published:	2021
Subjects:	Distributed computing and systems software not elsewhere classified Data privacy Distributed data privacy Distributed machine learning Federated learning Privacy preserving machine learning DML
Online Access:	https://figshare.com/articles/journal_contribution/Privacy_preserving_distributed_machine_learning_with_federated_learning/27539226

_version_	1821499334318882816
author	Pathum Chamikara Peter Bertok Ibrahim Khalil D. Liu Seyit Camtepe
author_facet	Pathum Chamikara Peter Bertok Ibrahim Khalil D. Liu Seyit Camtepe
author_sort	Pathum Chamikara
collection	Research from RMIT University
description	Edge computing and distributed machine learning have advanced to a level that can revolutionize a particular organization. Distributed devices such as the Internet of Things (IoT) often produce a large amount of data, eventually resulting in big data that can be vital in uncovering hidden patterns, and other insights in numerous fields such as healthcare, banking, and policing. Data related to areas such as healthcare and banking can contain potentially sensitive data that can become public if they are not appropriately sanitized. Federated learning (FedML) is a recently developed distributed machine learning (DML) approach that tries to preserve privacy by bringing the learning of an ML model to data owners’ devices. However, literature shows different attack methods such as membership inference that exploit the vulnerabilities of ML models as well as the coordinating servers to retrieve private data. Hence, FedML needs additional measures to guarantee data privacy. Furthermore, big data often requires more resources than available in a standard computer. This paper addresses these issues by proposing a distributed perturbation algorithm named as DISTPAB, for privacy preservation of horizontally partitioned data. DISTPAB alleviates computational bottlenecks by distributing the task of privacy preservation utilizing the asymmetry of resources of a distributed environment, which can have resource-constrained devices as well as high-performance computers. Experiments show that DISTPAB provides high accuracy, high efficiency, high scalability, and high attack resistance. Further experiments on privacy-preserving FedML show that DISTPAB is an excellent solution to stop privacy leaks in DML while preserving high data utility.
format	Article in Journal/Newspaper
genre	DML
genre_facet	DML
id	ftrmitunivfig:oai:figshare.com:article/27539226
institution	Open Polar
language	unknown
op_collection_id	ftrmitunivfig
op_relation	10779/rmit.27539226.v1 https://figshare.com/articles/journal_contribution/Privacy_preserving_distributed_machine_learning_with_federated_learning/27539226
op_rights	All rights reserved
publishDate	2021
record_format	openpolar
spelling	ftrmitunivfig:oai:figshare.com:article/27539226 2025-01-16T21:38:48+00:00 Privacy preserving distributed machine learning with federated learning Pathum Chamikara Peter Bertok Ibrahim Khalil D. Liu Seyit Camtepe 2021-01-01T00:00:00Z https://figshare.com/articles/journal_contribution/Privacy_preserving_distributed_machine_learning_with_federated_learning/27539226 unknown 10779/rmit.27539226.v1 https://figshare.com/articles/journal_contribution/Privacy_preserving_distributed_machine_learning_with_federated_learning/27539226 All rights reserved Distributed computing and systems software not elsewhere classified Data privacy Distributed data privacy Distributed machine learning Federated learning Privacy preserving machine learning Text Journal contribution 2021 ftrmitunivfig 2025-01-03T08:17:28Z Edge computing and distributed machine learning have advanced to a level that can revolutionize a particular organization. Distributed devices such as the Internet of Things (IoT) often produce a large amount of data, eventually resulting in big data that can be vital in uncovering hidden patterns, and other insights in numerous fields such as healthcare, banking, and policing. Data related to areas such as healthcare and banking can contain potentially sensitive data that can become public if they are not appropriately sanitized. Federated learning (FedML) is a recently developed distributed machine learning (DML) approach that tries to preserve privacy by bringing the learning of an ML model to data owners’ devices. However, literature shows different attack methods such as membership inference that exploit the vulnerabilities of ML models as well as the coordinating servers to retrieve private data. Hence, FedML needs additional measures to guarantee data privacy. Furthermore, big data often requires more resources than available in a standard computer. This paper addresses these issues by proposing a distributed perturbation algorithm named as DISTPAB, for privacy preservation of horizontally partitioned data. DISTPAB alleviates computational bottlenecks by distributing the task of privacy preservation utilizing the asymmetry of resources of a distributed environment, which can have resource-constrained devices as well as high-performance computers. Experiments show that DISTPAB provides high accuracy, high efficiency, high scalability, and high attack resistance. Further experiments on privacy-preserving FedML show that DISTPAB is an excellent solution to stop privacy leaks in DML while preserving high data utility. Article in Journal/Newspaper DML Research from RMIT University
spellingShingle	Distributed computing and systems software not elsewhere classified Data privacy Distributed data privacy Distributed machine learning Federated learning Privacy preserving machine learning Pathum Chamikara Peter Bertok Ibrahim Khalil D. Liu Seyit Camtepe Privacy preserving distributed machine learning with federated learning
title	Privacy preserving distributed machine learning with federated learning
title_full	Privacy preserving distributed machine learning with federated learning
title_fullStr	Privacy preserving distributed machine learning with federated learning
title_full_unstemmed	Privacy preserving distributed machine learning with federated learning
title_short	Privacy preserving distributed machine learning with federated learning
title_sort	privacy preserving distributed machine learning with federated learning
topic	Distributed computing and systems software not elsewhere classified Data privacy Distributed data privacy Distributed machine learning Federated learning Privacy preserving machine learning
topic_facet	Distributed computing and systems software not elsewhere classified Data privacy Distributed data privacy Distributed machine learning Federated learning Privacy preserving machine learning
url	https://figshare.com/articles/journal_contribution/Privacy_preserving_distributed_machine_learning_with_federated_learning/27539226

Privacy preserving distributed machine learning with federated learning

Similar Items