DAGC: Data-Volume-Aware Adaptive Sparsification Gradient Compression for Distributed Machine Learning in Mobile Computing

Distributed machine learning (DML) in mobile environments faces significant communication bottlenecks. Gradient compression has emerged as an effective solution to this issue, offering substantial benefits in environments with limited bandwidth and metered data. Yet, they encounter severe performanc...

Full description

Bibliographic Details
Main Authors:	Lu, Rongwei, Jiang, Yutong, Mao, Yinan, Tang, Chen, Chen, Bin, Cui, Laizhong, Wang, Zhi
Format:	Text
Language:	unknown
Published:	2023
Subjects:	Computer Science - Machine Learning DML
Online Access:	http://arxiv.org/abs/2311.07324

id	ftarxivpreprints:oai:arXiv.org:2311.07324
record_format	openpolar
spelling	ftarxivpreprints:oai:arXiv.org:2311.07324 2023-12-17T10:29:27+01:00 DAGC: Data-Volume-Aware Adaptive Sparsification Gradient Compression for Distributed Machine Learning in Mobile Computing Lu, Rongwei Jiang, Yutong Mao, Yinan Tang, Chen Chen, Bin Cui, Laizhong Wang, Zhi 2023-11-13 http://arxiv.org/abs/2311.07324 unknown http://arxiv.org/abs/2311.07324 Computer Science - Machine Learning text 2023 ftarxivpreprints 2023-11-19T02:07:21Z Distributed machine learning (DML) in mobile environments faces significant communication bottlenecks. Gradient compression has emerged as an effective solution to this issue, offering substantial benefits in environments with limited bandwidth and metered data. Yet, they encounter severe performance drop in non-IID environments due to a one-size-fits-all compression approach, which does not account for the varying data volumes across workers. Assigning varying compression ratios to workers with distinct data distributions and volumes is thus a promising solution. This study introduces an analysis of distributed SGD with non-uniform compression, which reveals that the convergence rate (indicative of the iterations needed to achieve a certain accuracy) is influenced by compression ratios applied to workers with differing volumes. Accordingly, we frame relative compression ratio assignment as an $n$-variables chi-square nonlinear optimization problem, constrained by a fixed and limited communication budget. We propose DAGC-R, which assigns the worker handling larger data volumes the conservative compression. Recognizing the computational limitations of mobile devices, we DAGC-A, which are computationally less demanding and enhances the robustness of the absolute gradient compressor in non-IID scenarios. Our experiments confirm that both the DAGC-A and DAGC-R can achieve better performance when dealing with highly imbalanced data volume distribution and restricted communication. Text DML ArXiv.org (Cornell University Library)
institution	Open Polar
collection	ArXiv.org (Cornell University Library)
op_collection_id	ftarxivpreprints
language	unknown
topic	Computer Science - Machine Learning
spellingShingle	Computer Science - Machine Learning Lu, Rongwei Jiang, Yutong Mao, Yinan Tang, Chen Chen, Bin Cui, Laizhong Wang, Zhi DAGC: Data-Volume-Aware Adaptive Sparsification Gradient Compression for Distributed Machine Learning in Mobile Computing
topic_facet	Computer Science - Machine Learning
description	Distributed machine learning (DML) in mobile environments faces significant communication bottlenecks. Gradient compression has emerged as an effective solution to this issue, offering substantial benefits in environments with limited bandwidth and metered data. Yet, they encounter severe performance drop in non-IID environments due to a one-size-fits-all compression approach, which does not account for the varying data volumes across workers. Assigning varying compression ratios to workers with distinct data distributions and volumes is thus a promising solution. This study introduces an analysis of distributed SGD with non-uniform compression, which reveals that the convergence rate (indicative of the iterations needed to achieve a certain accuracy) is influenced by compression ratios applied to workers with differing volumes. Accordingly, we frame relative compression ratio assignment as an $n$-variables chi-square nonlinear optimization problem, constrained by a fixed and limited communication budget. We propose DAGC-R, which assigns the worker handling larger data volumes the conservative compression. Recognizing the computational limitations of mobile devices, we DAGC-A, which are computationally less demanding and enhances the robustness of the absolute gradient compressor in non-IID scenarios. Our experiments confirm that both the DAGC-A and DAGC-R can achieve better performance when dealing with highly imbalanced data volume distribution and restricted communication.
format	Text
author	Lu, Rongwei Jiang, Yutong Mao, Yinan Tang, Chen Chen, Bin Cui, Laizhong Wang, Zhi
author_facet	Lu, Rongwei Jiang, Yutong Mao, Yinan Tang, Chen Chen, Bin Cui, Laizhong Wang, Zhi
author_sort	Lu, Rongwei
title	DAGC: Data-Volume-Aware Adaptive Sparsification Gradient Compression for Distributed Machine Learning in Mobile Computing
title_short	DAGC: Data-Volume-Aware Adaptive Sparsification Gradient Compression for Distributed Machine Learning in Mobile Computing
title_full	DAGC: Data-Volume-Aware Adaptive Sparsification Gradient Compression for Distributed Machine Learning in Mobile Computing
title_fullStr	DAGC: Data-Volume-Aware Adaptive Sparsification Gradient Compression for Distributed Machine Learning in Mobile Computing
title_full_unstemmed	DAGC: Data-Volume-Aware Adaptive Sparsification Gradient Compression for Distributed Machine Learning in Mobile Computing
title_sort	dagc: data-volume-aware adaptive sparsification gradient compression for distributed machine learning in mobile computing
publishDate	2023
url	http://arxiv.org/abs/2311.07324
genre	DML
genre_facet	DML
op_relation	http://arxiv.org/abs/2311.07324
_version_	1785581834885660672

DAGC: Data-Volume-Aware Adaptive Sparsification Gradient Compression for Distributed Machine Learning in Mobile Computing

Similar Items