Network Traffic Characteristics of Machine Learning Frameworks Under the Microscope

S.207-215 High computational demands of complex deep learning models led to workload distribution across multiple machines. Many frameworks for distributed machine learning (DML) have been developed and are employed in practice for orchestrating workload distribution. In this paper, we analyze and c...

Full description

Bibliographic Details
Published in:2021 17th International Conference on Network and Service Management (CNSM)
Main Authors: Zerwas, Johannes, Aykurt, Kaan, Schmid, Stefan, Blenk, Andreas
Format: Conference Object
Language:English
Published: 2021
Subjects:
004
005
DML
Online Access:https://publica.fraunhofer.de/handle/publica/413420
https://doi.org/10.23919/CNSM52442.2021.9615524
Description
Summary:S.207-215 High computational demands of complex deep learning models led to workload distribution across multiple machines. Many frameworks for distributed machine learning (DML) have been developed and are employed in practice for orchestrating workload distribution. In this paper, we analyze and compare network behaviors of three widely used state-of-the-art DML frameworks. The study reveals that traffic can largely vary across the frameworks. While some frameworks exhibit well predictable patterns, others are less structured. We further explore whether and how it is possible to relate the network traffic to the DML jobs' attributes, and present a multiple linear regression model accordingly. Our results can inform the networking community about traffic characteristics and contribute toward the generation of realistic DML traffic for simulation studies.