Conference paper. Year: 2022

Optimizing Execution Time and Costs of Cross-Silo Federated Learning Applications with Datasets on different Cloud Providers

Abstract

Under the coordination of a central server, Federated Learning (FL) enables a set of clients to collaboratively train a global machine learning model without exchanging their local data. When the clients have powerful machines and store their data in private repositories, called silos, the setting is known as cross-silo FL. In this paper, we are interested in cross-silo FL where silos are located in different geographic regions of multiple cloud providers. Aiming to minimize both the financial costs and the execution time of a cross-silo FL application, we propose a model based on a mathematical formulation of a scheduling problem. It receives as input the application parameters and the resource features of the cloud providers where clients' data are stored, and returns the best assignment of clients and server to virtual machines (VMs). This formulation is part of a proposed framework for executing FL applications across different cloud providers. Taking as a use case a Tumor-Infiltrating Lymphocytes Classification problem, an FL application whose clients' datasets are spread over different cloud providers' data repositories, evaluation results show that our model is scalable and improves the execution time and financial costs of the FL application by up to 53.70% and 48.34%, respectively, in a scenario with 50 clients, executing in around 200 seconds, when compared with results where VMs are randomly selected. Experimental results with client silos in different Google (GCP) and Amazon (AWS) cloud regions also confirmed the effectiveness of our proposed model in a real multi-cloud environment.
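
To make the idea of assigning clients to VMs under a joint time and cost objective more concrete, the following is a minimal illustrative sketch. It is not the mathematical formulation proposed in the paper: it is a toy brute-force search, and every name and number in it (the VM catalogue `VMS`, the per-client times `CLIENT_BASE_TIME`, the weight `alpha`, and the normalized weighted-sum objective) is a hypothetical assumption chosen for illustration. It also ignores server placement and communication costs, which the abstract says the real model takes into account.

```python
from itertools import product

# Hypothetical VM catalogue: (name, price in $ per second, slowdown factor).
VMS = [
    ("small", 0.0001, 2.0),
    ("large", 0.0004, 1.0),
]

# Hypothetical baseline training time of each client for one round (seconds).
CLIENT_BASE_TIME = [100.0, 60.0, 80.0]


def evaluate(assignment):
    """Return (round_time, round_cost) for a tuple holding one VM index per client."""
    times = [CLIENT_BASE_TIME[c] * VMS[v][2] for c, v in enumerate(assignment)]
    # Assumption: a synchronous round lasts as long as its slowest client.
    round_time = max(times)
    # Assumption: every VM is billed for the whole duration of the round.
    round_cost = sum(VMS[v][1] * round_time for v in assignment)
    return round_time, round_cost


def best_assignment(alpha=0.5):
    """Exhaustively search all assignments.

    alpha weights normalized cost; (1 - alpha) weights normalized time.
    Returns (assignment, round_time, round_cost).
    """
    candidates = product(range(len(VMS)), repeat=len(CLIENT_BASE_TIME))
    evaluated = [(a, *evaluate(a)) for a in candidates]
    max_time = max(t for _, t, _ in evaluated)
    max_cost = max(c for _, _, c in evaluated)

    def objective(item):
        _, t, c = item
        return alpha * c / max_cost + (1 - alpha) * t / max_time

    return min(evaluated, key=objective)


if __name__ == "__main__":
    for alpha in (0.0, 0.5, 1.0):
        assignment, t, c = best_assignment(alpha)
        names = [VMS[v][0] for v in assignment]
        print(f"alpha={alpha}: {names}, time={t:.1f}s, cost=${c:.4f}")
```

Exhaustive search grows exponentially with the number of clients, so a sketch like this is only practical for a handful of them; handling scenarios such as the 50 clients mentioned in the abstract requires a proper optimization formulation and solver.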
Main file: SBACPAD_2022.pdf (329.68 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-04016538, version 1 (06-03-2023)

Cite

Rafaela C Brum, Pierre Sens, Luciana Arantes, Maria Clicia Castro, Lucia Maria de A. Drummond. Optimizing Execution Time and Costs of Cross-Silo Federated Learning Applications with Datasets on different Cloud Providers. SBAC-PAD 2022 - IEEE 34th International Symposium on Computer Architecture and High Performance Computing, Nov 2022, Bordeaux, France. pp.253-262, ⟨10.1109/SBAC-PAD55451.2022.00036⟩. ⟨hal-04016538⟩