Optimizing Execution Time and Costs of Cross-Silo Federated Learning Applications with Datasets on different Cloud Providers

Rafaela C Brum; Pierre Sens; Luciana Arantes; Maria Clicia Castro; Lucia Maria de A. Drummond

doi:10.1109/SBAC-PAD55451.2022.00036

Communication Dans Un Congrès Année : 2022

Optimizing Execution Time and Costs of Cross-Silo Federated Learning Applications with Datasets on different Cloud Providers

(1) , (2) , (2) , (3) , (1)

1
2
3

Rafaela C Brum

Fonction : Auteur
PersonId : 1234217
ORCID : 0000-0003-3740-8379
IdRef : 275944794

Instituto de Computação [Niteroi-Rio de Janeiro]

Pierre Sens

Fonction : Auteur
PersonId : 737442
IdHAL : pierre-sens
ORCID : 0000-0002-5156-7715
IdRef : 259987166

DistributEd aLgorithms and sYStems

Luciana Arantes

Fonction : Auteur
PersonId : 2197
IdHAL : luciana-arantes
ORCID : 0000-0002-0938-2004
IdRef : 195040953

DistributEd aLgorithms and sYStems

Maria Clicia Castro

Fonction : Auteur

Instituto de Matemática e Estatística (Rio do Janeiro)

Lucia Maria de A. Drummond

Fonction : Auteur

Instituto de Computação [Niteroi-Rio de Janeiro]

Résumé

Under the coordination of a central server, Federate Learning (FL) enables a set of clients to collaboratively train a global machine learning model without exchanging their local data. When such clients have powerful machines, it is called cross-silo FL, and they store their data in private repositories denoted silos. We are interested in this paper in cross-silo FL where silos are geographically located in different regions of multi-cloud providers. Thus, aiming at minimizing financial costs and execution times of a cross-silo FL application, we propose a model based on a scheduling problem mathematical formulation, which receives as input both the application parameters and the cloud providers' resource features where clients' data are stored and renders the best assignment of clients and server to virtual machines. This formulation is part of a framework proposal to execute FL applications in different cloud providers. Taking as a use case a Tumor-Infiltrating Lymphocytes Classification problem, an FL application whose clients' datasets spread over different cloud providers' data repositories, evaluation results show that our model is scalable and improves the execution time and financial costs of the FL application by up to 53.70% and 48.34% in a scenario with 50 clients, executing in around 200 seconds, when compared to results where VMs are randomly selected. Experimental results with client silos in different Google (GCP) and Amazon (AWS) cloud regions also confirmed the effectiveness of our proposed model in a real multi-cloud environment.

Mots clés

Scheduling Problem Federated Learning Multi-Cloud

Domaines

Calcul parallèle, distribué et partagé [cs.DC]

Fichier principal

SBACPAD_2022.pdf (329.68 Ko)

Origine	Fichiers produits par l'(les) auteur(s)

Pierre Sens : Connectez-vous pour contacter le contributeur

https://hal.sorbonne-universite.fr/hal-04016538

Soumis le : lundi 6 mars 2023-15:26:32

Dernière modification le : vendredi 8 novembre 2024-16:26:02

Archivage à long terme le : mercredi 7 juin 2023-18:56:12

Dates et versions

hal-04016538 , version 1 (06-03-2023)

Identifiants

HAL Id : hal-04016538 , version 1
DOI : 10.1109/SBAC-PAD55451.2022.00036

Citer

Rafaela C Brum, Pierre Sens, Luciana Arantes, Maria Clicia Castro, Lucia Maria de A. Drummond. Optimizing Execution Time and Costs of Cross-Silo Federated Learning Applications with Datasets on different Cloud Providers. SBAC-PAD 2022 - IEEE 34th International Symposium on Computer Architecture and High Performance Computing, Nov 2022, Bordeaux, France. pp.253-262, ⟨10.1109/SBAC-PAD55451.2022.00036⟩. ⟨hal-04016538⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA LIP6 INRIA2 SORBONNE-UNIVERSITE SU-SCIENCES INRIA-BRASIL

85 Consultations

47 Téléchargements

Optimizing Execution Time and Costs of Cross-Silo Federated Learning Applications with Datasets on different Cloud Providers

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager