Preprints, Working Papers, ... Year: 2020

Pipelined Model Parallelism: Complexity Results and Memory Considerations

Abstract

The training phase of Deep Neural Networks has become a major consumer of computing resources, and given the resulting volume of computation, it is crucial to perform it efficiently on parallel architectures. Data parallelism remains the most widely used approach, but its requirement to replicate all the weights on every computing resource creates memory problems at the level of each node and collective-communication overheads at the level of the platform. In this context, model parallelism, which distributes the different layers of the network over the computing nodes, is an attractive alternative: it spreads the weights across nodes (alleviating memory problems) and avoids large collective communications, since only forward activations are communicated. To be efficient, however, it must be combined with a pipelined / streaming approach, which in turn induces new memory costs. The goal of this paper is to model these memory costs in detail, to analyze the complexity of the associated throughput optimization problem under memory constraints, and to show that this optimization problem can be formalized as an Integer Linear Program (ILP).
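To give a concrete flavor of what "formalizing the problem as an ILP" can look like, the sketch below is a deliberately simplified toy model, not the formulation from the paper: it assigns a chain of layers to contiguous pipeline stages so that each stage fits within a per-device memory budget, while minimizing the bottleneck stage time (a proxy for the pipeline period). All layer costs, the memory budget, and the solver choice (PuLP with its bundled CBC backend) are illustrative assumptions.

```python
# Minimal illustrative ILP (NOT the paper's formulation): place a chain of layers
# onto P devices as contiguous stages, respect a memory budget per device, and
# minimize the slowest stage time (the pipeline period).
import pulp

compute = [4.0, 6.0, 3.0, 5.0, 2.0]   # hypothetical per-layer compute times
memory  = [2.0, 3.0, 1.0, 2.0, 1.0]   # hypothetical per-layer memory footprints
P, M = 2, 5.0                          # number of devices, memory budget per device
L = len(compute)

prob = pulp.LpProblem("toy_pipeline_partition", pulp.LpMinimize)
# x[l][p] = 1 if layer l is placed on device p
x = [[pulp.LpVariable(f"x_{l}_{p}", cat="Binary") for p in range(P)] for l in range(L)]
period = pulp.LpVariable("period", lowBound=0)
prob += period  # objective: minimize the bottleneck stage time

for l in range(L):  # each layer is assigned to exactly one device
    prob += pulp.lpSum(x[l][p] for p in range(P)) == 1
for p in range(P):
    prob += pulp.lpSum(memory[l] * x[l][p] for l in range(L)) <= M        # memory constraint
    prob += pulp.lpSum(compute[l] * x[l][p] for l in range(L)) <= period  # bottleneck bound
# contiguity: if layer l is on device p, then layer l+1 is on a device with index >= p
for l in range(L - 1):
    for p in range(P):
        prob += pulp.lpSum(x[l + 1][q] for q in range(p, P)) >= x[l][p]

prob.solve(pulp.PULP_CBC_CMD(msg=False))
for p in range(P):
    layers = [l for l in range(L) if x[l][p].value() > 0.5]
    print(f"device {p}: layers {layers}")
print("period:", period.value())
```

The paper's actual model additionally accounts for the memory of stored activations induced by pipelining; this sketch only illustrates the general ILP ingredients (binary assignment variables, per-device memory constraints, and a bottleneck objective).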

Dates and versions

hal-02968802, version 1 (16-10-2020)
hal-02968802, version 2 (16-10-2020)
hal-02968802, version 3 (18-02-2021)

Identifiers

  • HAL Id: hal-02968802, version 2

Cite

Olivier Beaumont, Lionel Eyraud-Dubois, Alena Shilova. Pipelined Model Parallelism: Complexity Results and Memory Considerations. 2020. ⟨hal-02968802v2⟩