Deterministic and probabilistic backward error analysis of neural networks in floating-point arithmetic - Calcul Intensif, Simulation, Optimisation
Preprint, Working Paper Year: 2024

Abstract

Artificial neural networks are now used across a wide variety of tasks. Because of this rapid development, storage and computational performance have become pressing concerns: networks are sometimes very deep and comprise up to billions of parameters. Reduced precision is therefore increasingly being considered, yet until now its accuracy and robustness have mostly been approached from a practical standpoint or verified by software. The aim of this work is to provide formal tools to better understand, explain, and predict the accuracy and stability of neural networks in floating-point arithmetic. To this end, we first extend to neural networks some well-known concepts from numerical linear algebra, such as condition number and backward error. We then apply a rounding error analysis based on existing tools from numerical linear algebra to obtain both forward and backward error bounds. These include deterministic worst-case bounds as well as probabilistic bounds that are sharper on average. The bounds both ensure the proper functioning of neural networks once trained and provide recommendations on architectures and training methods to enhance their robustness.
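The contrast between worst-case and probabilistic bounds can be illustrated on the simplest building block of a network layer, an inner product. The sketch below is not taken from the paper: it is a minimal illustration that assumes IEEE 754 binary16 arithmetic simulated through Python's `struct` module, the textbook constant gamma_n = nu / (1 - nu) as the deterministic bound, and sqrt(n) u as a rough rule of thumb for the probabilistic regime (constants omitted). The helper names `fl16`, `random_fp16_vector`, `dot_fp16`, `backward_error`, and `gamma` are hypothetical.

```python
import math
import random
import struct

U16 = 2.0 ** -11  # unit roundoff of IEEE 754 binary16 (half precision)


def fl16(x):
    """Round a Python float (binary64) to the nearest binary16 value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def random_fp16_vector(n, rng):
    """Hypothetical helper: n values in [-1, 1], each exactly representable in binary16."""
    return [fl16(rng.uniform(-1.0, 1.0)) for _ in range(n)]


def dot_fp16(x, y):
    """Inner product with every product and every partial sum rounded to binary16."""
    s = 0.0
    for xi, yi in zip(x, y):
        s = fl16(s + fl16(xi * yi))
    return s


def backward_error(x, y):
    """Smallest eps such that the computed result equals (x + dx)^T y with |dx| <= eps * |x|."""
    exact = math.fsum(xi * yi for xi, yi in zip(x, y))
    scale = math.fsum(abs(xi * yi) for xi, yi in zip(x, y))
    return abs(dot_fp16(x, y) - exact) / scale


def gamma(n, u=U16):
    """Classical worst-case constant gamma_n = n*u / (1 - n*u), valid when n*u < 1."""
    if n * u >= 1.0:
        raise ValueError("gamma_n is only defined for n*u < 1")
    return n * u / (1.0 - n * u)


if __name__ == "__main__":
    rng = random.Random(0)
    # Columns: n, measured backward error, sqrt(n)*u rule of thumb, worst-case gamma_n.
    for n in (16, 64, 256, 1024):
        x, y = random_fp16_vector(n, rng), random_fp16_vector(n, rng)
        print(n, backward_error(x, y), math.sqrt(n) * U16, gamma(n))
```

On random data the measured backward error is expected to sit well below gamma_n, which is the kind of gap that motivates probabilistic bounds; the bounds derived in the paper for full networks are more involved than this single inner product.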
Main file
backward_error_analysis_neural_networks.pdf (735.74 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-04663142, version 1 (26-07-2024)

Identifiers

  • HAL Id: hal-04663142, version 1

Cite

Théo Beuzeville, Alfredo Buttari, Serge Gratton, Theo Mary. Deterministic and probabilistic backward error analysis of neural networks in floating-point arithmetic. 2024. ⟨hal-04663142⟩
683 views
169 downloads
