Encoding the Latent Posterior of Bayesian Neural Networks for Uncertainty Quantification

Gianni Franchi; Andrei Bursuc; Emanuel Aldea; Séverine Dubuisson; Isabelle Bloch

doi:10.1109/TPAMI.2023.3328829

Journal article. IEEE Transactions on Pattern Analysis and Machine Intelligence. Year: 2024

Gianni Franchi

Role: Author
PersonId : 740481
IdHAL : gianni-franchi1
ORCID : 0000-0002-2184-1381
IdRef : 22427001X

Unité d'Informatique et d'Ingénierie des Systèmes

Andrei Bursuc

Role: Author

Valeo.ai

Emanuel Aldea

Role: Author

Systèmes et Applications des Technologies de l'Information et de l'Energie

Université Paris-Saclay

Séverine Dubuisson

Role: Author
PersonId : 172520
IdHAL : severine-dubuisson
ORCID : 0000-0001-7306-4134
IdRef : 061717657

Laboratoire des Sciences de l'Information et des Systèmes

Laboratoire d'Informatique et des Systèmes (LIS) (Marseille, Toulon)

Images et Modèles

Isabelle Bloch

Role: Author
PersonId : 175825
IdHAL : isabelle-bloch
ORCID : 0000-0002-6984-1532
IdRef : 031277861

Learning, Fuzzy and Intelligent systems

Abstract

Bayesian Neural Networks (BNNs) have long been considered an ideal, yet unscalable solution for improving the robustness and the predictive uncertainty of deep neural networks. While they could capture the posterior distribution of the network parameters more accurately, most BNN approaches are either limited to small networks or rely on constraining assumptions, e.g., parameter independence. These drawbacks have led to the prominence of simple but computationally heavy approaches such as Deep Ensembles, whose training and testing costs increase linearly with the number of networks. In this work we aim for efficient deep BNNs amenable to complex computer vision architectures, e.g., ResNet-50 DeepLabv3+, and tasks, e.g., semantic segmentation and image classification, with fewer assumptions on the parameters. We achieve this by leveraging variational autoencoders (VAEs) to learn the interaction and the latent distribution of the parameters at each network layer. Our approach, called Latent-Posterior BNN (LP-BNN), is compatible with the recent BatchEnsemble method, leading to ensembles that are highly efficient in computation and memory during both training and testing. LP-BNNs attain competitive results across multiple metrics in several challenging benchmarks for image classification, semantic segmentation, and out-of-distribution detection.
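
The abstract describes the mechanism only at a high level, so the following is a minimal sketch under stated assumptions rather than the authors' implementation. It assumes a BatchEnsemble-style dense layer in which each ensemble member owns a pair of rank-1 factors, and it assumes that a small linear VAE encodes one of those factors into a latent Gaussian and decodes a sampled replacement. All names (`init_layer`, `sample_factor`, `forward`, `predict`), the linear encoder and decoder, and the choice of which factor is encoded are hypothetical. The parameters are random and untrained; no training loop is shown.

```python
import numpy as np


def init_layer(d_in, d_out, n_members, d_latent, rng):
    """Random parameters for one dense layer (hypothetical layout)."""
    return {
        "W": rng.normal(0.0, 0.1, (d_in, d_out)),       # weights shared by all members
        "r": rng.normal(1.0, 0.1, (n_members, d_in)),   # per-member rank-1 factor, input side
        "s": rng.normal(1.0, 0.1, (n_members, d_out)),  # per-member rank-1 factor, output side
        # Assumed linear VAE over the factor r: encoder heads and decoder.
        "enc_mu": rng.normal(0.0, 0.1, (d_in, d_latent)),
        "enc_logvar": rng.normal(0.0, 0.1, (d_in, d_latent)),
        "dec": rng.normal(0.0, 0.1, (d_latent, d_in)),
    }


def sample_factor(p, member, rng):
    """Encode r to a latent Gaussian, sample it, and decode a new factor."""
    r = p["r"][member]
    mu = r @ p["enc_mu"]
    log_var = r @ p["enc_logvar"]
    # Reparameterization trick: z = mu + sigma * eps, eps ~ N(0, I).
    z = mu + np.exp(0.5 * log_var) * rng.standard_normal(mu.shape)
    r_hat = z @ p["dec"]
    # KL divergence between N(mu, sigma^2) and the standard normal prior.
    kl = -0.5 * np.sum(1.0 + log_var - mu**2 - np.exp(log_var))
    return r_hat, kl


def forward(p, x, member, rng):
    """BatchEnsemble-style product: y = ((x * r_hat) @ W) * s."""
    r_hat, kl = sample_factor(p, member, rng)
    y = ((x * r_hat) @ p["W"]) * p["s"][member]
    return y, kl


def predict(p, x, n_samples, rng):
    """Average over members and latent samples; the variance is a crude uncertainty proxy."""
    n_members = p["r"].shape[0]
    outs = np.stack([
        forward(p, x, m, rng)[0]
        for m in range(n_members)
        for _ in range(n_samples)
    ])
    return outs.mean(axis=0), outs.var(axis=0)
```

In this sketch the shared matrix `W` is stored once, so adding a member costs only two vectors plus the small encoder and decoder, which is the kind of memory saving the abstract contrasts with the linear cost of Deep Ensembles.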

Keywords

Training; Bayes methods; Uncertainty; Correlation; Task analysis; Gaussian distribution; Computational efficiency

Domains

Artificial Intelligence [cs.AI]

Main file

final_draftLPBNN.pdf (4.06 MB)

Origin: Files produced by the author(s)

Contributor: Séverine Dubuisson

https://hal.science/hal-04320979

Submitted on: Wednesday, April 3, 2024, 14:57:52

Last modified on: Wednesday, October 30, 2024, 13:32:07

Dates and versions

hal-04320979 , version 1 (03-04-2024)

Identifiers

HAL Id: hal-04320979, version 1
arXiv: 2012.02818
DOI: 10.1109/TPAMI.2023.3328829

Cite

Gianni Franchi, Andrei Bursuc, Emanuel Aldea, Séverine Dubuisson, Isabelle Bloch. Encoding the Latent Posterior of Bayesian Neural Networks for Uncertainty Quantification. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, 46 (4), pp.2027-2040. ⟨10.1109/TPAMI.2023.3328829⟩. ⟨hal-04320979⟩

Collections

ENSTA UNIV-TLN CNRS UNIV-AMU UNIV-CERGY ENS-CACHAN CNAM SATIE LSIS IFSTTAR LIP6 ENSTA_U2IS GENCI UNIV-PARIS-SACLAY LIS-LAB UNIV-RENNES SORBONNE-UNIVERSITE SU-SCIENCES IP_PARIS CY-TECH-SE ENS-PARIS-SACLAY GS-COMPUTER-SCIENCE INCIAM HESAM IRENAV LAMPA LCPI LABOMAP LISPEN MSMP

226 Views

24 Downloads
