Collaborating Foundation Models for Domain Generalized Semantic Segmentation

Yasser Benigmim; Subhankar Roy; Slim Essid; Vicky Kalogeiton; Stéphane Lathuilière

doi:10.1109/CVPR52733.2024.00300

Communication Dans Un Congrès Année : 2024

Collaborating Foundation Models for Domain Generalized Semantic Segmentation

(1, 2, 3, 4) , (5) , (3, 4) , (1) , (2, 3)

1
2
3
4
5

Yasser Benigmim

Fonction : Auteur
PersonId : 1392158

Laboratoire d'informatique de l'École polytechnique [Palaiseau]

Multimédia

Signal, Statistique et Apprentissage

Département Images, Données, Signal

Subhankar Roy

Fonction : Auteur

University of Aberdeen

Slim Essid

Fonction : Auteur
PersonId : 181234
IdHAL : slimessid
ORCID : 0000-0002-0028-327X
IdRef : 11025130X

Signal, Statistique et Apprentissage

Département Images, Données, Signal

Vicky Kalogeiton

Fonction : Auteur

Laboratoire d'informatique de l'École polytechnique [Palaiseau]

Stéphane Lathuilière

Fonction : Auteur
PersonId : 1058528
IdHAL : stephane-lathuiliere

Multimédia

Signal, Statistique et Apprentissage

Résumé

Domain Generalized Semantic Segmentation (DGSS) deals with training a model on a labeled source domain with the aim of generalizing to unseen domains during inference. Existing DGSS methods typically effectuate robust features by means of Domain Randomization (DR). Such an approach is often limited as it can only account for style diversification and not content. In this work, we take an orthogonal approach to DGSS and propose to use an assembly of CoLlaborative FOUndation models for Domain Generalized Semantic Segmentation (CLOUDS). In detail, CLOUDS is a framework that integrates FMs of various kinds: (i) CLIP backbone for its robust feature representation, (ii) generative models to diversify the content, thereby covering various modes of the possible target distribution, and (iii) Segment Anything Model (SAM) for iteratively refining the predictions of the segmentation model. Extensive experiments show that our CLOUDS excels in adapting from synthetic to real DGSS benchmarks and under varying weather conditions, notably outperforming prior methods by 5.6% and 6.7% on averaged miou, respectively. The code is available at : https://github.com/yasserben/CLOUDS

Mots clés

Training Adaptation models Semantic segmentation Clouds Collaboration Predictive models Benchmark testing Domain Adaptation Domain Generalization Semantic Segmentation Foundation Models Computer Vision Deep Learning

Domaines

Vision par ordinateur et reconnaissance de formes [cs.CV] Intelligence artificielle [cs.AI] Apprentissage [cs.LG]

Fichier principal

Benigmim_Collaborating_Foundation_Models_for_Domain_Generalized_Semantic_Segmentation_CVPR_2024_paper.pdf (3.44 Mo)

thumbnail.png (206.76 Ko)

Origine	Fichiers éditeurs autorisés sur une archive ouverte
licence	Paternité

Format	Figure, Image
licence	Paternité

Yasser Benigmim : Connectez-vous pour contacter le contributeur

https://hal.science/hal-04611902

Soumis le : vendredi 6 décembre 2024-16:04:01

Dernière modification le : jeudi 12 décembre 2024-17:28:10

Dates et versions

hal-04611902 , version 1 (06-12-2024)

Licence

Paternité

Identifiants

HAL Id : hal-04611902 , version 1
ARXIV : 2312.09788
DOI : 10.1109/CVPR52733.2024.00300

Citer

Yasser Benigmim, Subhankar Roy, Slim Essid, Vicky Kalogeiton, Stéphane Lathuilière. Collaborating Foundation Models for Domain Generalized Semantic Segmentation. The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024, Jun 2024, Seattle, WA, United States. pp.3108-3119, ⟨10.1109/CVPR52733.2024.00300⟩. ⟨hal-04611902⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

X CNRS LIX X-LIX X-DEP-INFO GENCI LTCI IDS MM S2A IP_PARIS ANR INSTITUT-MINES-TELECOM

0 Consultations

1 Téléchargements

Collaborating Foundation Models for Domain Generalized Semantic Segmentation

Résumé

Mots clés

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Altmetric

Partager