Subsampling under distributional constraints

Florian Combes; Ricardo Fraiman; Badih Ghattas

Preprint, Working Paper. Year: 2022

Florian Combes

Role: Author

Institut de Mathématiques de Marseille

Ricardo Fraiman

Role: Author

Centro de Matematica [Uruguay]

Badih Ghattas

Role: Author
PersonId: 6703
IdHAL: badih-ghattas
ORCID: 0000-0002-6160-9341
IdRef: 069625654

Institut de Mathématiques de Marseille

Abstract

Complex models are frequently employed to describe physical and mechanical phenomena. In this setting we have an input X in a general space and an output Y = f(X), where f is a very complicated function whose computational cost for every new input is very high. We are given two sets of observations of X, S1 and S2, of different sizes, such that only f(S1) is available. We tackle the problem of selecting a smaller subsample S3 ⊂ S2 on which to run the complex model f, such that the distribution of f(S3) is close to that of f(S1). We suggest three algorithms to solve this problem and show their efficiency using simulated datasets and the Airfoil Self-Noise data set.
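To make the problem statement concrete, here is a minimal Python sketch of one possible approach. It is not one of the three algorithms of the paper, which this page does not describe. It assumes a 1-nearest-neighbour proxy for f built from the known pairs (S1, f(S1)), and greedily picks points of S2 whose proxy outputs match evenly spaced quantiles of f(S1). The closeness of the two output distributions is then measured with the two-sample Kolmogorov–Smirnov statistic. All function names, the toy model `f` and the sample sizes are hypothetical and chosen for illustration only.

```python
import numpy as np

def ks_statistic(a, b):
    """Two-sample Kolmogorov-Smirnov statistic: max |F_a - F_b| over the pooled sample."""
    a, b = np.sort(a), np.sort(b)
    grid = np.concatenate([a, b])
    cdf_a = np.searchsorted(a, grid, side="right") / a.size
    cdf_b = np.searchsorted(b, grid, side="right") / b.size
    return float(np.max(np.abs(cdf_a - cdf_b)))

def nn_predict(X_ref, y_ref, X_new):
    """1-nearest-neighbour proxy for f on X_new, using only the known pairs (X_ref, y_ref)."""
    d2 = ((X_new[:, None, :] - X_ref[None, :, :]) ** 2).sum(axis=2)
    return y_ref[d2.argmin(axis=1)]

def select_subsample(X1, y1, X2, m):
    """Greedily pick m distinct rows of X2 whose proxy outputs track m quantiles of y1.

    Assumes m <= len(X2). Returns the indices of the selected rows.
    """
    proxy = nn_predict(X1, y1, X2)
    targets = np.quantile(y1, (np.arange(m) + 0.5) / m)
    available = np.ones(len(X2), dtype=bool)
    chosen = []
    for t in targets:
        # Already-selected points get an infinite distance so they are never reused.
        dist = np.where(available, np.abs(proxy - t), np.inf)
        j = int(dist.argmin())
        chosen.append(j)
        available[j] = False
    return np.array(chosen)

# Toy setting (assumption): f is cheap here so that the result can be checked.
rng = np.random.default_rng(0)
f = lambda X: np.sin(3 * X[:, 0]) + X[:, 1] ** 2
X1 = rng.normal(size=(300, 2))           # S1: f(S1) is available
X2 = rng.uniform(-3, 3, size=(2000, 2))  # S2: f has not been evaluated
y1 = f(X1)

idx = select_subsample(X1, y1, X2, m=50)  # S3 = X2[idx]
ks_selected = ks_statistic(f(X2[idx]), y1)
ks_random = ks_statistic(f(X2[rng.choice(len(X2), 50, replace=False)]), y1)
print(f"KS, selected subsample: {ks_selected:.3f}")
print(f"KS, random subsample:   {ks_random:.3f}")
```

In a real application only `f(X2[idx])` would be computed with the expensive model. The quality of the selection depends on how well the nearest-neighbour proxy approximates f in the regions of S2 that S1 covers poorly.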

Keywords

Optimal sampling; numerical models; nearest neighbours; Kolmogorov–Smirnov

Domains

Statistics [math.ST]; Optimization and Control [math.OC]

Main file

OptimalSampling.pdf (446.79 KB)

Origin: Files produced by the author(s)

https://hal.science/hal-03666898

Submitted on: Thursday, 12 May 2022, 19:22:32

Last modified on: Tuesday, 30 April 2024, 16:30:29

Dates and versions

hal-03666898, version 1 (12-05-2022)

Identifiers

HAL Id: hal-03666898, version 1

Cite

Florian Combes, Ricardo Fraiman, Badih Ghattas. Subsampling under distributional constraints. 2022. ⟨hal-03666898⟩

Collections

CNRS UNIV-AMU EC-MARSEILLE INSMI I2M I2M-2014- TDS-MACS
