Improved Coresets and Sublinear Algorithms for Power Means in Euclidean Spaces

Vincent Cohen-Addad; David Saulpic; Chris Schwiegelshohn

Conference paper. Year: 2021

Vincent Cohen-Addad

Role: Author
PersonId : 176295
IdHAL : vincent-cohen-addad
IdRef : 253136334

Google Research [Zurich]

David Saulpic

Role: Author
PersonId : 1080180
ORCID : 0000-0003-4208-8541
IdRef : 265099617

Recherche Opérationnelle

Chris Schwiegelshohn

Role: Author
PersonId : 1176319
ORCID : 0000-0002-1202-0805
IdRef : 265099897

Aarhus University [Aarhus]

Abstract

In this paper, we consider the problem of finding high-dimensional power means: given a set A of n points in R^d, find the point m that minimizes the sum, over all input points, of the Euclidean distance to m raised to the power z. Special cases of this problem include the well-known Fermat-Weber problem (or geometric median problem), where z = 1; the mean or centroid, where z = 2; and the Minimum Enclosing Ball problem, where z = ∞. We consider these problems in the big data regime. Here, we are interested in sampling as few points as possible such that we can accurately estimate m. More specifically, we consider sublinear algorithms as well as coresets for these problems. Sublinear algorithms have random query access to the set A, and the goal is to minimize the number of queries. Here, we show that O(ε^(-z-3)) samples are sufficient to achieve a (1+ε)-approximation, generalizing the results from Cohen, Lee, Miller, Pachocki, and Sidford [STOC '16] and Inaba, Katoh, and Imai [SoCG '94] to arbitrary z. Moreover, we show that this bound is nearly optimal, as any algorithm requires at least Ω(ε^(-z+1)) queries to achieve said approximation. Our second contribution is coresets for these problems, where we aim to find a small, weighted subset of the points that approximates the cost of every candidate point c ∈ R^d up to a (1 ± ε) factor. Here, we show that O(ε^(-2)) points are sufficient, improving on the O(dε^(-2)) bound by Feldman and Langberg [STOC '11] and the O(ε^(-4)) bound by Braverman, Jiang, Krauthgamer, and Wu [SODA '21].
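To make the objective concrete, the following is a minimal sketch of the power mean cost and of estimating m from a uniform random sample for the two special cases z = 2 and z = 1. It is not the algorithm from the paper: the centroid and Weiszfeld's iteration are used here as off-the-shelf solvers, and the function names, the Gaussian test data, and the sample size of 200 are assumptions chosen purely for illustration (the sample size is not derived from the stated bounds).

```python
import math
import random


def power_mean_cost(points, center, z):
    """Sum over all points of the Euclidean distance to `center`, raised to the power z."""
    return sum(math.dist(p, center) ** z for p in points)


def centroid(points):
    """Exact minimizer for z = 2: the coordinate-wise mean."""
    n, d = len(points), len(points[0])
    return [sum(p[j] for p in points) / n for j in range(d)]


def geometric_median(points, iters=50, eps=1e-12):
    """Approximate minimizer for z = 1 using Weiszfeld's iteration (illustrative solver)."""
    m = centroid(points)
    d = len(m)
    for _ in range(iters):
        # Each point is weighted by the inverse of its current distance to m.
        weights = [1.0 / max(math.dist(p, m), eps) for p in points]
        total = sum(weights)
        m = [sum(w * p[j] for w, p in zip(weights, points)) / total for j in range(d)]
    return m


# Hypothetical input: 5000 Gaussian points in R^3, and a uniform sample of 200 of them.
rng = random.Random(0)
A = [[rng.gauss(0.0, 1.0) for _ in range(3)] for _ in range(5000)]
sample = rng.sample(A, 200)

# Compare the cost on the full set A of the sample-based estimate against the
# cost of the solution computed from all of A.
ratio_z2 = power_mean_cost(A, centroid(sample), 2) / power_mean_cost(A, centroid(A), 2)
ratio_z1 = power_mean_cost(A, geometric_median(sample), 1) / power_mean_cost(
    A, geometric_median(A), 1
)
print(f"z=2 cost ratio (sample vs. full): {ratio_z2:.4f}")
print(f"z=1 cost ratio (sample vs. full): {ratio_z1:.4f}")
```

A ratio close to 1 means the point computed from the sample is nearly as good as the one computed from the whole input, which is the kind of (1+ε)-approximation guarantee the abstract refers to.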

Domains

Computer Science [cs]

Main file

NeurIPS-2021-improved-coresets-and-sublinear-algorithms-for-power-means-in-euclidean-spaces-Paper.pdf (359.56 KB)

Origin: Files produced by the author(s)

https://hal.sorbonne-universite.fr/hal-03944707

Submitted on: Wednesday, January 18, 2023, 10:56:20

Last modified on: Wednesday, October 30, 2024, 13:34:07

Long-term archiving on: Wednesday, April 19, 2023, 18:43:48

Dates and versions

hal-03944707 , version 1 (18-01-2023)

Identifiers

HAL Id : hal-03944707 , version 1

Cite

Vincent Cohen-Addad, David Saulpic, Chris Schwiegelshohn. Improved Coresets and Sublinear Algorithms for Power Means in Euclidean Spaces. Advances in Neural Information Processing Systems, Dec 2021, Virtual, France. pp.21085--21098. ⟨hal-03944707⟩

Collections

CNRS LIP6 SORBONNE-UNIVERSITE SU-SCIENCES
