A multi-objective approach for sustainable generative audio models

Constance Douwes; Philippe Esling; Jean-Pierre Briot

Pré-Publication, Document De Travail Année : 2021

A multi-objective approach for sustainable generative audio models

(1) , (1) , (2)

1
2

Constance Douwes

Fonction : Auteur
PersonId : 1266976
IdHAL : constance-douwes
ORCID : 0009-0000-5987-0252

Sciences et Technologies de la Musique et du Son

Philippe Esling

Fonction : Auteur
PersonId : 14916
IdHAL : philippe-esling
ORCID : 0000-0002-1655-7909
IdRef : 172472873

Sciences et Technologies de la Musique et du Son

Jean-Pierre Briot

Fonction : Auteur
PersonId : 5062
IdHAL : jean-pierre-briot
ORCID : 0000-0003-1621-6335
IdRef : 059824727

Systèmes Multi-Agents

Résumé

In recent years, the deep learning community has largely focused on the accuracy of deep generative models, resulting in impressive improvements in several research fields. However, this scientific race for quality comes at a tremendous computational cost, which incurs vast energy consumption and greenhouse gas emissions. If the current exponential growth of computational consumption persists, Artificial Intelligence (AI) will sadly become a considerable contributor to global warming. At the heart of this problem are the measures that we use as a scientific community to evaluate our work. Currently, researchers in the field of AI judge scientific works mostly based on the improvement in accuracy, log-likelihood, reconstruction or opinion scores, all of which entirely obliterates the actual computational cost of generative models. In this paper, we introduce the idea of relying on a multi-objective measure based on Pareto optimality, which simultaneously integrates the models accuracy, as well as the environmental impact of their training. By applying this measure on the current state-of-the-art in generative audio models, we show that this measure drastically changes the perceived significance of the results in the field, encouraging optimal training techniques and resource allocation. We hope that this type of measure will be widely adopted, in order to help the community to better evaluate the significance of their work, while bringing computational cost-and in fine carbon emissions-in the spotlight of AI research.

Domaines

Intelligence artificielle [cs.AI] Réseau de neurones [cs.NE] Traitement du signal et de l'image [eess.SP]

Fichier principal

2107.02621.pdf (677.44 Ko)

Origine	Fichiers produits par l'(les) auteur(s)

Jean-Pierre Briot : Connectez-vous pour contacter le contributeur

https://hal.sorbonne-universite.fr/hal-03296897

Soumis le : jeudi 22 juillet 2021-21:48:19

Dernière modification le : mercredi 30 octobre 2024-13:33:48

Archivage à long terme le : samedi 23 octobre 2021-19:41:18

Dates et versions

hal-03296897 , version 1 (22-07-2021)

Identifiants

HAL Id : hal-03296897 , version 1
ARXIV : 2107.02621

Citer

Constance Douwes, Philippe Esling, Jean-Pierre Briot. A multi-objective approach for sustainable generative audio models. 2021. ⟨hal-03296897⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS IRCAM LIP6 STMS SORBONNE-UNIVERSITE SU-SCIENCES

149 Consultations

307 Téléchargements

A multi-objective approach for sustainable generative audio models

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager