A multi-objective approach for sustainable generative audio models - Sorbonne Université Access content directly
Preprints, Working Papers, ... Year : 2021

A multi-objective approach for sustainable generative audio models


In recent years, the deep learning community has largely focused on the accuracy of deep generative models, resulting in impressive improvements in several research fields. However, this scientific race for quality comes at a tremendous computational cost, which incurs vast energy consumption and greenhouse gas emissions. If the current exponential growth of computational consumption persists, Artificial Intelligence (AI) will sadly become a considerable contributor to global warming. At the heart of this problem are the measures that we use as a scientific community to evaluate our work. Currently, researchers in the field of AI judge scientific works mostly based on the improvement in accuracy, log-likelihood, reconstruction or opinion scores, all of which entirely obliterates the actual computational cost of generative models. In this paper, we introduce the idea of relying on a multi-objective measure based on Pareto optimality, which simultaneously integrates the models accuracy, as well as the environmental impact of their training. By applying this measure on the current state-of-the-art in generative audio models, we show that this measure drastically changes the perceived significance of the results in the field, encouraging optimal training techniques and resource allocation. We hope that this type of measure will be widely adopted, in order to help the community to better evaluate the significance of their work, while bringing computational cost-and in fine carbon emissions-in the spotlight of AI research.
Fichier principal
Vignette du fichier
2107.02621.pdf (677.44 Ko) Télécharger le fichier
Origin : Files produced by the author(s)

Dates and versions

hal-03296897 , version 1 (22-07-2021)



Constance Douwes, Philippe Esling, Jean-Pierre Briot. A multi-objective approach for sustainable generative audio models. 2021. ⟨hal-03296897⟩
103 View
262 Download



Gmail Facebook X LinkedIn More