Interpretable Random Forests via Rule Extraction

Clément Bénard; Gérard Biau; Sébastien da Veiga; Erwan Scornet

doi:10.48550/arXiv.2004.14841

Conference paper, Year: 2021

Clément Bénard

Role: Author

Laboratoire de Probabilités, Statistique et Modélisation

Safran Tech

Gérard Biau

Role: Author

Laboratoire de Probabilités, Statistique et Modélisation

Sébastien da Veiga

Role: Author
PersonId: 742899
IdHAL: sebastien-da-veiga
ORCID: 0009-0004-1637-7942
IdRef: 191001163

Safran Tech

Erwan Scornet

Role: Author
PersonId: 520
IdHAL: erwan-scornet

Centre de Mathématiques Appliquées de l'Ecole polytechnique

Abstract

We introduce SIRUS (Stable and Interpretable RUle Set) for regression, a stable rule learning algorithm that takes the form of a short and simple list of rules. State-of-the-art learning algorithms are often referred to as "black boxes" because of the high number of operations involved in their prediction process. Despite the strong predictive performance of these algorithms, their lack of interpretability may be highly restrictive for applications with critical decisions at stake. On the other hand, algorithms with a simple structure (typically decision trees, rule algorithms, or sparse linear models) are well known for their instability. This undesirable feature makes the conclusions of the data analysis unreliable and turns out to be a strong operational limitation. This motivates the design of SIRUS, which combines a simple structure with remarkably stable behavior when data is perturbed. The algorithm is based on random forests, the predictive accuracy of which is preserved. We demonstrate the efficiency of the method both empirically (through experiments) and theoretically (with the proof of its asymptotic stability). Our R/C++ software implementation sirus is available from CRAN.
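
The abstract does not spell out how rules are obtained from the forest, so the following is only a toy illustration of the general idea of keeping the rules that recur across many randomized trees. It is not the sirus package or its API. Everything in it is an assumption made for the sketch: the depth-1 trees, the restriction of split points to empirical quartiles, the hypothetical helper names (`quantile_thresholds`, `best_stump`, `frequent_rules`), the `min_frequency` cutoff, and the synthetic data.

```python
import random
from collections import Counter


def quantile_thresholds(values, q=4):
    """Interior empirical quantiles (q=4 gives the three quartiles)."""
    s = sorted(values)
    return [s[(len(s) * k) // q] for k in range(1, q)]


def sse(values):
    """Sum of squared deviations from the mean."""
    m = sum(values) / len(values)
    return sum((v - m) ** 2 for v in values)


def best_stump(X, y, features, thresholds):
    """Best single split 'x[j] < t' among the candidate features, by squared error."""
    best, best_loss = None, float("inf")
    for j in features:
        for t in thresholds[j]:
            left = [yi for xi, yi in zip(X, y) if xi[j] < t]
            right = [yi for xi, yi in zip(X, y) if xi[j] >= t]
            if left and right:
                loss = sse(left) + sse(right)
                if loss < best_loss:
                    best, best_loss = (j, t), loss
    return best


def frequent_rules(X, y, n_trees=200, min_frequency=0.2, mtry=2, seed=0):
    """Fit depth-1 trees on bootstrap samples; keep rules chosen often enough."""
    rng = random.Random(seed)
    n, p = len(X), len(X[0])
    # Candidate split points are computed once, on the full sample (assumption).
    thresholds = [quantile_thresholds([row[j] for row in X]) for j in range(p)]
    counts = Counter()
    for _ in range(n_trees):
        idx = [rng.randrange(n) for _ in range(n)]   # bootstrap resample
        feats = rng.sample(range(p), mtry)           # random feature subset
        rule = best_stump([X[i] for i in idx], [y[i] for i in idx], feats, thresholds)
        if rule is not None:
            counts[rule] += 1
    freqs = {rule: c / n_trees for rule, c in counts.items()}
    return {rule: f for rule, f in freqs.items() if f > min_frequency}


# Synthetic data (assumption): only feature 0 carries signal, with a step at 0.5.
data_rng = random.Random(1)
X = [[data_rng.random() for _ in range(3)] for _ in range(200)]
y = [2.0 * (row[0] >= 0.5) + data_rng.gauss(0.0, 0.5) for row in X]

rules = frequent_rules(X, y)
for (j, t), f in sorted(rules.items(), key=lambda kv: -kv[1]):
    print(f"if x[{j}] < {t:.3f}: selected in {f:.0%} of the trees")
```

Because the candidate split points are fixed in advance and only frequently selected rules are kept, the rule on the informative feature is chosen by a large share of the trees in this toy setting, which gives an informal feel for why a frequency-based selection can yield a short, repeatable rule list.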

Domains

Statistics [math.ST]; Artificial Intelligence [cs.AI]

Main file

sirus_reg_final.pdf (648.95 KB)

Origin: Files produced by the author(s)

https://hal.sorbonne-universite.fr/hal-02557113

Submitted on: Monday, February 8, 2021, 17:07:14

Last modified on: Tuesday, December 3, 2024, 15:14:03

Dates and versions

hal-02557113, version 1 (2020-04-28)

hal-02557113, version 2 (2020-06-08)

hal-02557113, version 3 (2020-10-06)

hal-02557113, version 4 (2021-02-08)

Identifiers

HAL Id: hal-02557113, version 4
arXiv: 2004.14841
DOI: 10.48550/arXiv.2004.14841

Cite

Clément Bénard, Gérard Biau, Sébastien da Veiga, Erwan Scornet. Interpretable Random Forests via Rule Extraction. 24th International Conference on Artificial Intelligence and Statistics, Apr 2021, Online, France. pp.937-945, ⟨10.48550/arXiv.2004.14841⟩. ⟨hal-02557113v4⟩

Collections

X CNRS INRIA INSMI X-CMAP X-DEP-MATHA CMAP LPSM SORBONNE-UNIVERSITE SU-SCIENCES IP_PARIS UP-SCIENCES
