Random Forests and Kernel Methods

Abstract : Random forests are ensemble methods which grow trees as base learners and combine their predictions by averaging. Random forests are known for their good practical performance, particularly in high-dimensional settings. On the theoretical side, several studies highlight the potentially fruitful connection between random forests and kernel methods. In this paper, we work out in full details this connection. In particular, we show that by slightly modifying their definition, random forests can be rewritten as kernel methods (called KeRF for Kernel based on Random Forests) which are more interpretable and easier to analyze. Explicit expressions of KeRF estimates for some specific random forest models are given, together with upper bounds on their rate of consistency. We also show empirically that KeRF estimates compare favourably to random forest estimates.
Document type :
Journal articles
Complete list of metadatas

Contributor : Gestionnaire Hal-Upmc <>
Submitted on : Wednesday, January 13, 2016 - 9:49:23 AM
Last modification on : Friday, May 24, 2019 - 5:23:40 PM

Links full text



Erwan Scornet. Random Forests and Kernel Methods. IEEE Transactions on Information Theory, Institute of Electrical and Electronics Engineers, 2016, PP (99), pp.1-1. ⟨10.1109/TIT.2016.2514489⟩. ⟨hal-01255002⟩



Record views