Hidden variables unseen by Random Forests

Blum, Ricardo; Hiabu, Munir; Mammen, Enno; Meyer, Joseph T.

Hidden variables unseen by Random Forests

Authors: Ricardo Blum
Munir Hiabu
Enno Mammen
Joseph T. Meyer
Publication date: 4 September 2023
Publisher

Abstract

Random Forests are widely claimed to capture interactions well. However, some simple examples suggest that they perform poorly in the presence of certain pure interactions that the conventional CART criterion struggles to capture during tree construction. We argue that alternative partitioning schemes can enhance identification of these interactions. Furthermore, we extend recent theory of Random Forests based on the notion of impurity decrease by considering probabilistic impurity decrease conditions. Within this framework, consistency of a new algorithm coined 'Random Split Random Forest' tailored to address function classes involving pure interactions is established. In a simulation study, we validate that the modifications considered enhance the model's fitting ability in scenarios where pure interactions play a crucial role

Similar works

Full text

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2309.01460

Last time updated on 12/09/2023