Grokking-like effects in counterfactual inference

Abstract

We show that a typical neural network, one that performs no covariate or feature re-balancing, can be as effective as explicit counterfactual methods. We adopt the TARNet architecture, a simple neural network with two heads (one for treatment, one for control), and train it with a relatively large batch size. Combined with ensembling, this produces competitive results on four counterfactual inference benchmarks: IHDP, NEWS, JOBS, and TWINS. Our results indicate that relatively simple methods may be good enough for counterfactual prediction, with quality limited mainly by hyperparameter tuning. Our analysis suggests that the reason behind the observed phenomenon may be "grokking", a recently developed theory of delayed generalization in neural networks.
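
To make the described architecture concrete, below is a minimal sketch of a TARNet-style two-headed network, assuming PyTorch; the hidden sizes, depths, and activation choices are illustrative placeholders, not the authors' exact configuration.

```python
# Minimal TARNet-style sketch (illustrative; not the authors' exact model).
# Assumptions: PyTorch; layer widths and depths are placeholders.
import torch
import torch.nn as nn

class TARNet(nn.Module):
    def __init__(self, n_features: int, hidden: int = 200):
        super().__init__()
        # Shared representation trunk over the covariates.
        self.trunk = nn.Sequential(
            nn.Linear(n_features, hidden), nn.ELU(),
            nn.Linear(hidden, hidden), nn.ELU(),
        )
        # Two outcome heads: one for treated units, one for controls.
        def head() -> nn.Sequential:
            return nn.Sequential(
                nn.Linear(hidden, hidden), nn.ELU(),
                nn.Linear(hidden, 1),
            )
        self.head_treated = head()
        self.head_control = head()

    def forward(self, x: torch.Tensor) -> tuple[torch.Tensor, torch.Tensor]:
        phi = self.trunk(x)
        return self.head_treated(phi), self.head_control(phi)

# During training, each unit's loss uses only the head matching its
# observed treatment; the predicted individual treatment effect is
# head_treated(x) - head_control(x).
```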
