962 research outputs found
On the Safety of Interpretable Machine Learning: A Maximum Deviation Approach
Interpretable and explainable machine learning has seen a recent surge of
interest. We focus on safety as a key motivation behind the surge and make the
relationship between interpretability and safety more quantitative. Toward
assessing safety, we introduce the concept of maximum deviation via an
optimization problem to find the largest deviation of a supervised learning
model from a reference model regarded as safe. We then show how
interpretability facilitates this safety assessment. For models including
decision trees, generalized linear and additive models, the maximum deviation
can be computed exactly and efficiently. For tree ensembles, which are not
regarded as interpretable, discrete optimization techniques can still provide
informative bounds. For a broader class of piecewise Lipschitz functions, we
leverage the multi-armed bandit literature to show that interpretability
produces tighter (regret) bounds on the maximum deviation. We present case
studies, including one on mortgage approval, to illustrate our methods and the
insights about models that may be obtained from deviation maximization.Comment: Published at NeurIPS 202
Robust and Heterogenous Odds Ratio: Estimating Price Sensitivity for Unbought Items
Problem definition: Mining for heterogeneous responses to an intervention is
a crucial step for data-driven operations, for instance to personalize
treatment or pricing. We investigate how to estimate price sensitivity from
transaction-level data. In causal inference terms, we estimate heterogeneous
treatment effects when (a) the response to treatment (here, whether a customer
buys a product) is binary, and (b) treatment assignments are partially observed
(here, full information is only available for purchased items).
Methodology/Results: We propose a recursive partitioning procedure to estimate
heterogeneous odds ratio, a widely used measure of treatment effect in medicine
and social sciences. We integrate an adversarial imputation step to allow for
robust inference even in presence of partially observed treatment assignments.
We validate our methodology on synthetic data and apply it to three case
studies from political science, medicine, and revenue management. Managerial
Implications: Our robust heterogeneous odds ratio estimation method is a simple
and intuitive tool to quantify heterogeneity in patients or customers and
personalize interventions, while lifting a central limitation in many revenue
management data
- …