18 research outputs found
Social welfare and profit maximization from revealed preferences
Consider the seller's problem of finding optimal prices for her
(divisible) goods when faced with a set of consumers, given that she can
only observe their purchased bundles at posted prices, i.e., revealed
preferences. We study both social welfare and profit maximization with revealed
preferences. Although social welfare maximization is a seemingly non-convex
optimization problem in prices, we show that (i) it can be reduced to a dual
convex optimization problem in prices, and (ii) the revealed preferences can be
interpreted as supergradients of the concave conjugate of valuation, with which
subgradients of the dual function can be computed. We thereby obtain a simple
subgradient-based algorithm for strongly concave valuations and convex cost,
with query complexity , where is the additive
difference between the social welfare induced by our algorithm and the optimum
social welfare. We also study social welfare maximization under the online
setting, specifically the random permutation model, where consumers arrive
one-by-one in a random order. For the case where consumer valuations can be
arbitrary continuous functions, we propose a price posting mechanism that
achieves an expected social welfare up to an additive factor of
from the maximum social welfare. Finally, for profit maximization (which may be
non-convex in simple cases), we give nearly matching upper and lower bounds on
the query complexity for separable valuations and cost (i.e., each good can be
treated independently)
Personalized dynamic pricing with machine learning: high-dimensional features and heterogeneous elasticity
We consider a seller who can dynamically adjust the price of a product at the individual customer level, by utilizing information about customers’ characteristics encoded as a d-dimensional feature vector. We assume a personalized demand model, parameters of which depend on s out of the d features. The seller initially does not know the relationship between the customer features and the product demand but learns this through sales observations over a selling horizon of T periods. We prove that the seller’s expected regret, that is, the revenue loss against a clairvoyant who knows the underlying demand relationship, is at least of order S√T under any admissible policy. We then design a near-optimal pricing policy for a semiclairvoyant seller (who knows which s of the d features are in the demand model) who achieves an expected regret of order S√T log T. We extend this policy to a more realistic setting, where the seller does not know the true demand predictors, and show that this policy has an expected regret of order S√T (log d + log T , which is also near-optimal. Finally, we test our theory on simulated data and on a data set from an online auto loan company in the United States. On both data sets, our experimentation-based pricing policy is superior to intuitive and/or widely-practiced customized pricing methods, such as myopic pricing and segment-then-optimize policies. Furthermore, our policy improves upon the loan company’s historical pricing decisions by 47% in expected revenue over a six-month period
Dynamic pricing using Thompson Sampling with fuzzy events
\u3cp\u3eIn this paper we study a repeated posted-price auction between a single seller and a single buyer that interact for a finite number of periods or rounds. In each round, the seller offers the same item for sale to the buyer. The seller announces a price and the buyer can decide to buy the item at the announced price or the buyer can decide not to buy the item. In this paper we study the problem from the perspective of the buyer who only gets to observe a stochastic measurement of the valuation of the item after he buys the item. Furthermore, in our model the buyer uses fuzzy sets to describe his satisfaction with the observed valuations and he uses fuzzy sets to describe his dissatisfaction with the observed price. In our problem, the buyer makes decisions based on the probability of a fuzzy event. His decision to buy or not depends on whether the satisfaction from having a high enough valuation for the item out weights the dissatisfaction of the quoted price. We propose an algorithm based on Thompson Sampling and demonstrate that it performs well using numerical experiments.\u3c/p\u3