44 research outputs found
Contextual Search in the Presence of Irrational Agents
We study contextual search, a generalization of binary search in higher
dimensions, which captures settings such as feature-based dynamic pricing.
Standard game-theoretic formulations of this problem assume that agents act in
accordance with a specific behavioral model. In practice, however, some agents
may not prescribe to the dominant behavioral model or may act in ways that are
seemingly arbitrarily irrational. Existing algorithms heavily depend on the
behavioral model being (approximately) accurate for all agents and have poor
performance in the presence of even a few such arbitrarily irrational agents.
We initiate the study of contextual search when some of the agents can behave
in ways inconsistent with the underlying behavioral model. In particular, we
provide two algorithms, one built on robustifying multidimensional binary
search methods and one on translating the setting to a proxy setting
appropriate for gradient descent. Our techniques draw inspiration from learning
theory, game theory, high-dimensional geometry, and convex analysis.Comment: Compared to the first version titled "Corrupted Multidimensional
Binary Search: Learning in the Presence of Irrational Agents", this version
provides a broader scope of behavioral models of irrationality, specifies how
the results apply to different loss functions, and discusses the power and
limitations of additional algorithmic approache
Contextual Dynamic Pricing with Strategic Buyers
Personalized pricing, which involves tailoring prices based on individual
characteristics, is commonly used by firms to implement a consumer-specific
pricing policy. In this process, buyers can also strategically manipulate their
feature data to obtain a lower price, incurring certain manipulation costs.
Such strategic behavior can hinder firms from maximizing their profits. In this
paper, we study the contextual dynamic pricing problem with strategic buyers.
The seller does not observe the buyer's true feature, but a manipulated feature
according to buyers' strategic behavior. In addition, the seller does not
observe the buyers' valuation of the product, but only a binary response
indicating whether a sale happens or not. Recognizing these challenges, we
propose a strategic dynamic pricing policy that incorporates the buyers'
strategic behavior into the online learning to maximize the seller's cumulative
revenue. We first prove that existing non-strategic pricing policies that
neglect the buyers' strategic behavior result in a linear regret
with the total time horizon, indicating that these policies are not better
than a random pricing policy. We then establish that our proposed policy
achieves a sublinear regret upper bound of . Importantly, our
policy is not a mere amalgamation of existing dynamic pricing policies and
strategic behavior handling algorithms. Our policy can also accommodate the
scenario when the marginal cost of manipulation is unknown in advance. To
account for it, we simultaneously estimate the valuation parameter and the cost
parameter in the online pricing policy, which is shown to also achieve an
regret bound. Extensive experiments support our theoretical
developments and demonstrate the superior performance of our policy compared to
other pricing policies that are unaware of the strategic behaviors