2 research outputs found
Using Multi-Agent Reinforcement Learning in Auction Simulations
Game theory has been developed by scientists as a theory of strategic
interaction among players who are supposed to be perfectly rational. These
strategic interactions might have been presented in an auction, a business
negotiation, a chess game, or even in a political conflict aroused between
different agents. In this study, the strategic (rational) agents created by
reinforcement learning algorithms are supposed to be bidder agents in various
types of auction mechanisms such as British Auction, Sealed Bid Auction, and
Vickrey Auction designs. Next, the equilibrium points determined by the agents
are compared with the outcomes of the Nash equilibrium points for these
environments. The bidding strategy of the agents is analyzed in terms of
individual rationality, truthfulness (strategy-proof), and computational
efficiency. The results show that using a multi-agent reinforcement learning
strategy improves the outcomes of the auction simulations
Advancing Ad Auction Realism: Practical Insights & Modeling Implications
This paper proposes a learning model of online ad auctions that allows for
the following four key realistic characteristics of contemporary online
auctions: (1) ad slots can have different values and click-through rates
depending on users' search queries, (2) the number and identity of competing
advertisers are unobserved and change with each auction, (3) advertisers only
receive partial, aggregated feedback, and (4) payment rules are only partially
specified. We model advertisers as agents governed by an adversarial bandit
algorithm, independent of auction mechanism intricacies. Our objective is to
simulate the behavior of advertisers for counterfactual analysis, prediction,
and inference purposes. Our findings reveal that, in such richer environments,
"soft floors" can enhance key performance metrics even when bidders are drawn
from the same population. We further demonstrate how to infer advertiser value
distributions from observed bids, thereby affirming the practical efficacy of
our approach even in a more realistic auction setting