5,968 research outputs found
Reinforcement Learning Algorithms and Complexity of Inventory Control, A Review
Driven by the ability to perform sequential decision-making in complex dynamic situations, Reinforcement Learning (RL) has quickly become a promising avenue to solve inventory control (IC) problems. The objective of this paper is to provide a comprehensive overview of the IC problems that have been effectively solved due to the application of RL. Our contributions include providing the first systematic review in this field of interest and application. We also identify potential extensions and come up with four propositions that formulate a theoretical framework that may help develop RL algorithms to solve complex IC problems. We recommend specific future research directions and novel approaches in solving IC problems
Online Learning of Aggregate Knowledge about Non-linear Preferences Applied to Negotiating Prices and Bundles
In this paper, we consider a form of multi-issue negotiation where a shop
negotiates both the contents and the price of bundles of goods with his
customers. We present some key insights about, as well as a procedure for,
locating mutually beneficial alternatives to the bundle currently under
negotiation. The essence of our approach lies in combining aggregate
(anonymous) knowledge of customer preferences with current data about the
ongoing negotiation process. The developed procedure either works with already
obtained aggregate knowledge or, in the absence of such knowledge, learns the
relevant information online. We conduct computer experiments with simulated
customers that have_nonlinear_ preferences. We show how, for various types of
customers, with distinct negotiation heuristics, our procedure (with and
without the necessary aggregate knowledge) increases the speed with which deals
are reached, as well as the number and the Pareto efficiency of the deals
reached compared to a benchmark.Comment: 10 pages, 5 eps figures, ACM Proceedings documentclass, Published in
"Proc. 6th Int'l Conf. on Electronic Commerce ICEC04, Delft, The
Netherlands," M. Janssen, H. Sol, R. Wagenaar (eds.). ACM Pres
From supply chains to demand networks. Agents in retailing: the electrical bazaar
A paradigm shift is taking place in logistics. The focus is changing from operational effectiveness to adaptation. Supply Chains will develop into networks that will adapt to consumer demand in almost real time. Time to market, capacity of adaptation and enrichment of customer experience seem to be the key elements of this new paradigm. In this environment emerging technologies like RFID (Radio Frequency ID), Intelligent Products and the Internet, are triggering a reconsideration of methods, procedures and goals. We present a Multiagent System framework specialized in retail that addresses these changes with the use of rational agents and takes advantages of the new market opportunities. Like in an old bazaar, agents able to learn, cooperate, take advantage of gossip and distinguish between collaborators and competitors, have the ability to adapt, learn and react to a changing environment better than any other structure. Keywords: Supply Chains, Distributed Artificial Intelligence, Multiagent System.Postprint (published version
Dynamic pricing models for electronic business
Dynamic pricing is the dynamic adjustment of prices to consumers
depending upon the value these customers attribute to a product or service. Today’s
digital economy is ready for dynamic pricing; however recent research has shown
that the prices will have to be adjusted in fairly sophisticated ways, based on
sound mathematical models, to derive the benefits of dynamic pricing. This article
attempts to survey different models that have been used in dynamic pricing. We
first motivate dynamic pricing and present underlying concepts, with several examples,
and explain conditions under which dynamic pricing is likely to succeed. We
then bring out the role of models in computing dynamic prices. The models surveyed
include inventory-based models, data-driven models, auctions, and machine
learning. We present a detailed example of an e-business market to show the use
of reinforcement learning in dynamic pricing
A Note on the Equivalence of Rationalizability Concepts in Generalized Nice Games
Moulin (1984) describes the class of nice games for which the solution concept of point-rationalizability coincides with iterated elimination of strongly dominated strategies. As a consequence nice games have the desirable property that all rationalizability concepts determine the same strategic solution. However, nice games are characterized by rather strong assumptions. For example, only single-valued best responses are admitted and the individual strategy sets have to be convex and compact subsets of the real line R1. This note shows that equivalence of all rationalizability concepts can be extended to multi-valued best response correspondences. The surprising finding is that equivalence does not hold for individual strategy sets that are compact and convex subsets of Rn with n>1.
An integrated decision making model for dynamic pricing and inventory control of substitutable products based on demand learning
Purpose: This paper focuses on the PC industry, analyzing a PC supply chain system composed of onelarge retailer and two manufacturers. The retailer informs the suppliers of the total order quantity, namelyQ, based on demand forecast ahead of the selling season. The suppliers manufacture products accordingto the predicted quantity. When the actual demand has been observed, the retailer conducts demandlearning and determines the actual order quantity. Under the assumption that the products of the twosuppliers are one-way substitutable, an integrated decision-making model for dynamic pricing andinventory control is established.Design/methodology/approach: This paper proposes a mathematical model where a large domestichousehold appliance retailer decides the optimal original ordering quantity before the selling season and theoptimal actual ordering quantity, and two manufacturers decide the optimal wholesale price.Findings:By applying this model to a large domestic household appliance retail terminal, the authors canconclude that the model is quite feasible and effective. Meanwhile, the results of simulation analysis showthat when the product prices of two manufacturers both reduce gradually, one manufacturer will often waittill the other manufacturer reduces their price to a crucial inflection point, then their profit will show aqualitative change instead of a real-time profit-price change.Practical implications: This model can be adopted to a supply chain system composed of one largeretailer and two manufacturers, helping manufacturers better make a pricing and inventory controldecision.Originality/value: Previous research focuses on the ordering quantity directly be decided. Limited workhas considered the actual ordering quantity based on demand learning. However, this paper considers boththe optimal original ordering quantity before the selling season and the optimal actual ordering quantityfrom the perspective of the retailerPeer Reviewe
- …