5,968 research outputs found

    Reinforcement Learning Algorithms and Complexity of Inventory Control, A Review

    Get PDF
    Driven by the ability to perform sequential decision-making in complex dynamic situations, Reinforcement Learning (RL) has quickly become a promising avenue to solve inventory control (IC) problems. The objective of this paper is to provide a comprehensive overview of the IC problems that have been effectively solved due to the application of RL. Our contributions include providing the first systematic review in this field of interest and application. We also identify potential extensions and come up with four propositions that formulate a theoretical framework that may help develop RL algorithms to solve complex IC problems. We recommend specific future research directions and novel approaches in solving IC problems

    Dynamic pricing and learning: historical origins, current research, and new directions

    Get PDF

    Online Learning of Aggregate Knowledge about Non-linear Preferences Applied to Negotiating Prices and Bundles

    Full text link
    In this paper, we consider a form of multi-issue negotiation where a shop negotiates both the contents and the price of bundles of goods with his customers. We present some key insights about, as well as a procedure for, locating mutually beneficial alternatives to the bundle currently under negotiation. The essence of our approach lies in combining aggregate (anonymous) knowledge of customer preferences with current data about the ongoing negotiation process. The developed procedure either works with already obtained aggregate knowledge or, in the absence of such knowledge, learns the relevant information online. We conduct computer experiments with simulated customers that have_nonlinear_ preferences. We show how, for various types of customers, with distinct negotiation heuristics, our procedure (with and without the necessary aggregate knowledge) increases the speed with which deals are reached, as well as the number and the Pareto efficiency of the deals reached compared to a benchmark.Comment: 10 pages, 5 eps figures, ACM Proceedings documentclass, Published in "Proc. 6th Int'l Conf. on Electronic Commerce ICEC04, Delft, The Netherlands," M. Janssen, H. Sol, R. Wagenaar (eds.). ACM Pres

    From supply chains to demand networks. Agents in retailing: the electrical bazaar

    Get PDF
    A paradigm shift is taking place in logistics. The focus is changing from operational effectiveness to adaptation. Supply Chains will develop into networks that will adapt to consumer demand in almost real time. Time to market, capacity of adaptation and enrichment of customer experience seem to be the key elements of this new paradigm. In this environment emerging technologies like RFID (Radio Frequency ID), Intelligent Products and the Internet, are triggering a reconsideration of methods, procedures and goals. We present a Multiagent System framework specialized in retail that addresses these changes with the use of rational agents and takes advantages of the new market opportunities. Like in an old bazaar, agents able to learn, cooperate, take advantage of gossip and distinguish between collaborators and competitors, have the ability to adapt, learn and react to a changing environment better than any other structure. Keywords: Supply Chains, Distributed Artificial Intelligence, Multiagent System.Postprint (published version

    Dynamic pricing models for electronic business

    Get PDF
    Dynamic pricing is the dynamic adjustment of prices to consumers depending upon the value these customers attribute to a product or service. Today’s digital economy is ready for dynamic pricing; however recent research has shown that the prices will have to be adjusted in fairly sophisticated ways, based on sound mathematical models, to derive the benefits of dynamic pricing. This article attempts to survey different models that have been used in dynamic pricing. We first motivate dynamic pricing and present underlying concepts, with several examples, and explain conditions under which dynamic pricing is likely to succeed. We then bring out the role of models in computing dynamic prices. The models surveyed include inventory-based models, data-driven models, auctions, and machine learning. We present a detailed example of an e-business market to show the use of reinforcement learning in dynamic pricing

    A Note on the Equivalence of Rationalizability Concepts in Generalized Nice Games

    Get PDF
    Moulin (1984) describes the class of nice games for which the solution concept of point-rationalizability coincides with iterated elimination of strongly dominated strategies. As a consequence nice games have the desirable property that all rationalizability concepts determine the same strategic solution. However, nice games are characterized by rather strong assumptions. For example, only single-valued best responses are admitted and the individual strategy sets have to be convex and compact subsets of the real line R1. This note shows that equivalence of all rationalizability concepts can be extended to multi-valued best response correspondences. The surprising finding is that equivalence does not hold for individual strategy sets that are compact and convex subsets of Rn with n>1.

    An integrated decision making model for dynamic pricing and inventory control of substitutable products based on demand learning

    Get PDF
    Purpose: This paper focuses on the PC industry, analyzing a PC supply chain system composed of onelarge retailer and two manufacturers. The retailer informs the suppliers of the total order quantity, namelyQ, based on demand forecast ahead of the selling season. The suppliers manufacture products accordingto the predicted quantity. When the actual demand has been observed, the retailer conducts demandlearning and determines the actual order quantity. Under the assumption that the products of the twosuppliers are one-way substitutable, an integrated decision-making model for dynamic pricing andinventory control is established.Design/methodology/approach: This paper proposes a mathematical model where a large domestichousehold appliance retailer decides the optimal original ordering quantity before the selling season and theoptimal actual ordering quantity, and two manufacturers decide the optimal wholesale price.Findings:By applying this model to a large domestic household appliance retail terminal, the authors canconclude that the model is quite feasible and effective. Meanwhile, the results of simulation analysis showthat when the product prices of two manufacturers both reduce gradually, one manufacturer will often waittill the other manufacturer reduces their price to a crucial inflection point, then their profit will show aqualitative change instead of a real-time profit-price change.Practical implications: This model can be adopted to a supply chain system composed of one largeretailer and two manufacturers, helping manufacturers better make a pricing and inventory controldecision.Originality/value: Previous research focuses on the ordering quantity directly be decided. Limited workhas considered the actual ordering quantity based on demand learning. However, this paper considers boththe optimal original ordering quantity before the selling season and the optimal actual ordering quantityfrom the perspective of the retailerPeer Reviewe
    corecore