2,232 research outputs found

Markov Decision Processes with Applications in Wireless Sensor Networks: A Survey

Author: Alsheikh Mohammad Abu
Hoang Dinh Thai
Lin Shaowei
Niyato Dusit
Tan Hwee-Pink
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 04/01/2015

Wireless sensor networks (WSNs) consist of autonomous and resource-limited devices. The devices cooperate to monitor one or more physical phenomena within an area of interest. WSNs operate as stochastic systems because of randomness in the monitored environments. For long service time and low maintenance cost, WSNs require adaptive and robust methods to address data exchange, topology formulation, resource and power optimization, sensing coverage and object detection, and security challenges. In these problems, sensor nodes are to make optimized decisions from a set of accessible strategies to achieve design goals. This survey reviews numerous applications of the Markov decision process (MDP) framework, a powerful decision-making tool to develop adaptive algorithms and protocols for WSNs. Furthermore, various solution methods are discussed and compared to serve as a guide for using MDPs in WSNs.

arXiv.org e-Print Archive

University of Canberra Research Repository

Adaptive Load Balancing: A Study in Multi-Agent Learning

Author: Schaerf A.
Shoham Y.
Tennenholtz M.
Publication date: 01/01/1995

We study the process of multi-agent reinforcement learning in the context of load balancing in a distributed system, without use of either central coordination or explicit communication. We first define a precise framework in which to study adaptive load balancing, important features of which are its stochastic nature and the purely local information available to individual agents. Given this framework, we show illuminating results on the interplay between basic adaptive behavior parameters and their effect on system efficiency. We then investigate the properties of adaptive load balancing in heterogeneous populations, and address the issue of exploration vs. exploitation in that context. Finally, we show that naive use of communication may not improve, and might even harm system efficiency. Comment: See http://www.jair.org/ for any accompanying files

arXiv.org e-Print Archive

CiteSeerX

Archivio istituzionale della ricerca - Università degli Studi di Udine

How to split the costs and charge the travellers sharing a ride? : aligning system’s optimum with users’ equilibrium

Author: Alonso-Mora Javier
Cats Oded
Fielbaum Andres
Kucharski Rafał
Publication venue: 'Elsevier BV'
Publication date: 01/01/2022

Jagiellonian University Repository

A Theory of Mind Approach as Test-Time Mitigation Against Emergent Adversarial Communication

Author: Behzadan Vahid
Piazza Nancirose
Publication date: 14/02/2023

Multi-Agent Systems (MAS) is the study of multi-agent interactions in a shared environment. Communication for cooperation is a fundamental construct for sharing information in partially observable environments. Cooperative Multi-Agent Reinforcement Learning (CoMARL) is a learning framework where we learn agent policies either with cooperative mechanisms or policies that exhibit cooperative behavior. Explicitly, there are works on learning to communicate messages from CoMARL agents; however, non-cooperative agents, when capable of accessing a cooperative team's communication channel, have been shown to learn adversarial communication messages, sabotaging the cooperative team's performance, particularly when objectives depend on finite resources. To address this issue, we propose a technique which leverages local formulations of Theory-of-Mind (ToM) to distinguish exhibited cooperative behavior from non-cooperative behavior before accepting messages from any agent. We demonstrate the efficacy and feasibility of the proposed technique in empirical evaluations in a centralized training, decentralized execution (CTDE) CoMARL benchmark. Furthermore, while we propose our explicit ToM defense for test-time, we emphasize that ToM is a construct for designing a cognitive defense rather than being the objective of the defense. Comment: 6 pages, 7 figures

arXiv.org e-Print Archive

The lot sizing problem: A tertiary study

Author: Glock C.H.
Grosse E.H.
Ries J.M.
Publication venue: 'Elsevier BV'
Publication date: 01/01/2014

This paper provides a survey of literature reviews in the area of lot sizing. Its intention is to show which streams of research emerged from Harris' seminal lot size model, and which major achievements have been accomplished in the respective areas. We first develop the methodology of this review and then descriptively analyze the sample. Subsequently, a content-related classification scheme for lot sizing models is developed, and the reviews contained in our sample are discussed in light of this classification scheme. Our analysis shows that various extensions of Harris' lot size model were developed over the years, such as lot sizing models that include multi-stage inventory systems, incentives, or productivity issues. The aims of our tertiary study are the following: firstly, it helps primary researchers to position their own work in the literature, to reproduce the development of different types of lot sizing problems, and to find starting points if they intend to work in a new research direction. Secondly, the study identifies several topics that offer opportunities for future secondary research.

TUbiblio

City Research Online

Crossref

A feature-based comparison of the centralised versus market-based decision making under lens of environment uncertainty: Case of the mobile task allocation problem

Author: Al-Yafi Karim
Publication venue: Brunel University Brunel Business School PhD Theses
Publication date: 01/01/2012

This thesis was submitted for the degree of Doctor of Philosophy and awarded by Brunel University. Decision making problems are amongst the most common challenges facing managers at different management levels in the organisation: strategic, tactical, and operational. However, prior to reaching decisions at the operational level of the management hierarchy, operations management departments frequently have to deal with the optimisation process to evaluate the available decision alternatives. Industries with complex supply chain structures and service organisations that have to optimise the utilisation of their resources are examples. Conventionally, operational decisions were taken centrally by a decision making authority located at the top of a hierarchically-structured organisation. In order to take decisions, information related to the managed system and the affecting externalities (e.g. demand) should be globally available to the decision maker. The obtained information is then processed to reach the optimal decision. This approach usually makes extensive use of information systems (IS) containing a myriad of optimisation algorithms and meta-heuristics to process the large volume and complex nature of data. The decisions reached are then broadcast to the passive actuators of the system for execution. On the other hand, recent advancements in information and communication technologies (ICT) have made it possible to distribute decision making rights, and this has proved applicable in several sectors. The market-based approach is one such distributed decision making mechanism, in which passive actuators are delegated the right to take individual decisions matching their self-interests. Communication among the market agents takes place through market transactions regulated by auctions. The system’s global optimisation therefore arises from the aggregation of self-oriented market agents.
As opposed to the centralised approach, the main characteristics of the market-based approach are the market mechanism and local knowledge of the agents. The existence of both approaches attracted several studies to compare them in different contexts. Recently, some studies compared the centralised and market-based approaches in the context of transportation applications from an algorithm perspective. Transportation applications and routing problems are assumed to be good candidates for this comparison given the distributed nature of the system and the presence of several sources of uncertainty. Uncertainty exceptions make decisions highly vulnerable and necessitate frequent corrective interventions to maintain an efficient level of service. Motivated by the previous comparison studies, this research aims to further investigate the features of both approaches and to contrast them in the context of a distributed task allocation problem in light of environmental uncertainty. Similar applications are often faced by service industries with a mobile workforce. Contrary to the previous comparison studies that sought to compare those approaches at the mechanism level, this research attempts to identify the effect of the most significant characteristics of each approach in facing environmental uncertainty, which is reflected in this research by the arrival of dynamic tasks and the occurrence of stochastic delays. To achieve the aim of this research, a target optimisation problem from the VRP family is proposed and solved with both approaches. Given that this research does not target proposing new algorithms, two basic solution mechanisms are adopted to compare the centralised and the market-based approach. The produced solutions are executed on a dedicated multi-agent simulation system. During execution, dynamism and stochasticity are introduced.
The research findings suggest that a market-based approach is attractive to implement in highly uncertain environments when the degree of local knowledge and workers’ experience is high and when the system tends to be complex with large dimensions. It is also suggested that a centralised approach is better suited to situations where uncertainty is lower and the decision maker is able to make timely decision updates, which is in turn regulated by the size of the system at hand.

Brunel University Research Archive

Decentralized utilitarian mechanisms for scheduling games

Author: José R. Correa
Neil Olver
Richard Cole
Vahab Mirrokni
Vasilis Gkatzelis
Publication venue: 'Elsevier BV'

Crossref

Supply Chain

Publication venue: 'IntechOpen'
Publication date: 20/04/2021

Traditionally, supply chain management has meant factories, assembly lines, warehouses, transportation vehicles, and time sheets. Modern supply chain management is a highly complex, multidimensional problem set with a virtually endless number of variables for optimization. An Internet-enabled supply chain may have just-in-time delivery, precise inventory visibility, and up-to-the-minute distribution-tracking capabilities. Technology advances have enabled supply chains to become strategic weapons that can help avoid disasters, lower costs, and make money. The challenges researchers have to handle range from internal enterprise processes to external business transactions with suppliers, transporters, channels and end-users. The aim of this book is to reveal and illustrate this diversity in terms of scientific and theoretical fundamentals, prevailing concepts as well as current practical applications.

Directory of Open Access Books (DOAB)

AI and OR in management of operations: history and trends

Author: Kobbacy KAH
Rasmy MH
Vadera S
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2007

The last decade has seen a considerable growth in the use of Artificial Intelligence (AI) for operations management with the aim of finding solutions to problems that are increasing in complexity and scale. This paper begins by setting the context for the survey through a historical perspective of OR and AI. An extensive survey of applications of AI techniques for operations management, covering a total of over 1200 papers published from 1995 to 2004, is then presented. The survey utilizes Elsevier's ScienceDirect database as a source. Hence, the survey may not cover all the relevant journals but includes a sufficiently wide range of publications to make it representative of the research in the field. The papers are categorized into four areas of operations management: (a) design, (b) scheduling, (c) process planning and control and (d) quality, maintenance and fault diagnosis. Each of the four areas is categorized in terms of the AI techniques used: genetic algorithms, case-based reasoning, knowledge-based systems, fuzzy logic and hybrid techniques. The trends over the last decade are identified and discussed with respect to expected trends, and directions for future work are suggested.

University of Salford Institutional Repository