
27,223 research outputs found

Decision Making in Uncertain and Changing Environments

Author: Andriy Zapechelnyuk
Karl H. Schlag

Online Reinforcement Learning for Dynamic Multimedia Systems

Author: Nicholas Mastronarde
Mihaela van der Schaar
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 29/06/2009

In our previous work, we proposed a systematic cross-layer framework for dynamic multimedia systems, which allows each layer to make autonomous and foresighted decisions that maximize the system's long-term performance, while meeting the application's real-time delay constraints. The proposed solution solved the cross-layer optimization offline, under the assumption that the multimedia system's probabilistic dynamics were known a priori. In practice, however, these dynamics are unknown a priori and therefore must be learned online. In this paper, we address this problem by allowing the multimedia system layers to learn, through repeated interactions with each other, to autonomously optimize the system's long-term performance at run-time. We propose two reinforcement learning algorithms for optimizing the system under different design constraints: the first algorithm solves the cross-layer optimization in a centralized manner, and the second solves it in a decentralized manner. We analyze both algorithms in terms of their required computation, memory, and inter-layer communication overheads. After noting that the proposed reinforcement learning algorithms learn too slowly, we introduce a complementary accelerated learning algorithm that exploits partial knowledge about the system's dynamics in order to dramatically improve the system's performance. In our experiments, we demonstrate that decentralized learning can perform as well as centralized learning, while enabling the layers to act autonomously. Additionally, we show that existing application-independent reinforcement learning algorithms, and existing myopic learning algorithms deployed in multimedia systems, perform significantly worse than our proposed application-aware and foresighted learning methods.
Comment: 35 pages, 11 figures, 10 tables

arXiv.org e-Print Archive

Crossref

Learning in evolutionary environments

Author: Giovanni Dosi
Giorgio Fagiolo
Luigi Marengo
Publication date: 01/01/1996

Not available

CiteSeerX

Unitn-eprints Research

Catalogo dei prodotti della ricerca

International Institute for Applied Systems Analysis (IIASA)

Decision Making in Uncertain and Changing Environments

Author: Andriy Zapechelnyuk
Karl Schlag

We consider an agent who has to repeatedly make choices in an uncertain and changing environment, who has full information about the past, who discounts future payoffs, but who has no prior. We provide a learning algorithm that performs almost as well as the best of a given finite number of experts or benchmark strategies and does so at any point in time, provided the agent is sufficiently patient. The key is to find the appropriate degree of forgetting of the distant past. Standard learning algorithms that treat recent and distant past equally do not have the sequential epsilon-optimality property.
Keywords: Adaptive learning, experts, distribution-free, epsilon-optimality, Hannan regret
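The role of forgetting described in this abstract can be illustrated with a generic discounted exponential-weights rule over experts. This is only a minimal sketch under stated assumptions, not the authors' algorithm: the function name `discounted_hedge`, the `discount` and `learning_rate` parameters, and the two-expert test environment are all hypothetical choices made for illustration.

```python
import math

def discounted_hedge(payoffs, discount=0.9, learning_rate=1.0):
    """Generic discounted exponential weights (illustrative, hypothetical names).

    payoffs: list of rounds; each round is a list of per-expert payoffs in [0, 1].
    Returns the weight vector used in each round, before that round's payoffs
    are observed.
    """
    n_experts = len(payoffs[0])
    scores = [0.0] * n_experts  # discounted cumulative payoff of each expert
    history = []
    for round_payoffs in payoffs:
        # Softmax over the discounted scores (shifted by the max for stability).
        top = max(scores)
        raw = [math.exp(learning_rate * (s - top)) for s in scores]
        total = sum(raw)
        history.append([r / total for r in raw])
        # Forgetting step: shrink old scores, then add the newest payoffs.
        scores = [discount * s + p for s, p in zip(scores, round_payoffs)]
    return history

# Assumed changing environment: expert 0 is best for 50 rounds, then expert 1.
rounds = [[1.0, 0.0]] * 50 + [[0.0, 1.0]] * 50
with_forgetting = discounted_hedge(rounds, discount=0.9)
no_forgetting = discounted_hedge(rounds, discount=1.0)
```

In this toy environment the rule with `discount=0.9` shifts nearly all weight to expert 1 by the final round, while the rule with `discount=1.0`, which treats recent and distant past equally, still favours expert 0.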

Research Papers in Economics

Decision making in uncertain and changing environments

Author: Andriy Zapechelnyuk
Karl Schlag

Research Papers in Economics

The Value of Information for Populations in Varying Environments

Author: Olivier Rivoire
Stanislas Leibler
Publication venue: Springer Science and Business Media LLC
Publication date: 25/10/2010

The notion of information pervades informal descriptions of biological systems, but formal treatments face the problem of defining a quantitative measure of information rooted in a concept of fitness, which is itself an elusive notion. Here, we present a model of population dynamics where this problem is amenable to a mathematical analysis. In the limit where any information about future environmental variations is common to the members of the population, our model is equivalent to known models of financial investment. In this case, the population can be interpreted as a portfolio of financial assets and previous analyses have shown that a key quantity of Shannon's communication theory, the mutual information, sets a fundamental limit on the value of information. We show that this bound can be violated when accounting for features that are irrelevant in finance but inherent to biological systems, such as the stochasticity present at the individual level. This leads us to generalize the measures of uncertainty and information usually encountered in information theory.
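The financial-investment limit mentioned in this abstract can be checked numerically in the classical Kelly horse-race setting, where the gain in long-run growth rate from side information equals the mutual information. The sketch below covers only that textbook case, not the paper's population model. The function name, the joint distribution, and the odds are hypothetical values chosen for illustration.

```python
import math

def kelly_gain_and_mutual_information(joint, odds):
    """joint[x][y] = P(X=x, Y=y); odds[x] = payout per unit bet on outcome x.

    Returns (growth-rate gain from observing Y, mutual information I(X;Y)),
    both in bits, assuming proportional (Kelly) betting of all wealth.
    """
    n_x, n_y = len(joint), len(joint[0])
    p_x = [sum(joint[x]) for x in range(n_x)]
    p_y = [sum(joint[x][y] for x in range(n_x)) for y in range(n_y)]
    cells = [(x, y) for x in range(n_x) for y in range(n_y) if joint[x][y] > 0]

    # Without side information, bet the fraction b(x) = P(x) on each outcome.
    w_blind = sum(p_x[x] * math.log2(p_x[x] * odds[x])
                  for x in range(n_x) if p_x[x] > 0)
    # With side information y, bet the fraction b(x|y) = P(x|y).
    w_informed = sum(joint[x][y] * math.log2(joint[x][y] / p_y[y] * odds[x])
                     for x, y in cells)
    mutual_info = sum(joint[x][y] * math.log2(joint[x][y] / (p_x[x] * p_y[y]))
                      for x, y in cells)
    return w_informed - w_blind, mutual_info

# Assumed example: a binary outcome observed through a channel with 10% error.
joint = [[0.45, 0.05], [0.05, 0.45]]
gain, mi = kelly_gain_and_mutual_information(joint, odds=[2.0, 2.0])
```

The odds cancel in the difference of the two growth rates, so the equality between the gain and the mutual information holds for any positive odds in this setting.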

arXiv.org e-Print Archive

Crossref

Hal - Université Grenoble Alpes

A Theory of Firm Decline

Author: Gian Luca Clementi
Sonia B. Di Giannatale
Thomas F. Cooley

Research Papers in Economics

A Theory of Firm Decline

Author: Gian Luca Clementi
Sonia Di Giannatale
Thomas Cooley

We study the problem of an investor who buys an equity stake in an entrepreneurial venture, under the assumption that the former cannot monitor the latter’s operations. The dynamics implied by the optimal incentive scheme is rich and quite different from that induced by other models of repeated moral hazard. In particular, our framework generates a rationale for firm decline. As young firms accumulate capital, the claims of both investor (outside equity) and entrepreneur (inside equity) increase. At some juncture, however, even as the latter keeps on growing, invested capital and firm value start declining and so does the value of outside equity. The reason is that incentive provision is costlier the wealthier the entrepreneur (the greater is inside equity). In turn, this leads to a decline in the constrained-efficient level of effort and therefore to a drop in the return to investment.
Keywords: Principal Agent, Moral Hazard, Hidden Action, Incentives, Survival, Firm Dynamics

Research Papers in Economics