49,463 research outputs found
A general theory of intertemporal decision-making and the perception of time
Animals and humans make decisions based on their expected outcomes. Since
relevant outcomes are often delayed, perceiving delays and choosing between
earlier versus later rewards (intertemporal decision-making) is an essential
component of animal behavior. The myriad observations made in experiments
studying intertemporal decision-making and time perception have not yet been
rationalized within a single theory. Here we present a
theory-Training--Integrated Maximized Estimation of Reinforcement Rate
(TIMERR)--that explains a wide variety of behavioral observations made in
intertemporal decision-making and the perception of time. Our theory postulates
that animals make intertemporal choices to optimize expected reward rates over
a limited temporal window; this window includes a past integration interval
(over which experienced reward rate is estimated) and the expected delay to
future reward. Using this theory, we derive a mathematical expression for the
subjective representation of time. A unique contribution of our work is in
finding that the past integration interval directly determines the steepness of
temporal discounting and the nonlinearity of time perception. In so doing, our
theory provides a single framework to understand both intertemporal
decision-making and time perception.Comment: 37 pages, 4 main figures, 3 supplementary figure
Discounting of reward sequences: a test of competing formal models of hyperbolic discounting
Humans are known to discount future rewards hyperbolically in time. Nevertheless, a formal recursive model of hyperbolic discounting has been elusive until recently, with the introduction of the hyperbolically discounted temporal difference (HDTD) model. Prior to that, models of learning (especially reinforcement learning) have relied on exponential discounting, which generally provides poorer fits to behavioral data. Recently, it has been shown that hyperbolic discounting can also be approximated by a summed distribution of exponentially discounted values, instantiated in the ÎĽAgents model. The HDTD model and the ÎĽAgents model differ in one key respect, namely how they treat sequences of rewards. The ÎĽAgents model is a particular implementation of a Parallel discounting model, which values sequences based on the summed value of the individual rewards whereas the HDTD model contains a non-linear interaction. To discriminate among these models, we observed how subjects discounted a sequence of three rewards, and then we tested how well each candidate model fit the subject data. The results show that the Parallel model generally provides a better fit to the human data
Encoding of Marginal Utility across Time in the Human Brain
Marginal utility theory prescribes the relationship between the objective property of the magnitude of rewards and their subjective value. Despite its pervasive influence, however, there is remarkably little direct empirical evidence for such a theory of value, let alone of its neurobiological basis. We show that human preferences in an intertemporal choice task are best described by a model that integrates marginally diminishing utility with temporal discounting. Using functional magnetic resonance imaging, we show that activity in the dorsal striatum encodes both the marginal utility of rewards, over and above that which can be described by their magnitude alone, and the discounting associated with increasing time. In addition, our data show that dorsal striatum may be involved in integrating subjective valuation systems inherent to time and magnitude, thereby providing an overall metric of value used to guide choice behavior. Furthermore, during choice, we show that anterior cingulate activity correlates with the degree of difficulty associated with dissonance between value and time. Our data support an integrative architecture for decision making, revealing the neural representation of distinct subcomponents of value that may contribute to impulsivity and decisiveness
Measuring time preferences
We review research that measures time preferences—i.e., preferences over intertemporal tradeoffs. We distinguish between studies using financial flows, which we call “money earlier or later” (MEL) decisions and studies that use time-dated consumption/effort. Under different structural models, we show how to translate what MEL experiments directly measure (required rates of return for financial flows) into a discount function over utils. We summarize empirical regularities found in MEL studies and the predictive power of those studies. We explain why MEL choices are driven in part by some factors that are distinct from underlying time preferences.National Institutes of Health (NIA R01AG021650 and P01AG005842) and the Pershing Square Fund for Research in the Foundations of Human Behavior
Impulsivity and self-control during intertemporal decision making linked to the neural dynamics of reward value representation
A characteristic marker of impulsive decision making is the discounting of delayed rewards, demonstrated via choice preferences and choice-related brain activity. However, delay discounting may also arise from how subjective reward value is dynamically represented in the brain when anticipating an upcoming chosen reward. In the current study, brain activity was continuously monitored as human participants freely selected an immediate or delayed primary liquid reward and then waited for the specified delay before consuming it. The ventromedial prefrontal cortex (vmPFC) exhibited a characteristic pattern of activity dynamics during the delay period, as well as modulation during choice, that is consistent with the time-discounted coding of subjective value. The ventral striatum (VS) exhibited a similar activity pattern, but preferentially in impulsive individuals. A contrasting profile of delay-related and choice activation was observed in the anterior PFC (aPFC), but selectively in patient individuals. Functional connectivity analyses indicated that both vmPFC and aPFC exerted modulatory, but opposite, influences on VS activation. These results link behavioral impulsivity and self-control to dynamically evolving neural representations of future reward value, not just during choice, but also during postchoice delay periods
Recommended from our members
Optimal seismic upgrade timing in seaports with increasing throughput demand via real options
A real options (RO) formulation is proposed for decision-making on the timing to upgrade the seismic performance of existing seaports with increasing throughput demand in earthquake prone areas. The pay-off of the seismic upgrade investment option is estimated based on projected net earnings, repair cost, and downtime for a damaging reference seismic event having a pre-specified annual probability of occurrence. These projections inform a discrete-time RO binomial tree, following the American option valuation framework, which propagates the probability of the reference seismic event assuming Poisson temporal distribution of earthquake occurrence. The net present value of the expected annual payoff of the considered investment is used as an index supporting risk-informed decision-making discounted by the weighted average cost of capital (WACC). Numerical examples pertaining to decision makers with different capital cost, namely port authorities and terminal operators, operating in different economic environments typical of developed and developing countries are furnished to illustrate the applicability of the proposed RO formulation. It is found that high WACC and/or low throughput growth bring the optimal seismic upgrade timing forward, while earthquake consequences and upgrade cost have almost no influence on this timing
Time Discounting: Declining Impatience and Interval Effect
Most studies have not distinguished delay from intervals, so that whether the declining impatience really holds has been an open question. We conducted an experiment that explicitly distinguishes them, and confirmed the declining impatience. This implies that people make dynamically inconsistent plans. We also found the interval effect that the per-period time discount rate decreases with prolonged intervals. We show that the interval and the magnitude effects are caused, at least partially, because subjects' choices are influenced by the differential in reward amount, while Weber's law solves neither the delay nor the interval effects.
- …