2,049 research outputs found
Recommended from our members
Applying a Fuzzy-Morphological approach to complexity within management decision-making
Recommended from our members
Polygenic Adaptation to an Environmental Shift: Temporal Dynamics of Variation Under Gaussian Stabilizing Selection and Additive Effects on a Single Trait.
Predictions about the effect of natural selection on patterns of linked neutral variation are largely based on models involving the rapid fixation of unconditionally beneficial mutations. However, when phenotypes adapt to a new optimum trait value, the strength of selection on individual mutations decreases as the population adapts. Here, I use explicit forward simulations of a single trait with additive-effect mutations adapting to an "optimum shift." Detectable "hitchhiking" patterns are only apparent if (i) the optimum shifts are large with respect to equilibrium variation for the trait, (ii) mutation rates to large-effect mutations are low, and (iii) large-effect mutations rapidly increase in frequency and eventually reach fixation, which typically occurs after the population reaches the new optimum. For the parameters simulated here, partial sweeps do not appreciably affect patterns of linked variation, even when the mutations are strongly selected. The contribution of new mutations vs. standing variation to fixation depends on the mutation rate affecting trait values. Given the fixation of a strongly selected variant, patterns of hitchhiking are similar on average for the two classes of sweeps because sweeps from standing variation involving large-effect mutations are rare when the optimum shifts. The distribution of effect sizes of new mutations has little effect on the time to reach the new optimum, but reducing the mutational variance increases the magnitude of hitchhiking patterns. In general, populations reach the new optimum prior to the completion of any sweeps, and the times to fixation are longer for this model than for standard models of directional selection. The long fixation times are due to a combination of declining selection pressures during adaptation and the possibility of interference among weakly selected sites for traits with high mutation rates
GeoGauss: Strongly Consistent and Light-Coordinated OLTP for Geo-Replicated SQL Database
Multinational enterprises conduct global business that has a demand for
geo-distributed transactional databases. Existing state-of-the-art databases
adopt a sharded master-follower replication architecture. However, the
single-master serving mode incurs massive cross-region writes from clients, and
the sharded architecture requires multiple round-trip acknowledgments (e.g.,
2PC) to ensure atomicity for cross-shard transactions. These limitations drive
us to seek yet another design choice. In this paper, we propose a strongly
consistent OLTP database GeoGauss with full replica multi-master architecture.
To efficiently merge the updates from different master nodes, we propose a
multi-master OCC that unifies data replication and concurrent transaction
processing. By leveraging an epoch-based delta state merge rule and the
optimistic asynchronous execution, GeoGauss ensures strong consistency with
light-coordinated protocol and allows more concurrency with weak isolation,
which are sufficient to meet our needs. Our geo-distributed experimental
results show that GeoGauss achieves 7.06X higher throughput and 17.41X lower
latency than the state-of-the-art geo-distributed database CockroachDB on the
TPC-C benchmark
What Can Causal Networks Tell Us about Metabolic Pathways?
Graphical models describe the linear correlation structure of data and have been used to establish causal relationships among phenotypes in genetic mapping populations. Data are typically collected at a single point in time. Biological processes on the other hand are often non-linear and display time varying dynamics. The extent to which graphical models can recapitulate the architecture of an underlying biological processes is not well understood. We consider metabolic networks with known stoichiometry to address the fundamental question: “What can causal networks tell us about metabolic pathways?”. Using data from an Arabidopsis BaySha population and simulated data from dynamic models of pathway motifs, we assess our ability to reconstruct metabolic pathways using graphical models. Our results highlight the necessity of non-genetic residual biological variation for reliable inference. Recovery of the ordering within a pathway is possible, but should not be expected. Causal inference is sensitive to subtle patterns in the correlation structure that may be driven by a variety of factors, which may not emphasize the substrate-product relationship. We illustrate the effects of metabolic pathway architecture, epistasis and stochastic variation on correlation structure and graphical model-derived networks. We conclude that graphical models should be interpreted cautiously, especially if the implied causal relationships are to be used in the design of intervention strategies
Probabilistic data types
Dissertação de mestrado integrado em Engenharia InformáticaConflict-Free Replicated Data Types (CRDTs) provide deterministic outcomes from concurrent
executions. The conflict resolution mechanism uses information on the ordering of the last
operations performed, which indicates if a given operation is known by a replica, typically
using some variant of version vectors. This thesis will explore the construction of CRDTs
that use a novel stochastic mechanism that can track with high accuracy knowledge of the
occurrence of recently performed operations and with less accuracy for older operations.
The aim is to obtain better scaling properties and avoid the use of metadata that is linear on
the number of replicas.Conflict-Free Replicated Data Types (CRDTs) oferecem resultados determinísticos de execuções
concorrentes. O mecanismo de resolução de conflitos usa informação sobre a ordenação das últimas operações realizadas, que indica se uma dada operação é conhecida por uma réplica, geralmente usando alguma variante de version vectors. Esta tese explorara a construção de CRDTs que utilizam um novo mecanismo estocástico que pode identificar com alta precisão
o conhecimento sobre a ocorrência de operações realizadas recentemente e com menor
precisão para operações mais antigas. O objetivo é a obtenção de melhores propriedades de escalabilidade e evitar o uso de metadados em quantidade linear em relação ao número de réplicas
Dynamical system theory of periodically collapsing bubbles
We propose a reduced form set of two coupled continuous time equations
linking the price of a representative asset and the price of a bond, the later
quantifying the cost of borrowing. The feedbacks between asset prices and bonds
are mediated by the dependence of their "fundamental values" on past asset
prices and bond themselves. The obtained nonlinear self-referencing price
dynamics can induce, in a completely objective deterministic way, the
appearance of periodically exploding bubbles ending in crashes. Technically,
the periodically explosive bubbles arise due to the proximity of two types of
bifurcations as a function of the two key control parameters and , which
represent, respectively, the sensitivity of the fundamental asset price on past
asset and bond prices and of the fundamental bond price on past asset prices.
One is a Hopf bifurcation, when a stable focus transforms into an unstable
focus and a limit cycle appears. The other is a rather unusual bifurcation,
when a stable node and a saddle merge together and disappear, while an unstable
focus survives and a limit cycle develops. The lines, where the periodic
bubbles arise, are analogous to the critical lines of phase transitions in
statistical physics. The amplitude of bubbles and waiting times between them
respectively diverge with the critical exponents and ,
as the critical lines are approached.Comment: Latex file, 35 pages, 16 figure
Building Tunable CRDTs
Nowadays multiple large-scale services are hosted on the Internet, many of them with
millions of daily users. These systems need to scale efficiently, providing fast access
and being always available, despite failures of the servers or of the network and high
amounts of users accessing the service. As such, these services typically trade strong
consistency for high availability and low latency. However, not having strong consistency implies that conflicts arising from concurrent updates will occur, which need to be solved.
Conflict-Free Replicated Data Types (CRDTs) provide low latency and solve conflicts
automatically, ensuring eventual state convergence.
However, one shortcoming of CRDTs is the way they deal with concurrency conflicts –
usually they solve them automatically by applying a specific policy. For example, in a set, e priority to the add. These policies are limited and not adequate for all applications, especially since CRDTs don’t allow for policies dependent on the application context, such as, give priority to add if it’s element e, but give priority to remove if it’s element f.
As such, in this thesis we propose a new type of CRDTs, called tunable CRDT (t-
CRDT), which allows the programmer to specify for each operation what is the desired
conflict solving policy, by supplying a simple boolean function. The programmer can
either supply his own policy or use one of the many we provide in our t-CRDT library.
T-CRDTs solve conflicts automatically by applying the policies in each operation.
This new type of CRDTs adapt more easily to each application specific needs, as it
gives more control to the programmer while still having the main properties of CRDTs,
i.e., eventual convergence of state and low latency. With this, it is expected that more applications can start using CRDTs as their data solution and benefit from their properties
Approaches to Conflict-free Replicated Data Types
Conflict-free Replicated Data Types (CRDTs) allow optimistic replication in a
principled way. Different replicas can proceed independently, being available
even under network partitions, and always converging deterministically:
replicas that have received the same updates will have equivalent state, even
if received in different orders. After a historical tour of the evolution from
sequential data types to CRDTs, we present in detail the two main approaches to
CRDTs, operation-based and state-based, including two important variations, the
pure operation-based and the delta-state based. Intended as a tutorial for
prospective CRDT researchers and designers, it provides solid coverage of the
essential concepts, clarifying some misconceptions which frequently occur, but
also presents some novel insights gained from considerable experience in
designing both specific CRDTs and approaches to CRDTs.Comment: 36 page
- …