85,617 research outputs found
Neural Feedback Scheduling of Real-Time Control Tasks
Many embedded real-time control systems suffer from resource constraints and
dynamic workload variations. Although optimal feedback scheduling schemes are
in principle capable of maximizing the overall control performance of
multitasking control systems, most of them induce excessively large
computational overheads associated with the mathematical optimization routines
involved and hence are not directly applicable to practical systems. To
optimize the overall control performance while minimizing the overhead of
feedback scheduling, this paper proposes an efficient feedback scheduling
scheme based on feedforward neural networks. Using the optimal solutions
obtained offline by mathematical optimization methods, a back-propagation (BP)
neural network is designed to adapt online the sampling periods of concurrent
control tasks with respect to changes in computing resource availability.
Numerical simulation results show that the proposed scheme can reduce the
computational overhead significantly while delivering almost the same overall
control performance as compared to optimal feedback scheduling.Comment: To appear in International Journal of Innovative Computing,
Information and Contro
Hierarchical Parallelisation of Functional Renormalisation Group Calculations -- hp-fRG
The functional renormalisation group (fRG) has evolved into a versatile tool
in condensed matter theory for studying important aspects of correlated
electron systems. Practical applications of the method often involve a high
numerical effort, motivating the question in how far High Performance Computing
(HPC) can leverage the approach. In this work we report on a multi-level
parallelisation of the underlying computational machinery and show that this
can speed up the code by several orders of magnitude. This in turn can extend
the applicability of the method to otherwise inaccessible cases. We exploit
three levels of parallelisation: Distributed computing by means of Message
Passing (MPI), shared-memory computing using OpenMP, and vectorisation by means
of SIMD units (single-instruction-multiple-data). Results are provided for two
distinct High Performance Computing (HPC) platforms, namely the IBM-based
BlueGene/Q system JUQUEEN and an Intel Sandy-Bridge-based development cluster.
We discuss how certain issues and obstacles were overcome in the course of
adapting the code. Most importantly, we conclude that this vast improvement can
actually be accomplished by introducing only moderate changes to the code, such
that this strategy may serve as a guideline for other researcher to likewise
improve the efficiency of their codes
Theano: new features and speed improvements
Theano is a linear algebra compiler that optimizes a user's
symbolically-specified mathematical computations to produce efficient low-level
implementations. In this paper, we present new features and efficiency
improvements to Theano, and benchmarks demonstrating Theano's performance
relative to Torch7, a recently introduced machine learning library, and to
RNNLM, a C++ library targeted at recurrent neural networks.Comment: Presented at the Deep Learning Workshop, NIPS 201
Exact Algorithm for Sampling the 2D Ising Spin Glass
A sampling algorithm is presented that generates spin glass configurations of
the 2D Edwards-Anderson Ising spin glass at finite temperature, with
probabilities proportional to their Boltzmann weights. Such an algorithm
overcomes the slow dynamics of direct simulation and can be used to study
long-range correlation functions and coarse-grained dynamics. The algorithm
uses a correspondence between spin configurations on a regular lattice and
dimer (edge) coverings of a related graph: Wilson's algorithm [D. B. Wilson,
Proc. 8th Symp. Discrete Algorithms 258, (1997)] for sampling dimer coverings
on a planar lattice is adapted to generate samplings for the dimer problem
corresponding to both planar and toroidal spin glass samples. This algorithm is
recursive: it computes probabilities for spins along a "separator" that divides
the sample in half. Given the spins on the separator, sample configurations for
the two separated halves are generated by further division and assignment. The
algorithm is simplified by using Pfaffian elimination, rather than Gaussian
elimination, for sampling dimer configurations. For n spins and given floating
point precision, the algorithm has an asymptotic run-time of O(n^{3/2}); it is
found that the required precision scales as inverse temperature and grows only
slowly with system size. Sample applications and benchmarking results are
presented for samples of size up to n=128^2, with fixed and periodic boundary
conditions.Comment: 18 pages, 10 figures, 1 table; minor clarification
Automatic Termination Analysis of Programs Containing Arithmetic Predicates
For logic programs with arithmetic predicates, showing termination is not
easy, since the usual order for the integers is not well-founded. A new method,
easily incorporated in the TermiLog system for automatic termination analysis,
is presented for showing termination in this case.
The method consists of the following steps: First, a finite abstract domain
for representing the range of integers is deduced automatically. Based on this
abstraction, abstract interpretation is applied to the program. The result is a
finite number of atoms abstracting answers to queries which are used to extend
the technique of query-mapping pairs. For each query-mapping pair that is
potentially non-terminating, a bounded (integer-valued) termination function is
guessed. If traversing the pair decreases the value of the termination
function, then termination is established. Simple functions often suffice for
each query-mapping pair, and that gives our approach an edge over the classical
approach of using a single termination function for all loops, which must
inevitably be more complicated and harder to guess automatically. It is worth
noting that the termination of McCarthy's 91 function can be shown
automatically using our method.
In summary, the proposed approach is based on combining a finite abstraction
of the integers with the technique of the query-mapping pairs, and is
essentially capable of dividing a termination proof into several cases, such
that a simple termination function suffices for each case. Consequently, the
whole process of proving termination can be done automatically in the framework
of TermiLog and similar systems.Comment: Appeared also in Electronic Notes in Computer Science vol. 3
Iterative structure of finite loop integrals
In this paper we develop further and refine the method of differential
equations for computing Feynman integrals. In particular, we show that an
additional iterative structure emerges for finite loop integrals. As a concrete
non-trivial example we study planar master integrals of light-by-light
scattering to three loops, and derive analytic results for all values of the
Mandelstam variables and and the mass . We start with a recent
proposal for defining a basis of loop integrals having uniform transcendental
weight properties and use this approach to compute all planar two-loop master
integrals in dimensional regularization. We then show how this approach can be
further simplified when computing finite loop integrals. This allows us to
discuss precisely the subset of integrals that are relevant to the problem. We
find that this leads to a block triangular structure of the differential
equations, where the blocks correspond to integrals of different weight. We
explain how this block triangular form is found in an algorithmic way. Another
advantage of working in four dimensions is that integrals of different loop
orders are interconnected and can be seamlessly discussed within the same
formalism. We use this method to compute all finite master integrals needed up
to three loops. Finally, we remark that all integrals have simple Mandelstam
representations.Comment: 26 pages plus appendices, 5 figure
- …