45,152 research outputs found
Recent Advances in Graph Partitioning
We survey recent trends in practical algorithms for balanced graph
partitioning together with applications and future research directions
Achieving High Speed CFD simulations: Optimization, Parallelization, and FPGA Acceleration for the unstructured DLR TAU Code
Today, large scale parallel simulations are fundamental tools to handle complex problems. The number of processors in current computation platforms has been recently increased and therefore it is necessary to optimize the application performance and to enhance the scalability of massively-parallel systems. In addition, new heterogeneous architectures, combining conventional processors with specific hardware, like FPGAs, to accelerate the most time consuming functions are considered as a strong alternative to boost the performance.
In this paper, the performance of the DLR TAU code is analyzed and optimized. The improvement of the code efficiency is addressed through three key activities: Optimization, parallelization and hardware acceleration. At first, a profiling analysis of the most time-consuming processes of the Reynolds Averaged Navier Stokes flow solver on a three-dimensional unstructured mesh is performed. Then, a study of the code scalability with new partitioning algorithms are tested to show the most suitable partitioning algorithms for the selected applications. Finally, a feasibility study on the application of FPGAs and GPUs for the hardware acceleration of CFD simulations is presented
A Light Signalling Approach to Node Grouping for Massive MIMO IoT Networks
Massive MIMO is a promising technology to connect very large numbers of
energy constrained nodes, as it offers both extensive spatial multiplexing and
large array gain. A challenge resides in partitioning the many nodes in groups
that can communicate simultaneously such that the mutual interference is
minimized. We here propose node partitioning strategies that do not require
full channel state information, but rather are based on nodes' respective
directional channel properties. In our considered scenarios, these typically
have a time constant that is far larger than the coherence time of the channel.
We developed both an optimal and an approximation algorithm to partition users
based on directional channel properties, and evaluated them numerically. Our
results show that both algorithms, despite using only these directional channel
properties, achieve similar performance in terms of the minimum
signal-to-interference-plus-noise ratio for any user, compared with a reference
method using full channel knowledge. In particular, we demonstrate that
grouping nodes with related directional properties is to be avoided. We hence
realise a simple partitioning method requiring minimal information to be
collected from the nodes, and where this information typically remains stable
over a long term, thus promoting their autonomy and energy efficiency
- …