A domain-specific language and matrix-free stencil code for investigating electronic properties of Dirac and topological materials
We introduce PVSC-DTM (Parallel Vectorized Stencil Code for Dirac and
Topological Materials), a library and code generator based on a domain-specific
language tailored to implement the specific stencil-like algorithms that can
describe Dirac and topological materials such as graphene and topological
insulators in a matrix-free way. The generated hybrid-parallel (MPI+OpenMP)
code is fully vectorized using Single Instruction Multiple Data (SIMD)
extensions. It is significantly faster than matrix-based approaches on the node
level and performs in accordance with the roofline model. We demonstrate the
chip-level performance and distributed-memory scalability of basic building
blocks such as sparse matrix-(multiple-) vector multiplication on modern
multicore CPUs. As an application example, we use the PVSC-DTM scheme to (i)
explore the scattering of a Dirac wave on an array of gate-defined quantum
dots, to (ii) calculate a bunch of interior eigenvalues for strong topological
insulators, and to (iii) discuss the photoemission spectra of a disordered Weyl
semimetal.
Comment: 16 pages, 2 tables, 11 figures
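To make the term "matrix-free" concrete, the toy sketch below applies a nearest-neighbor hopping operator on a periodic 1D chain directly as a stencil and checks the result against an explicitly stored matrix. It is an illustration only: the 1D chain, the hopping parameter `t`, and all function names are assumptions made here, and none of this is PVSC-DTM's domain-specific language or its generated code.

```python
def apply_stencil(psi, t=1.0):
    """Return H*psi for nearest-neighbor hopping on a periodic 1D chain.

    The operator is never stored; each output entry is computed from the
    two neighboring input entries (a 2-point stencil).
    """
    n = len(psi)
    return [-t * (psi[(i - 1) % n] + psi[(i + 1) % n]) for i in range(n)]


def build_matrix(n, t=1.0):
    """Build the same operator as an explicit dense n-by-n matrix."""
    h = [[0.0] * n for _ in range(n)]
    for i in range(n):
        h[i][(i - 1) % n] += -t
        h[i][(i + 1) % n] += -t
    return h


def matvec(h, psi):
    """Plain matrix-vector product, used as the matrix-based reference."""
    return [sum(hij * pj for hij, pj in zip(row, psi)) for row in h]


psi = [1.0, 2.0, 3.0, 4.0, 5.0]
stencil_result = apply_stencil(psi)
matrix_result = matvec(build_matrix(len(psi)), psi)
```

Both paths give the same vector, but the stencil version reads only the input vector, whereas the matrix version also has to load the stored operator entries from memory.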
Multicore-optimized wavefront diamond blocking for optimizing stencil updates
The importance of stencil-based algorithms in computational science has
focused attention on optimized parallel implementations for multilevel
cache-based processors. Temporal blocking schemes leverage the large bandwidth
and low latency of caches to accelerate stencil updates and approach
theoretical peak performance. A key ingredient is the reduction of data traffic
across slow data paths, especially the main memory interface. In this work we
combine the ideas of multi-core wavefront temporal blocking and diamond tiling
to arrive at stencil update schemes that show large reductions in memory
pressure compared to existing approaches. The resulting schemes show
performance advantages in bandwidth-starved situations, which are exacerbated
by the high bytes per lattice update case of variable coefficients. Our thread
groups concept provides a controllable trade-off between concurrency and memory
usage, shifting the pressure between the memory interface and the CPU. We
present performance results on a contemporary Intel processor.
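The link between memory traffic and attainable performance can be illustrated with a roofline-style bound. The sketch below is not taken from the paper: the function name and all machine and kernel numbers are hypothetical, chosen only to show how lowering the bytes moved per lattice update (LUP) raises the bound until the in-core limit takes over.

```python
def stencil_roofline(peak_lups, bandwidth_bytes_per_s, bytes_per_lup):
    """Upper bound on lattice updates per second (LUP/s).

    Performance is limited either by the in-core peak or by how many
    updates the memory interface can feed at the given code balance.
    """
    if bytes_per_lup <= 0:
        raise ValueError("bytes_per_lup must be positive")
    return min(peak_lups, bandwidth_bytes_per_s / bytes_per_lup)


# Hypothetical numbers, for illustration only.
PEAK_LUPS = 5.0e9    # assumed in-core limit
BANDWIDTH = 50.0e9   # assumed memory bandwidth in bytes/s

# Assumed code balance without temporal blocking: 24 bytes per update.
baseline = stencil_roofline(PEAK_LUPS, BANDWIDTH, 24.0)

# Assumed code balance with temporal blocking: 6 bytes per update.
blocked = stencil_roofline(PEAK_LUPS, BANDWIDTH, 6.0)
```

With these assumed values the baseline is bandwidth-bound at roughly 2.1e9 LUP/s, while the blocked variant is capped by the in-core limit of 5e9 LUP/s.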
Fully-automatic inverse tone mapping algorithm based on dynamic mid-level tone mapping
High Dynamic Range (HDR) displays can show images with higher color contrast levels and peak luminosities than common Low Dynamic Range (LDR) displays. However, most existing video content is recorded and/or graded in LDR format. To show LDR content on HDR displays, it needs to be up-scaled using a so-called inverse tone mapping algorithm. Several techniques for inverse tone mapping have been proposed in recent years, ranging from simple approaches based on global and local operators to more advanced algorithms such as neural networks. Drawbacks of existing techniques include the need for human intervention, the high computation time of the more advanced algorithms, limited peak brightness, and the failure to preserve the artistic intentions. In this paper, we propose a fully-automatic inverse tone mapping operator based on mid-level mapping capable of real-time video processing. Our proposed algorithm allows expanding LDR images into HDR images with peak brightness over 1000 nits, preserving the artistic intentions inherent to the HDR domain. We assessed our results using the full-reference objective quality metrics HDR-VDP-2.2 and DRIM, and by carrying out a subjective pair-wise comparison experiment. We compared our results with those obtained with the most recent methods found in the literature. Experimental results demonstrate that our proposed method outperforms current state-of-the-art simple inverse tone mapping methods and that its performance is similar to that of more complex and time-consuming advanced techniques.
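For readers unfamiliar with global expansion operators, the sketch below shows about the simplest one: a power-law curve that maps a normalized LDR value to an absolute luminance. It is explicitly not the mid-level mapping operator proposed in the paper; the function name, the exponent `gamma`, and the default peak of 1000 nits are assumptions chosen for illustration.

```python
def expand_luminance(ldr, peak_nits=1000.0, gamma=2.2):
    """Map a normalized LDR value in [0, 1] to luminance in nits.

    A plain power-law expansion: 0 stays black, 1 reaches the assumed
    display peak, and mid-tones are pushed down relative to a linear
    stretch because gamma > 1.
    """
    if not 0.0 <= ldr <= 1.0:
        raise ValueError("ldr must lie in [0, 1]")
    return peak_nits * ldr ** gamma


# Expand a few sample pixel values (hypothetical input).
samples = [0.0, 0.25, 0.5, 0.75, 1.0]
expanded = [expand_luminance(v) for v in samples]
```

A fixed global curve like this needs no human intervention and is cheap to evaluate, but it cannot adapt to image content, which is the kind of limitation that content-dependent operators aim to address.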
Management of Digital Video Broadcasting Services in Open Delivery Platforms
The future of Digital Video Broadcasting (DVB) is moving towards solutions offering an efficient way of carrying interactive IP multimedia services over digital terrestrial broadcasting networks to handheld terminals. One of the most promising technologies is Digital Video Broadcasting-Handheld (DVB-H), at present under standardisation. Services deployed via this type of DVB technology should enjoy reliability comparable to TV services and high quality standards. However, the market at present does not provide effective and economical solutions for the deployment of such services over multi-domain IP networks, due to their high level of unreliability. This paper focuses on service management, service level agreement (SLA) and network performance requirements of DVB-H services. Experimental results are presented concerning QoS sensitivity to network performance of DVB-H services delivered over a multi-domain IP network. Moreover, a solution for efficient and cost-effective service management via QoS monitoring and control and network SLA design is proposed. The solution gives DVB-H operators the possibility of fully managing service QoS without being tied to third-party operators.
Preliminary design and optimization of toroidally-wound limited angle servo motor based on a generalized magnetic circuit model
This paper proposes a new generalized equivalent magnetic circuit model for the preliminary design of a toroidally-wound limited angle servo motor (LASM). In the model, the magnetic networks are formulated as a function of the pole number and geometric dimensions. The nonlinear saturation effect of the ferromagnetic material is also taken into consideration. A multi-objective optimization function involving the torque requirement, the mass, the time constant, and the magnetic saturation of the ferromagnetic material is introduced. Based on the proposed model, six design cases with different objectives have been carried out using the particle swarm optimization (PSO) method. The comparison of the different optimization cases demonstrates the effectiveness and computational efficiency of the proposed method, and hence its suitability for preliminary design. Moreover, the generalized model can be readily applied to other electromagnetic modelling problems.
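As a reminder of how particle swarm optimization works, the sketch below implements a basic global-best PSO and runs it on a toy two-variable cost. Everything here is an assumption for illustration: the inertia and acceleration coefficients, the swarm size, and the quadratic `weighted_cost` function are hypothetical and stand in for the paper's magnetic circuit model and multi-objective function, which are not reproduced.

```python
import random


def pso(objective, bounds, n_particles=30, n_iters=200,
        w=0.7, c1=1.5, c2=1.5, seed=0):
    """Minimize `objective` over box `bounds` with global-best PSO."""
    rng = random.Random(seed)
    dim = len(bounds)
    pos = [[rng.uniform(lo, hi) for lo, hi in bounds]
           for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in pos]                 # personal best positions
    pbest_val = [objective(p) for p in pos]
    g = min(range(n_particles), key=lambda i: pbest_val[i])
    gbest, gbest_val = pbest[g][:], pbest_val[g]  # swarm-wide best

    for _ in range(n_iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                # Inertia + pull toward personal best + pull toward global best.
                vel[i][d] = (w * vel[i][d]
                             + c1 * r1 * (pbest[i][d] - pos[i][d])
                             + c2 * r2 * (gbest[d] - pos[i][d]))
                lo, hi = bounds[d]
                pos[i][d] = min(max(pos[i][d] + vel[i][d], lo), hi)
            val = objective(pos[i])
            if val < pbest_val[i]:
                pbest[i], pbest_val[i] = pos[i][:], val
                if val < gbest_val:
                    gbest, gbest_val = pos[i][:], val
    return gbest, gbest_val


def weighted_cost(x):
    """Hypothetical scalarized cost with its minimum at (1.0, -0.5)."""
    return (x[0] - 1.0) ** 2 + 2.0 * (x[1] + 0.5) ** 2


best_x, best_cost = pso(weighted_cost, [(-5.0, 5.0), (-5.0, 5.0)])
```

Combining several objectives into one weighted sum, as `weighted_cost` does, is only one way to handle multiple objectives; changing the weights gives different design cases from the same model.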
Optimization of Mixture Proportions for Concrete Pavements: Influence of Supplementary Cementitious Materials, Paste Content and Aggregate Gradation
The ultimate goals of this study included investigating the optimal ranges for paste content, amount of cementitious materials, and aggregate gradation for concrete paving mixtures. In general, the optimum concrete mixtures developed in this study contained low paste content (below 23%), and were characterized by low scaling and sorptivity. In addition, it was also possible to achieve high cement replacement levels for these mixtures. Moreover, for optimized fly ash mixtures, the selection of well-graded aggregate gradation with high packing density increased the most desired paste content for those mixtures, thus indicating that combined aggregate gradation has a strong influence on concrete performance. Lastly, concrete mixtures developed with optimum ranges of variables studied in this research contained low cement content.
The overall scope of the research was divided into three distinct phases, each of which is described briefly below:
PHASE I: This phase consisted of statistical optimization of the proportions of concrete binder. The Central Composite Design (CCD) methodology was used to design the experiment for the optimization of binder in three types of concrete mixtures: a) cement + fly ash, b) cement + GGBFS, and c) cement + fly ash + GGBFS. The variables studied in each of these systems included paste content (from 21 to 25% by mixture volume) and total content of supplementary cementitious material (SCM) in the mixture. The latter was expressed as weight percent of total binder, and varied depending on the binder system used.
PHASE II: The main goal of this phase was to investigate the effect of different aggregate gradations on the fresh and hardened properties of the optimized concrete mixtures developed in PHASE I, as well as to identify the most desired aggregate gradations for paving mixtures. Different aggregate gradations were prepared by blending 2, 3 or 4 different sizes of aggregates based on the concept of Shilstone's Coarseness Factor Chart.
PHASE III: The concept of the air-free paste-aggregate void saturation ratio ("k") introduced in PHASE II seemed to link the properties of concrete mixtures to their paste content fairly accurately. Thus, it was decided to investigate this concept further in connection with aggregate packing density (Ί). In addition, it was believed that defining optimum values of "k" would allow the paste content ranges developed in PHASE I for different systems to be revised, and thus allow more general optimum paste ranges to be defined for paving mixtures.
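The Central Composite Design used in PHASE I can be sketched for two factors as follows. This is a generic textbook construction, not the study's actual run table: the rotatable choice of the axial distance, the single center point, and the assumption that the 21 to 25% paste content range spans the axial extremes are all hypothetical, and the SCM levels are omitted because the abstract does not state them.

```python
import itertools


def central_composite(k, alpha=None, n_center=1):
    """Return coded design points of a central composite design.

    The design combines 2**k factorial corners at +/-1, 2*k axial
    points at +/-alpha on each axis, and n_center center points.
    """
    if alpha is None:
        alpha = (2 ** k) ** 0.25  # rotatable design (assumed choice)
    factorial = [list(p) for p in itertools.product((-1.0, 1.0), repeat=k)]
    axial = []
    for i in range(k):
        for s in (-alpha, alpha):
            point = [0.0] * k
            point[i] = s
            axial.append(point)
    center = [[0.0] * k for _ in range(n_center)]
    return factorial + axial + center


def decode(coded, low, high, alpha):
    """Map a coded level to real units so that -alpha -> low, +alpha -> high."""
    mid = (low + high) / 2.0
    half_range = (high - low) / 2.0
    return mid + coded / alpha * half_range


ALPHA = (2 ** 2) ** 0.25  # axial distance for two factors
design = central_composite(2)

# Paste content (% by mixture volume) for each run, under the stated assumption.
paste_levels = [decode(run[0], 21.0, 25.0, ALPHA) for run in design]
```

For two factors this yields nine runs: four corners, four axial points, and one center point at 23% paste content.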