30,809 research outputs found

    A domain-specific language and matrix-free stencil code for investigating electronic properties of Dirac and topological materials

    Full text link
    We introduce PVSC-DTM (Parallel Vectorized Stencil Code for Dirac and Topological Materials), a library and code generator based on a domain-specific language tailored to implement the specific stencil-like algorithms that can describe Dirac and topological materials such as graphene and topological insulators in a matrix-free way. The generated hybrid-parallel (MPI+OpenMP) code is fully vectorized using Single Instruction Multiple Data (SIMD) extensions. It is significantly faster than matrix-based approaches on the node level and performs in accordance with the roofline model. We demonstrate the chip-level performance and distributed-memory scalability of basic building blocks such as sparse matrix-(multiple-) vector multiplication on modern multicore CPUs. As an application example, we use the PVSC-DTM scheme to (i) explore the scattering of a Dirac wave on an array of gate-defined quantum dots, to (ii) calculate a bunch of interior eigenvalues for strong topological insulators, and to (iii) discuss the photoemission spectra of a disordered Weyl semimetal.Comment: 16 pages, 2 tables, 11 figure

    Multicore-optimized wavefront diamond blocking for optimizing stencil updates

    Full text link
    The importance of stencil-based algorithms in computational science has focused attention on optimized parallel implementations for multilevel cache-based processors. Temporal blocking schemes leverage the large bandwidth and low latency of caches to accelerate stencil updates and approach theoretical peak performance. A key ingredient is the reduction of data traffic across slow data paths, especially the main memory interface. In this work we combine the ideas of multi-core wavefront temporal blocking and diamond tiling to arrive at stencil update schemes that show large reductions in memory pressure compared to existing approaches. The resulting schemes show performance advantages in bandwidth-starved situations, which are exacerbated by the high bytes per lattice update case of variable coefficients. Our thread groups concept provides a controllable trade-off between concurrency and memory usage, shifting the pressure between the memory interface and the CPU. We present performance results on a contemporary Intel processor

    Fully-automatic inverse tone mapping algorithm based on dynamic mid-level tone mapping

    Get PDF
    High Dynamic Range (HDR) displays can show images with higher color contrast levels and peak luminosities than the common Low Dynamic Range (LDR) displays. However, most existing video content is recorded and/or graded in LDR format. To show LDR content on HDR displays, it needs to be up-scaled using a so-called inverse tone mapping algorithm. Several techniques for inverse tone mapping have been proposed in the last years, going from simple approaches based on global and local operators to more advanced algorithms such as neural networks. Some of the drawbacks of existing techniques for inverse tone mapping are the need for human intervention, the high computation time for more advanced algorithms, limited low peak brightness, and the lack of the preservation of the artistic intentions. In this paper, we propose a fully-automatic inverse tone mapping operator based on mid-level mapping capable of real-time video processing. Our proposed algorithm allows expanding LDR images into HDR images with peak brightness over 1000 nits, preserving the artistic intentions inherent to the HDR domain. We assessed our results using the full-reference objective quality metrics HDR-VDP-2.2 and DRIM, and carrying out a subjective pair-wise comparison experiment. We compared our results with those obtained with the most recent methods found in the literature. Experimental results demonstrate that our proposed method outperforms the current state-of-the-art of simple inverse tone mapping methods and its performance is similar to other more complex and time-consuming advanced techniques

    Management of Digital Video Broadcasting Services in Open Delivery Platforms

    Get PDF
    The future of Digital Video Broadcasting (DVB) is moving towards solutions offering an efficient way of carrying interactive IP multimedia services over digital terrestrial broadcasting networks to handheld terminals. One of the most promising technologies is Digital Video Broadcasting-Handheld (DVB-H), at present under standardisation. Services deployed via this type of DVB technologies should enjoy reliability comparable to TV services and high quality standards. However, the market at present does not provide effective and economical solutions for the deployment of such services over multi-domain IP networks, due to their high level of unreliability. This paper focuses on service management, service level agreement (SLA) and network performance requirements of DVB-H services. Experimental results are presented concerning QoS sensitivity to network performance of DVB-H services delivered over a multi-domain IP network. Moreover, a solution for efficient and cost effective service management via QoS monitoring and control and network SLA design is proposed. The solution gives DVB-H operators the possibility of fully managing service QoS without being tied to third party operators

    Preliminary design and optimization of toroidally-wound limited angle servo motor based on a generalized magnetic circuit model

    Get PDF
    This paper proposes a new generalized equivalent magnetic circuit model for the preliminary design of a toroidally-wound limited angle servo motor (LASM). In the model, the magnetic networks are formulated as a function of the pole number and geometric dimensions. Nonlinear saturation effect of the ferromagnetic material is also taken into consideration. A multi-objective optimization function involving the torque requirement, the mass, the time constant, and magnetic saturations of ferromagnetic material is introduced. Based on the proposed model, six design cases with different objectives have been carried by the particle swarm optimization (PSO) method. The comparisons of different optimization cases demonstrate the effectiveness and computation efficiency of the proposed method, and hence its suitability in preliminary design. Moreover, the generalized model can be readily applied in the other electromagnetic modelling

    Optimization of Mixture Proportions for Concrete Pavements—Influence of Supplementary Cementitious Materials, Paste Content and Aggregate Gradation

    Get PDF
    The ultimate goals of this study included investigation of the optimal ranges for paste content, amount of cementations materials and aggregate gradation for concrete paving mixtures. In general, the optimum concrete mixtures developed in this study contained low paste content (below 23%), and were characterized by low scaling and sorptivity. In addition, it was also possible to achieve high cement replacement levels for these mixtures. Finally, for optimized fly ash mixtures, the selection of well graded aggregate gradation with high packing density increased the most desired paste content for those mixtures, thus indicating that combined aggregate gradation has strong influence on concrete performance. Lastly, concrete mixtures developed with optimum ranges of variables studied in this research contained low cement content. The overall scope of the research was divided into three distinctive phases, each of which is described briefly below: PHASE I: This phase consisted of statistical optimization of the proportions of concrete binder. The Central Composite Design methodology (CCD) was used to design the experiment for the optimization of binder in three types of concrete mixtures: a) cement + fly ash, b) cement + GGBFS, and c) cement + fly ash + GGBFS. The variables studied in each of these systems included: paste content (from 21 to 25 % by mixture volume) and total content of supplementary cementitious material (SCM) in the mixture. This was expressed as weight percent of total binder, and varied depending on the binder system used. PHASE II: The main goal of this phase was to investigate the effect of different aggregate gradations on the fresh and hardened properties of optimized concrete mixtures developed in PHASE I, as well as to identify the most desired aggregate gradations for paving mixtures. Different aggregate gradations were prepared by blending of 2, 3 or 4 different sizes of aggregates based on concept of Shilstone’s Coarseness Factor Chart. PHASE III: The concept of air-free paste–aggregate void saturation ratio (k”) introduced in PHASE II seemed to fairly accurate link the properties of concrete mixtures with their paste content. Thus, it was decided to further investigate this concept in connection with aggregate packing density (Ω). In addition, it was believed that defining optimum values of “k” will allow for revising the paste content ranges developed in PHASE I for different systems, and thus define more general optimum paste ranges for paving mixtures
    • 

    corecore