122 research outputs found

    New Multithreaded Hybrid CPU/GPU Approach to Hartree−Fock

    Get PDF
    In this article, a new multithreaded Hartree–Fock CPU/GPU method is presented which utilizes automatically generated code and modern C++ techniques to achieve a significant improvement in memory usage and computer time. In particular, the newly implemented Rys Quadrature and Fock Matrix algorithms, implemented as a stand-alone C++ library, with C and Fortran bindings, provides up to 40% improvement over the traditional Fortran Rys Quadrature. The C++ GPU HF code provides approximately a factor of 17.5 improvement over the corresponding C++ CPU code

    Computational Physics on Graphics Processing Units

    Full text link
    The use of graphics processing units for scientific computations is an emerging strategy that can significantly speed up various different algorithms. In this review, we discuss advances made in the field of computational physics, focusing on classical molecular dynamics, and on quantum simulations for electronic structure calculations using the density functional theory, wave function techniques, and quantum field theory.Comment: Proceedings of the 11th International Conference, PARA 2012, Helsinki, Finland, June 10-13, 201

    CUDA Implementation of a Navier-Stokes Solver on Multi-GPU Desktop Platforms for Incompressible Flows

    Get PDF
    Graphics processor units (GPU) that are traditionally designed for graphics rendering have emerged as massively-parallel co-processors to the central processing unit (CPU). Small-footprint desktop supercomputers with hundreds of cores that can deliver teraflops peak performance at the price of conventional workstations have been realized. A computational fluid dynamics (CFD) simulation capability with rapid computational turnaround time has the potential to transform engineering analysis and design optimization procedures. We describe the implementation of a Navier-Stokes solver for incompressible fluid flow using desktop platforms equipped with multi-GPUs. Specifically, NVIDIA’s Compute Unified Device Architecture (CUDA) programming model is used to implement the discretized form of the governing equations. The projection algorithm to solve the incompressible fluid flow equations is divided into distinct CUDA kernels, and a unique implementation that exploits the memory hierarchy of the CUDA programming model is suggested. Using a quad-GPU platform, we observe two orders of magnitude speedup relative to a serial CPU implementation. Our results demonstrate that multi-GPU desktops can serve as a cost-effective small-footprint parallel computing platform to accelerate CFD simulations substantially. I. Introductio

    Excited-State Electronic Structure with Configuration Interaction Singles and Tamm–Dancoff Time-Dependent Density Functional Theory on Graphical Processing Units

    Get PDF
    Excited-state calculations are implemented in a development version of the GPU-based TeraChem software package using the configuration interaction singles (CIS) and adiabatic linear response Tamm–Dancoff time-dependent density functional theory (TDA-TDDFT) methods. The speedup of the CIS and TDDFT methods using GPU-based electron repulsion integrals and density functional quadrature integration allows full ab initio excited-state calculations on molecules of unprecedented size. CIS/6-31G and TD-BLYP/6-31G benchmark timings are presented for a range of systems, including four generations of oligothiophene dendrimers, photoactive yellow protein (PYP), and the PYP chromophore solvated with 900 quantum mechanical water molecules. The effects of double and single precision integration are discussed, and mixed precision GPU integration is shown to give extremely good numerical accuracy for both CIS and TDDFT excitation energies (excitation energies within 0.0005 eV of extended double precision CPU results)

    Ab Initio Screening Approach for the Discovery of Lignin Polymer Breaking Pathways

    Get PDF
    The directed depolymerization of lignin biopolymers is of utmost relevance for the valorization or commercialization of biomass fuels. We present a computational and theoretical screening approach to identify potential cleavage pathways and resulting fragments that are formed during depolymerization of lignin oligomers containing two to six monomers. We have developed a chemical discovery technique to identify the chemically relevant putative fragments in eight known polymeric linkage types of lignin. Obtaining these structures is a crucial precursor to the development of any further kinetic modeling. We have developed this approach by adapting steered molecular dynamics calculations under constant force and varying the points of applied force in the molecule to diversify the screening approach. Key observations include relationships between abundance and breaking frequency, the relative diversity of potential pathways for a given linkage, and the observation that readily cleaved bonds can destabilize adjacent bonds, causing subsequent automatic cleavage.Massachusetts Institute of Technology (Research Support Corporation, Reed Grant)United States. Dept. of Energy. Computational Science Graduate Fellowship Program (DOE-CSGF)Burroughs Wellcome Fund (Career Award at the Scientific Interface

    A functional variant in the Stearoyl-CoA desaturase gene promoter enhances fatty acid desaturation in pork

    Get PDF
    There is growing public concern about reducing saturated fat intake. Stearoyl-CoA desaturase (SCD) is the lipogenic enzyme responsible for the biosynthesis of oleic acid (18:1) by desaturating stearic acid (18:0). Here we describe a total of 18 mutations in the promoter and 3′ non-coding region of the pig SCD gene and provide evidence that allele T at AY487830:g.2228T>C in the promoter region enhances fat desaturation (the ratio 18:1/18:0 in muscle increases from 3.78 to 4.43 in opposite homozygotes) without affecting fat content (18:0+18:1, intramuscular fat content, and backfat thickness). No mutations that could affect the functionality of the protein were found in the coding region. First, we proved in a purebred Duroc line that the C-T-A haplotype of the 3 single nucleotide polymorphisms (SNPs) (g.2108C>T; g.2228T>C; g.2281A>G) of the promoter region was additively associated to enhanced 18:1/18:0 both in muscle and subcutaneous fat, but not in liver. We show that this association was consistent over a 10-year period of overlapping generations and, in line with these results, that the C-T-A haplotype displayed greater SCD mRNA expression in muscle. The effect of this haplotype was validated both internally, by comparing opposite homozygote siblings, and externally, by using experimental Duroc-based crossbreds. Second, the g.2281A>G and the g.2108C>T SNPs were excluded as causative mutations using new and previously published data, restricting the causality to g.2228T>C SNP, the last source of genetic variation within the haplotype. This mutation is positioned in the core sequence of several putative transcription factor binding sites, so that there are several plausible mechanisms by which allele T enhances 18:1/18:0 and, consequently, the proportion of monounsaturated to saturated fat.This research was supported by grants from the Spanish Ministry of Science and Innovation (AGL2009-09779 and AGL2012-33529). RRF is recipient of a PhD scholarship from the Spanish Ministry of Science and Innovation (BES-2010-034607). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of manuscript

    The STAR experiment at the relativistic heavy ion collider

    Full text link
    • …
    corecore