12 research outputs found
On the Efficient Evaluation of the Exchange Correlation Potential on Graphics Processing Unit Clusters
The predominance of Kohn-Sham density functional theory (KS-DFT) for the
theoretical treatment of large experimentally relevant systems in molecular
chemistry and materials science relies primarily on the existence of efficient
software implementations which are capable of leveraging the latest advances in
modern high performance computing (HPC). With recent trends in HPC leading
towards in increasing reliance on heterogeneous accelerator based architectures
such as graphics processing units (GPU), existing code bases must embrace these
architectural advances to maintain the high-levels of performance which have
come to be expected for these methods. In this work, we purpose a three-level
parallelism scheme for the distributed numerical integration of the
exchange-correlation (XC) potential in the Gaussian basis set discretization of
the Kohn-Sham equations on large computing clusters consisting of multiple GPUs
per compute node. In addition, we purpose and demonstrate the efficacy of the
use of batched kernels, including batched level-3 BLAS operations, in achieving
high-levels of performance on the GPU. We demonstrate the performance and
scalability of the implementation of the purposed method in the NWChemEx
software package by comparing to the existing scalable CPU XC integration in
NWChem.Comment: 26 pages, 9 figure
Recommended from our members
OptPlatform: metaheuristic optimisation framework for solving complex real-world problems
This thesis was submitted for the award of Doctor of Philosophy and was awarded by Brunel University LondonWe optimise daily, whether that is planning a round trip that visits the most attractions within a given holiday budget or just taking a train instead of driving a car in a rush hour. Many problems, just like these, are solved by individuals as part of our daily schedule, and they are effortless and straightforward. If we now scale that to many individuals with many different schedules, like a school timetable, we get to a point where it is just not feasible or practical to solve by hand. In such instances, optimisation methods are used to obtain an optimal solution. In this thesis, a practical approach to optimisation has been taken by developing an optimisation platform with all the necessary tools to be used by practitioners who are not necessarily familiar with the subject of optimisation. First, a high-performance metaheuristic optimisation framework (MOF) called OptPlatform is implemented, and the versatility and performance are evaluated across multiple benchmarks and real-world optimisation problems. Results show that, compared to competing MOFs, the OptPlatform outperforms in both the solution quality and computation time. Second, the most suitable hardware platform for OptPlatform is determined by an in-depth analysis of Ant Colony Optimisation scaling across CPU, GPU and enterprise Xeon Phi. Contrary to the common benchmark problems used in the literature, the supply chain problem solved could not scale on GPUs. Third, a variety of metaheuristics are implemented into OptPlatform. Including, a new metaheuristic based on Imperialist Competitive Algorithm (ICA), called ICA with Independence and Constrained Assimilation (ICAwICA) is proposed. The ICAwICA was compared against two different types of benchmark problems, and results show the versatile application of the algorithm, matching and in some cases outperforming the custom-tuned approaches. Finally, essential MOF features like automatic algorithm selection and tuning, lacking on existing frameworks, are implemented in OptPlatform. Two novel approaches are proposed and compared to existing methods. Results indicate the superiority of the implemented tuning algorithms within constrained tuning budget environment
Rheology and Structure Formation in Complex Polymer Melts
Polymeric materials are ubiquitous in our modern lives.
Their many applications in complex materials are accompanied by potentially huge benefits for technological advancement.
These applications range from batteries, fuel cells, molecular sieves, tires, and microelectronic devices.
The ability to self-assemble into nanostructures in combination with their viscoelastic properties make polymers attractive for this wide range of applications.
I perform computer simulations gaining knowledge about their properties for applications and manufacturing, to improve the understanding of these materials.
The simulation of multicomponent polymer melts poses an extreme computational challenge.
The large spatial extent of defects in self-assembled structures or nonperiodic metastable phases, which are prone to finite size effects, require the study of large system sizes.
Hence, I use a soft, coarse-grained polymer model reducing the degrees of freedom to gain insights into long time and length scales.
Consistent implementations of these models that scale well on modern GPUs accelerated HPCs hardware enable investigations with up to billions of particles. Consequently, I can address challenges that were deemed intractable before.
Firstly, I analyze metastable network phases as a function of the volume fraction, f, of diblock copolymers for polymeric battery electrolytes.
One polymer block provides the mechanical stability while the other is ion conducting.
The focus lies on the structure of the conducting phase.
Due to the trapped metastable states, I investigate systems of extreme sizes with billions of particles circumventing finite size effects.
In fact, I identify fractal structures on significant length scales inside the network phase, which influence the transport properties locally.
As such, this work highlights the necessity of soft models and scaling implementations obtaining insights on engineering scales.
Secondly, I will investigate the simulation of viscoelastic properties of polymeric materials with soft, coarse-grained models.
It is particularly challenging to correctly capture the entangled dynamics.
The noncrossability of polymer backbones introduces topological constraints on the motion of the chains.
A soft, coarse-grained model does not capture this noncrossability automatically.
Hence, I utilize a SLSP model to mimic the entanglements via dynamic bonds.
With this model and a novel technique to average the stress auto-correlation function G(t), I perform a dynamic mechanical analysis of polymer melts and a cross-linked network.
The obtained storage modulus G'(w) and loss modulus G''(w) meet the expectations for a comparison with experimental studies.
A nonequilibrium study of diblock copolymers in shear flow completes this work.
Shear flow is a powerful method to macroscopically order a metastable microstructure.
In a symmetric diblock copolymer melt, the equilibrium microstructure is a lamellar phase.
The first step determines the perpendicular orientation of the lamellae in shear flow as stable at all stresses according to the concept of the Rayleighian, R.
Further, I study the transition between a grain in the unstable orientation next to a grain in the stable orientation.
I identify two different transition pathways.
At low applied stresses, the grain boundary of the stable grain grows into the unstable grain.
At higher stresses, the unstable orientation is destabilized and forms an intermediate microemulsion-like phase with no local orientation.
This intermediate phase turns subsequently into the stable orientation. Oscillatory shear at high frequencies delays the onset of this microemulsion pathway.
In a collaboration with Matthias Heck and Manfred Wilhelm at KIT, these transitions have been studied in LAOS experiments as well
A History of Materials and Technologies Development
The purpose of the book is to provide the students with the text that presents an introductory knowledge about the development of materials and technologies and includes the most commonly available information on human development. The idea of the publication has been generated referring to the materials taken from the organic and non-organic evolution of nature. The suggested texts might be found a purposeful tool for the University students proceeding with studying engineering due to the fact that all subjects in this particular field more or less have to cover the history and development of the studied object. It is expected that studying different materials and technologies will help the students with a better understanding of driving forces, positive and negative consequences of technological development, etc
Wideband vibration energy harvesting using electromagnetic transduction for powering internet of things
The ‘Internet of Things-(IoT)’ envisions a world scattered with physical sensors that collect and transmit data about almost anything and thereby enabling intelligent decision-making for a smart environment. While technological advancements have reduced the power consumption of such devices significantly, the problem of perpetual energy supply beyond the limited capability of batteries is a bottleneck to this vision which is yet to be resolved. This issue has surged the research to investigate the prospect of harvesting the energy out of ambient mechanical vibrations. However, limited applications of conventional resonant devices under most practical environments involving frequency varying inputs, has gushed the research on wideband transducers recently. To facilitate multi-frequency operation at low-frequency regime, design innovations of the Silicon-onInsulator based MEMS suspension systems are performed through multi-modal activation. For continuous bandwidth widening, the benefits of using nonlinear stiffness in the system dynamics are investigated. By topologically varying the spring architectures, dramatically improved operational bandwidth with large power-density is obtained, which is benchmarked using a novel figure-of-merit. However, the fundamental phenomenon of multi-stability limits many nonlinear oscillator based applications including energy harvesting. To address this, an electrical control mechanism is introduced which dramatically improves the energy conversion efficiency over a wide bandwidth in a frequencyamplitude varying environment using only a small energy budget. The underlying effects are independent of the device-scale and the transduction methods, and are explained using a modified Duffing oscillator model. One of the key requirements for fully integrated electromagnetic transducers is the CMOS compatible batch-fabrication of permanent magnets with large energy-product. In the final module of the works, nano-structured CoPtP hard-magnetic material with large coercivity is developed at room-temperature using a current modulated electro-deposition technique. The demagnetization fields of the magnetic structures are minimized through optimized micro-patterns which enable the full integration of high performance electromagnetic energy harvesters
Elasmobranchs as living resources: Advances in the biology, ecology, systematics, and the status of the fisheries
This report owes its genesis to the foresight and enthusiam
of Dr. Kazuhiro Mizue. By happy circumstance, Professor Mizue contacted me in 1983 with his visionary ideas on cooperative programs. He noted that the time was right because the Japan Society for the Promotion of Science and the National Science Foundation had mutually given priority to cooperative programs in marine biology.
I therefore agreed to act as the U.S. coordinator and proposed to NSF, a short trip to Japan to negotiate site visits and timing with ten previously appointed Japanese scientists and, if that trip were successful, to negotiate a joint research project, possibly followed by a joint seminar. (PDF file contains 528 pages.
XSEDE: eXtreme Science and Engineering Discovery Environment Third Quarter 2012 Report
The Extreme Science and Engineering Discovery Environment (XSEDE) is the most advanced, powerful, and robust collection of integrated digital resources and services in the world. It is an integrated cyberinfrastructure ecosystem with singular interfaces for allocations, support, and other key services that researchers can use to interactively share computing resources, data, and expertise.This a report of project activities and highlights from the third quarter of 2012.National Science Foundation, OCI-105357
A Meeting of Minds: In Recognition of the Contributions of Randall J. Cohrs
A Special Issue in memory of Randall J. Cohrs, Ph.D. Topics include original research reports on a variety of viruses as well as reviews and commentaries on Randy’s contributions to many investigations