Search CORE

476 research outputs found

Verified lifting of stencil computations

Author: Alur Rajeev
Alvin Cheung
Amarasinghe Saman P
Armando Solar-Lezama
Catanzaro Bryan
Heroux Michael A.
Jeon Jinseong
Kamil Shoaib
Karbyshev Aleksandr
Lezama Armando Solar
Mallinson A.C.
Plotkin Gordon D
Reynolds John C.
Shachar Itzhaky
Shoaib Kamil
Vasco Diego
Zaharia Matei
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/06/2016
Field of study

This paper demonstrates a novel combination of program synthesis and verification to lift stencil computations from low-level Fortran code to a high-level summary expressed using a predicate language. The technique is sound and mostly automated, and leverages counter-example guided inductive synthesis (CEGIS) to find provably correct translations. Lifting existing code to a high-performance description language has a number of benefits, including maintainability and performance portability. For example, our experiments show that the lifted summaries can enable domain specific compilers to do a better job of parallelization as compared to an off-the-shelf compiler working on the original code, and can even support fully automatic migration to hardware accelerators such as GPUs. We have implemented verified lifting in a system called STNG and have evaluated it using microbenchmarks, mini-apps, and real-world applications. We demonstrate the benefits of verified lifting by first automatically summarizing Fortran source code into a high-level predicate language, and subsequently translating the lifted summaries into Halide, with the translated code achieving median performance speedups of 4.1X and up to 24X for non-trivial stencils as compared to the original implementation.United States. Department of Energy. Office of Science (Award DE-SC0008923)United States. Department of Energy. Office of Science (Award DE-SC0005288

DSpace@MIT

Crossref

On 2D elliptic discontinuous Galerkin methods

Author: Kirby RM
Peiro J
Sherwin SJ
Taylor RL
Zienkiewicz OC
Publication venue: 'Wiley'
Publication date: 29/01/2006
Field of study

Spiral - Imperial College Digital Repository

Towards self-verification in finite difference code generation

Author: Dwyer M
Gorman G
Hovland P
Hückelheim JC
Kukreja N
Lange M
Luo Z
Luporini F
Siegel S
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 18/09/2017
Field of study

Code generation from domain-specific languages is becoming increasingly popular as a method to obtain optimised low-level code that performs well on a given platform and for a given problem instance. Ensuring the correctness of generated codes is crucial. At the same time, testing or manual inspection of the code is problematic, as the generated code can be complex and hard to read. Moreover, the generated code may change depending on the problem type, domain size, or target platform, making conventional code review or testing methods impractical. As a solution, we propose the integration of formal verification tools into the code generation process. We present a case study in which the CIVL verification tool is combined with the Devito finite difference framework that generates optimised stencil code for PDE solvers from symbolic equations. We show a selection of properties of the generated code that can be automatically specified and verified during the code generation process. Our approach allowed us to detect a previously unknown bug in the Devito code generation tool

Spiral - Imperial College Digital Repository

Code Transpilation for Hardware Accelerators

Author: Bhatia Sahil
Cheung Alvin
Genc Hasan
Laddad Shadaj
Nishida Yuto
Shao Yakun Sophia
Publication venue
Publication date: 11/08/2023
Field of study

DSLs and hardware accelerators have proven to be very effective in optimizing computationally expensive workloads. In this paper, we propose a solution to the challenge of manually rewriting legacy or unoptimized code in domain-specific languages and hardware accelerators. We introduce an approach that integrates two open-source tools: Metalift, a code translation framework, and Gemmini, a DNN accelerator generator. The integration of these two tools offers significant benefits, including simplified workflows for developers to run legacy code on Gemmini generated accelerators and a streamlined programming stack for Gemmini that reduces the effort required to add new instructions. This paper provides details on this integration and its potential to simplify and optimize computationally expensive workloads

arXiv.org e-Print Archive

C2TACO: Lifting Tensor Code to TACO

Author: De Souza Magalhães José Wesley
O'Boyle Michael F P
Polgreen Elizabeth
Woodruff Jackson
Publication venue
Publication date: 22/10/2023
Field of study

Edinburgh Research Explorer

The Three Pillars of Machine Programming

Author: Albarghouthi Aws
Alur Rajeev
Babble Labble Chris Re.
Bielik Pavol
Clinton Whaley R.
D’Antoni Loris
Frigo Matteo
Inala Jeevana Priya
Iyer Srinivasan
Le Xuan-Bach D.
Lei Tao
Meng Na
Peleg Hila
Singh Rishabh
Thien Nguyen Hoang Duong
Zheng Wenting
Publication venue: ScholarlyCommons
Publication date: 01/01/2018
Field of study

In this position paper, we describe our vision of the future of machine programming through a categorical examination of three pillars of research. Those pillars are:(i) intention,(ii) invention, and (iii) adaptation. Intention emphasizes advancements in the human-to-computer and computer-to-machine-learning interfaces. Invention emphasizes the creation or refinement of algorithms or core hardware and software building blocks through machine learning (ML). Adaptation emphasizes advances in the use of ML-based constructs to autonomously evolve software

Crossref

ScholarlyCommons@Penn

High Order Spectral Volume and Spectral Difference Methods on Unstructured Grids

Author: Kannan Ravishekar
Publication venue: Iowa State University Digital Repository
Publication date: 01/01/2008
Field of study

The spectral volume (SV) and the spectral difference (SD) methods were developed by Wang and Liu and their collaborators for conservation laws on unstructured grids. They were introduced to achieve high-order accuracy in an efficient manner. Recently, these methods were extended to three-dimensional systems and to the Navier Stokes equations. The simplicity and robustness of these methods have made them competitive against other higher order methods such as the discontinuous Galerkin and residual distribution methods. Although explicit TVD Runge-Kutta schemes for the temporal advancement are easy to implement, they suffer from small time step limited by the Courant-Friedrichs-Lewy (CFL) condition. When the polynomial order is high or when the grid is stretched due to complex geometries or boundary layers, the convergence rate of explicit schemes slows down rapidly. Solution strategies to remedy this problem include implicit methods and multigrid methods. A novel implicit lower-upper symmetric Gauss-Seidel (LU-SGS) relaxation method is employed as an iterative smoother. It is compared to the explicit TVD Runge-Kutta smoothers. For some p-multigrid calculations, combining implicit and explicit smoothers for different p-levels is also studied. The multigrid method considered is nonlinear and uses Full Approximation Scheme (FAS). An overall speed-up factor of up to 150 is obtained using a three-level p-multigrid LU-SGS approach in comparison with the single level explicit method for the Euler equations for the 3rd order SD method. A study of viscous flux formulations was carried out for the SV method. Three formulations were used to discretize the viscous fluxes: local discontinuous Galerkin (LDG), a penalty method and the 2nd method of Bassi and Rebay. Fourier analysis revealed some interesting advantages for the penalty method. These were implemented in the Navier Stokes solver. An implicit and p-multigrid method was also implemented for the above. An overall speed-up factor of up to 1500 is obtained using a three-level p-multigrid LU-SGS approach in comparison with the single level explicit method for the Navier-Stokes equations. The SV method was also extended to turbulent flows. The RANS based SA model was used to close the Reynolds stresses. The numerical results are very promising and indicate that the approaches have great potentials for 3D flow problems

Digital Repository @ Iowa State University (ISU)

CiteSeerX