Search CORE

26,219 research outputs found

C++ Templates as Partial Evaluation

Author: Veldhuizen Todd L.
Publication venue
Publication date: 02/11/1998
Field of study

This paper explores the relationship between C++ templates and partial evaluation. Templates were designed to support generic programming, but unintentionally provided the ability to perform compile-time computations and code generation. These features are completely accidental, and as a result their syntax is awkward. By recasting these features in terms of partial evaluation, a much simpler syntax can be achieved. C++ may be regarded as a two-level language in which types are first-class values. Template instantiation resembles an offline partial evaluator. This paper describes preliminary work toward a single mechanism based on Partial Evaluation which unifies generic programming, compile-time computation and code generation. The language Catat is introduced to illustrate these ideas.Comment: 13 page

arXiv.org e-Print Archive

CiteSeerX

The CIAO multiparadigm compiler and system: A progress report

Author: Bueno Carrillo Francisco
García de la Banda M.
Hermenegildo Manuel V.
Puebla Sánchez Alvaro Germán
Publication venue: Facultad de Informática (UPM)
Publication date: 01/01/1995
Field of study

Abstract is not available

Archivo Digital UPM

Speculative Staging for Interpreter Optimization

Author: Brunthaler Stefan
Publication venue
Publication date: 08/10/2013
Field of study

Interpreters have a bad reputation for having lower performance than just-in-time compilers. We present a new way of building high performance interpreters that is particularly effective for executing dynamically typed programming languages. The key idea is to combine speculative staging of optimized interpreter instructions with a novel technique of incrementally and iteratively concerting them at run-time. This paper introduces the concepts behind deriving optimized instructions from existing interpreter instructions---incrementally peeling off layers of complexity. When compiling the interpreter, these optimized derivatives will be compiled along with the original interpreter instructions. Therefore, our technique is portable by construction since it leverages the existing compiler's backend. At run-time we use instruction substitution from the interpreter's original and expensive instructions to optimized instruction derivatives to speed up execution. Our technique unites high performance with the simplicity and portability of interpreters---we report that our optimization makes the CPython interpreter up to more than four times faster, where our interpreter closes the gap between and sometimes even outperforms PyPy's just-in-time compiler.Comment: 16 pages, 4 figures, 3 tables. Uses CPython 3.2.3 and PyPy 1.

arXiv.org e-Print Archive

CiteSeerX

ADF95: Tool for automatic differentiation of a FORTRAN code designed for large numbers of independent variables

Author: Adams
Beck
Christian W. Straka
Ehrig
Metcalf
Stamatiadis
Publication venue: 'Elsevier BV'
Publication date: 04/03/2005
Field of study

ADF95 is a tool to automatically calculate numerical first derivatives for any mathematical expression as a function of user defined independent variables. Accuracy of derivatives is achieved within machine precision. ADF95 may be applied to any FORTRAN 77/90/95 conforming code and requires minimal changes by the user. It provides a new derived data type that holds the value and derivatives and applies forward differencing by overloading all FORTRAN operators and intrinsic functions. An efficient indexing technique leads to a reduced memory usage and a substantially increased performance gain over other available tools with operator overloading. This gain is especially pronounced for sparse systems with large number of independent variables. A wide class of numerical simulations, e.g., those employing implicit solvers, can profit from ADF95.Comment: 24 pages, 2 figures, 4 tables, accepted in Computer Physics Communication

arXiv.org e-Print Archive

Crossref

Relay: A New IR for Machine Learning Frameworks

Author: Abadi Martin
Chen Tianqi
Krizhevsky Alex
Rotem Nadav
Shankar Asim
Vasilache Nicolas
Wei Richard
Wiltschko Alex
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 25/09/2018
Field of study

Machine learning powers diverse services in industry including search, translation, recommendation systems, and security. The scale and importance of these models require that they be efficient, expressive, and portable across an array of heterogeneous hardware devices. These constraints are often at odds; in order to better accommodate them we propose a new high-level intermediate representation (IR) called Relay. Relay is being designed as a purely-functional, statically-typed language with the goal of balancing efficient compilation, expressiveness, and portability. We discuss the goals of Relay and highlight its important design constraints. Our prototype is part of the open source NNVM compiler framework, which powers Amazon's deep learning framework MxNet

arXiv.org e-Print Archive

Crossref