Relay: A New IR for Machine Learning Frameworks

Abadi Martin; Chen Tianqi; Krizhevsky Alex; Rotem Nadav; Shankar Asim; Vasilache Nicolas; Wei Richard; Wiltschko Alex

Publication date: 25 September 2018
Publisher: Association for Computing Machinery (ACM)

Abstract

Machine learning powers diverse services in industry, including search, translation, recommendation systems, and security. The scale and importance of these models require that they be efficient, expressive, and portable across an array of heterogeneous hardware devices. These constraints are often at odds; to better accommodate them, we propose a new high-level intermediate representation (IR) called Relay. Relay is being designed as a purely functional, statically typed language with the goal of balancing efficient compilation, expressiveness, and portability. We discuss the goals of Relay and highlight its important design constraints. Our prototype is part of the open source NNVM compiler framework, which powers Amazon's deep learning framework MXNet.

DOI: 10.1145/3211346.321...