Representations of Materials for Machine Learning

Damewood, James; Gómez-Bombarelli, Rafael; Karaguesian, Jessica; Lunger, Jaclyn R.; Peng, Jiayu; Tan, Aik Rui; Xie, Mingrou

Representations of Materials for Machine Learning

Authors: James Damewood
Rafael Gómez-Bombarelli
Jessica Karaguesian
Jaclyn R. Lunger
Jiayu Peng
Aik Rui Tan
Mingrou Xie
Publication date: 20 January 2023
Publisher

Abstract

High-throughput data generation methods and machine learning (ML) algorithms have given rise to a new era of computational materials science by learning relationships among composition, structure, and properties and by exploiting such relations for design. However, to build these connections, materials data must be translated into a numerical form, called a representation, that can be processed by a machine learning model. Datasets in materials science vary in format (ranging from images to spectra), size, and fidelity. Predictive models vary in scope and property of interests. Here, we review context-dependent strategies for constructing representations that enable the use of materials as inputs or outputs of machine learning models. Furthermore, we discuss how modern ML techniques can learn representations from data and transfer chemical and physical information between tasks. Finally, we outline high-impact questions that have not been fully resolved and thus, require further investigation.Comment: 20 pages, 5 figures, To Appear in Annual Review of Materials Research 5

Similar works

Full text

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2301.08813

Last time updated on 04/02/2023