3 research outputs found

    Representations of Materials for Machine Learning

    Full text link
    High-throughput data generation methods and machine learning (ML) algorithms have given rise to a new era of computational materials science by learning relationships among composition, structure, and properties and by exploiting such relations for design. However, to build these connections, materials data must be translated into a numerical form, called a representation, that can be processed by a machine learning model. Datasets in materials science vary in format (ranging from images to spectra), size, and fidelity. Predictive models vary in scope and property of interests. Here, we review context-dependent strategies for constructing representations that enable the use of materials as inputs or outputs of machine learning models. Furthermore, we discuss how modern ML techniques can learn representations from data and transfer chemical and physical information between tasks. Finally, we outline high-impact questions that have not been fully resolved and thus, require further investigation.Comment: 20 pages, 5 figures, To Appear in Annual Review of Materials Research 5

    Combining density functional theory and machine learning for optimization of multicomponent oxide electrocatalysts

    No full text
    Multicomponent metal oxides, such as perovskite oxides, hold promise for use as sustainable alternatives to Ir-, Ru-, and Pt-based electrocatalysts at scale. Perovskites can accommodate a wide variety of elements in their A- and B-sites, enabling tuning of their structural and electronic properties through compositional alloying. These properties, which are obtainable from density functional theory (DFT) calculations, can be used as low-dimensional descriptors that correlate with experimental stability and activity in, for example, the oxygen evolution reaction (OER). Established descriptors of stability include energy above convex hull and energy above Pourbaix hull, while those for catalytic activity include oxygen 2p- and B-site metal d-band centers, for example. We are therefore presented with a combinatorial problem of determining which A- and B-site compositions optimize such descriptors. The compositional search space of ₓ′₁₋ₓᵧ′₁₋ᵧ₃ perovskites with up to two different elements in A- and B-sites is at least (10⁶), making it intractable to calculate descriptors exhaustively using DFT. We therefore combine high-throughput DFT calculations with crystal-based graph neural networks to screen multicomponent perovskites. Using a high-throughput virtual screening platform, a DFT-simulated dataset of over 5,000 multicomponent perovskites was generated, with varied A- and Bsite alloying ratios and over 3,000 unique cationic combinations. Leveraging this dataset, alongside calculations available in the literature, graph convolutional neural networks (GNNs) were trained to predict the aforementioned crystal descriptors from unrelaxed cubic structures and used to predict descriptors for (10⁶) ₓ′₁₋ₓᵧ′₁₋ᵧ₃ perovskites. GNNs were also combined with baseline estimates of multicomponent perovskite properties calculated as interpolations of constituent ₃ perovskites, thereby achieving improved model performance. Moreover, impacts of varied cationic ordering were modelled, showing that different decorations of cations within the perovskite lattice can modulate resulting properties to the same degree as—or more than—varying compositional ratios. Equivariant message passing neural networks were thus implemented to achieve cation decoration-aware property predictions. Lastly, GNNs predicting per-site properties were established, encoding local chemical environments to provide physical insights about each atom in a crystal lattice. The presented work provides the community with a benchmark multicomponent perovskite dataset, improved machine learning models, and physical insights to be used in further studies of alloyed perovskites, and thus lays groundwork for improved design of multicomponent oxide electrocatalysts.S.M
    corecore