Alchemical and structural distribution based representation for improved QML
We introduce a representation of any atom in any chemical environment for the
generation of efficient quantum machine learning (QML) models of common
electronic ground-state properties. The representation is based on scaled
distribution functions explicitly accounting for elemental and structural
degrees of freedom. Resulting QML models afford very favorable learning curves
for properties of out-of-sample systems including organic molecules,
non-covalently bonded protein side-chains, (H2O)-clusters, as well as
diverse crystals. The elemental components help to lower the learning curves,
and, through interpolation across the periodic table, even enable "alchemical
extrapolation" to covalent bonding between elements not part of training, as
evinced for single, double, and triple bonds among main-group elements.
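To make the idea concrete, here is a minimal sketch of an element-weighted radial distribution feature for a single atom, assuming Gaussian smearing and nuclear charge as the elemental weight; the paper's actual representation (including its angular and alchemical components) is considerably richer.

```python
import numpy as np

def radial_features(positions, charges, center_idx,
                    r_grid=np.linspace(0.5, 6.0, 32), sigma=0.3):
    """Gaussian-smeared distances from one atom to all neighbours,
    weighted by a simple elemental scaling (nuclear charge here)."""
    center = positions[center_idx]
    feat = np.zeros_like(r_grid)
    for j, (pos, z) in enumerate(zip(positions, charges)):
        if j == center_idx:
            continue
        d = np.linalg.norm(pos - center)
        # elemental degree of freedom: weight each neighbour by its charge
        feat += z * np.exp(-((r_grid - d) ** 2) / (2 * sigma ** 2))
    return feat
```

Comparing such per-atom vectors with a kernel then yields the local similarities on which kernel-based QML models are built.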
A simple yet effective baseline for non-attributed graph classification
Graphs are complex objects that do not lend themselves easily to typical
learning tasks. Recently, a range of approaches based on graph kernels or graph
neural networks have been developed for graph classification and for
representation learning on graphs in general. As the developed methodologies
become more sophisticated, it is important to understand which components of
the increasingly complex methods are necessary or most effective.
As a first step, we develop a simple yet meaningful graph representation, and
explore its effectiveness in graph classification. We test our baseline
representation for the graph classification task on a range of graph datasets.
Interestingly, this simple representation achieves performance similar to that
of state-of-the-art graph kernels and graph neural networks for non-attributed
graph classification. Its performance on classifying attributed graphs is
slightly weaker as it does not incorporate attributes. However, given its
simplicity and efficiency, we believe that it still serves as an effective
baseline for attributed graph classification. Our graph representation is
efficient (linear-time) to compute. We also draw a simple connection to
graph neural networks.
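As an illustration of what a linear-time, degree-based graph summary can look like, here is a hedged sketch using networkx; the exact features used in the paper may differ, and the histogram ranges below are arbitrary choices.

```python
import numpy as np
import networkx as nx

def degree_summary(G, bins=20, max_deg=50):
    """Graph fingerprint from node degrees and mean neighbour degrees,
    computable in time linear in the number of edges."""
    degs = np.array([d for _, d in G.degree()])
    nbr_means = np.array([
        np.mean([G.degree(v) for v in G[u]]) if G.degree(u) > 0 else 0.0
        for u in G
    ])
    h1, _ = np.histogram(degs, bins=bins, range=(0, max_deg), density=True)
    h2, _ = np.histogram(nbr_means, bins=bins, range=(0, max_deg), density=True)
    return np.concatenate([h1, h2])

# The fixed-length vector can be fed to any off-the-shelf classifier,
# e.g. an SVM, to perform graph classification.
```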
Note that these observations are only for the task of graph classification
while existing methods are often designed for a broader scope including node
embedding and link prediction. The results are also likely biased due to the
limited number of benchmark datasets available. Nevertheless, the good
performance of our simple baseline calls for the development of new, more
comprehensive benchmark datasets so as to better evaluate and analyze different
graph learning methods. Furthermore, given the computational efficiency of our
graph summary, we believe that it is a good candidate as a baseline method for
future graph classification (or even other graph learning) studies.

Comment: 13 pages. Shorter version appears at 2019 ICLR Workshop:
Representation Learning on Graphs and Manifolds. arXiv admin note: text
overlap with arXiv:1810.00826 by other authors
Machine-learning of atomic-scale properties based on physical principles
We briefly summarize the kernel regression approach, as used recently in
materials modelling, to fitting functions, particularly potential energy
surfaces, and highlight how the linear algebra framework can be used to both
predict and train from linear functionals of the potential energy, such as the
total energy and atomic forces. We then give a detailed account of the Smooth
Overlap of Atomic Positions (SOAP) representation and kernel, showing how it
arises from an abstract representation of smooth atomic densities, and how it
is related to several popular density-based representations of atomic
structure. We also discuss recent generalisations that allow fine control of
correlations between different atomic species, prediction and fitting of
tensorial properties, and also how to construct structural kernels
(applicable to comparing entire molecules or periodic systems) that go beyond
an additive combination of local environments.
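The fitting step summarized here is standard kernel ridge regression; a minimal sketch follows, assuming a precomputed kernel matrix between structures (which could, for instance, be assembled from SOAP environment kernels):

```python
import numpy as np

def train_krr(K_train, y_train, reg=1e-8):
    """Solve (K + reg*I) alpha = y for the regression weights alpha."""
    n = K_train.shape[0]
    return np.linalg.solve(K_train + reg * np.eye(n), y_train)

def predict_krr(K_test_train, alpha):
    """A prediction is a kernel-weighted sum over training structures."""
    return K_test_train @ alpha
```

Training on linear functionals of the potential energy (total energies, forces) then amounts to applying the corresponding linear operators to the kernel, as the abstract notes.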
Modeling of the Acute Toxicity of Benzene Derivatives by Complementary QSAR Methods
A data set containing acute toxicity values (96-h LC50) of 69 substituted benzenes for
fathead minnow (Pimephales promelas) was investigated with two Quantitative
Structure-Activity Relationship (QSAR) models, one that uses molecular descriptors and
one that does not. Recursive Neural Networks (RNN) derive a QSAR by direct treatment of
the molecular structure, described through an appropriate graphical tool (variable-size
labeled rooted ordered trees) by defining suitable representation rules. The input trees
are encoded by an adaptive process that learns, by tuning its free parameters, from a
given set of structure-activity training examples. Owing to this flexible encoding
approach, the model is target invariant and does not need an a priori definition of
molecular descriptors. The results obtained in this study were analyzed together with
those of a model based on molecular descriptors, i.e. a Multiple Linear Regression (MLR)
model using CROatian MultiRegression selection of descriptors (CROMRsel). The comparison
revealed interesting similarities that could lead to the development of a combined
approach exploiting the complementary characteristics of the two methods.
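For reference, the MLR side of such a comparison is straightforward to reproduce in outline; the sketch below assumes a generic descriptor matrix and is not the CROMRsel descriptor selection itself.

```python
import numpy as np

def fit_mlr(X, y):
    """Ordinary least squares fit with an intercept term.
    X: (n_molecules, n_descriptors) descriptor matrix,
    y: toxicity endpoint, e.g. log(1/LC50) values."""
    X1 = np.hstack([np.ones((X.shape[0], 1)), X])
    coef, *_ = np.linalg.lstsq(X1, y, rcond=None)
    return coef  # intercept followed by one weight per descriptor
```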
Hierarchical Visualization of Materials Space with Graph Convolutional Neural Networks
The combination of high throughput computation and machine learning has led
to a new paradigm in materials design by allowing for the direct screening of
vast portions of structural, chemical, and property space. The use of these
powerful techniques leads to the generation of enormous amounts of data, which
in turn calls for new techniques to efficiently explore and visualize the
materials space to help identify underlying patterns. In this work, we develop
a unified framework to hierarchically visualize the compositional and
structural similarities between materials in an arbitrary material space with
representations learned from different layers of graph convolutional neural
networks. We demonstrate the potential for such a visualization approach by
showing that patterns emerge automatically that reflect similarities at
different scales in three representative classes of materials: perovskites,
elemental boron, and general inorganic crystals, covering material spaces of
different compositions, structures, or both. For perovskites, elemental
similarities are learned that reflect multiple aspects of atomic properties. For
elemental boron, structural motifs emerge automatically showing characteristic
boron local environments. For inorganic crystals, the similarity and stability
of local coordination environments are shown for different combinations of
center and neighbor atoms. The method could help transition to a data-centered
exploration of materials space in automated materials design.

Comment: 22 + 7 pages, 6 + 5 figures
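A hedged sketch of the visualization step: take per-material embeddings from some layer of a trained graph network (assumed precomputed here) and project them to two dimensions; PCA stands in for whatever dimensionality reduction the authors actually employ.

```python
import matplotlib.pyplot as plt
from sklearn.decomposition import PCA

def plot_material_map(embeddings, labels=None):
    """Project (n_materials, d) learned embeddings to 2-D and scatter-plot
    them; clusters then hint at compositional/structural similarity."""
    coords = PCA(n_components=2).fit_transform(embeddings)
    plt.scatter(coords[:, 0], coords[:, 1], c=labels, s=8)
    plt.xlabel("component 1")
    plt.ylabel("component 2")
    plt.show()
```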