17,569 research outputs found
Recommended from our members
Unsupervised word embeddings capture latent knowledge from materials science literature.
The overwhelming majority of scientific knowledge is published as text, which is difficult to analyse by either traditional statistical analysis or modern machine learning methods. By contrast, the main source of machine-interpretable data for the materials research community has come from structured property databases1,2, which encompass only a small fraction of the knowledge present in the research literature. Beyond property values, publications contain valuable knowledge regarding the connections and relationships between data items as interpreted by the authors. To improve the identification and use of this knowledge, several studies have focused on the retrieval of information from scientific literature using supervised natural language processing3-10, which requires large hand-labelled datasets for training. Here we show that materials science knowledge present in the published literature can be efficiently encoded as information-dense word embeddings11-13 (vector representations of words) without human labelling or supervision. Without any explicit insertion of chemical knowledge, these embeddings capture complex materials science concepts such as the underlying structure of the periodic table and structure-property relationships in materials. Furthermore, we demonstrate that an unsupervised method can recommend materials for functional applications several years before their discovery. This suggests that latent knowledge regarding future discoveries is to a large extent embedded in past publications. Our findings highlight the possibility of extracting knowledge and relationships from the massive body of scientific literature in a collective manner, and point towards a generalized approach to the mining of scientific literature
A Map of the Inorganic Ternary Metal Nitrides
Exploratory synthesis in novel chemical spaces is the essence of solid-state
chemistry. However, uncharted chemical spaces can be difficult to navigate,
especially when materials synthesis is challenging. Nitrides represent one such
space, where stringent synthesis constraints have limited the exploration of
this important class of functional materials. Here, we employ a suite of
computational materials discovery and informatics tools to construct a large
stability map of the inorganic ternary metal nitrides. Our map clusters the
ternary nitrides into chemical families with distinct stability and
metastability, and highlights hundreds of promising new ternary nitride spaces
for experimental investigation--from which we experimentally realized 7 new Zn-
and Mg-based ternary nitrides. By extracting the mixed metallicity, ionicity,
and covalency of solid-state bonding from the DFT-computed electron density, we
reveal the complex interplay between chemistry, composition, and electronic
structure in governing large-scale stability trends in ternary nitride
materials
The 2019 materials by design roadmap
Advances in renewable and sustainable energy technologies critically depend on our ability to design and realize materials with optimal properties. Materials discovery and design efforts ideally involve close coupling between materials prediction, synthesis and characterization. The increased use of computational tools, the generation of materials databases, and advances in experimental methods have substantially accelerated these activities. It is therefore an opportune time to consider future prospects for materials by design approaches. The purpose of this Roadmap is to present an overview of the current state of computational materials prediction, synthesis and characterization approaches, materials design needs for various technologies, and future challenges and opportunities that must be addressed. The various perspectives cover topics on computational techniques, validation, materials databases, materials informatics, high-throughput combinatorial methods, advanced characterization approaches, and materials design issues in thermoelectrics, photovoltaics, solid state lighting, catalysts, batteries, metal alloys, complex oxides and transparent conducting materials. It is our hope that this Roadmap will guide researchers and funding agencies in identifying new prospects for materials design
icet - A Python library for constructing and sampling alloy cluster expansions
Alloy cluster expansions (CEs) provide an accurate and computationally
efficient mapping of the potential energy surface of multi-component systems
that enables comprehensive sampling of the many-dimensional configuration
space. Here, we introduce \textsc{icet}, a flexible, extensible, and
computationally efficient software package for the construction and sampling of
CEs. \textsc{icet} is largely written in Python for easy integration in
comprehensive workflows, including first-principles calculations for the
generation of reference data and machine learning libraries for training and
validation. The package enables training using a variety of linear regression
algorithms with and without regularization, Bayesian regression, feature
selection, and cross-validation. It also provides complementary functionality
for structure enumeration and mapping as well as data management and analysis.
Potential applications are illustrated by two examples, including the
computation of the phase diagram of a prototypical metallic alloy and the
analysis of chemical ordering in an inorganic semiconductor.Comment: 10 page
- …