Search CORE

45 research outputs found

Deep learning of experimental electrochemistry for battery cathodes across diverse compositions

Author: Ceder Gerbrand
Deng Bowen
He Tanjin
Lun Zhengyan
Zhong Peichen
Publication venue
Publication date: 11/04/2023
Field of study

Artificial intelligence (AI) has emerged as a powerful tool in the discovery and optimization of novel battery materials. However, the adoption of AI in battery cathode representation and discovery is still limited due to the complexity of optimizing multiple performance properties and the scarcity of high-fidelity data. In this study, we present a comprehensive machine-learning model (DRXNet) for battery informatics and demonstrate the application in discovery and optimization of disordered rocksalt (DRX) cathode materials. We have compiled the electrochemistry data of DRX cathodes over the past five years, resulting in a dataset of more than 30,000 discharge voltage profiles with 14 different metal species. Learning from this extensive dataset, our DRXNet model can automatically capture critical features in the cycling curves of DRX cathodes under various conditions. Illustratively, the model gives rational predictions of the discharge capacity for diverse compositions in the Li--Mn--O--F chemical space and high-entropy systems. As a universal model trained on diverse chemistries, our approach offers a data-driven solution to facilitate the rapid identification of novel cathode materials, accelerating the development of next-generation batteries for carbon neutralization

arXiv.org e-Print Archive

Machine-learning rationalization and prediction of solid-state synthesis conditions

Author: Bartel Christopher J.
Ceder Gerbrand
Dunn Alexander
He Tanjin
Huo Haoyan
Jain Anubhav
Ouyang Bin
Trewartha Amalie
Publication venue
Publication date: 17/04/2022
Field of study

There currently exist no quantitative methods to determine the appropriate conditions for solid-state synthesis. This not only hinders the experimental realization of novel materials but also complicates the interpretation and understanding of solid-state reaction mechanisms. Here, we demonstrate a machine-learning approach that predicts synthesis conditions using large solid-state synthesis datasets text-mined from scientific journal articles. Using feature importance ranking analysis, we discovered that optimal heating temperatures have strong correlations with the stability of precursor materials quantified using melting points and formation energies (

\Delta G_f

\Delta H_f

). In contrast, features derived from the thermodynamics of synthesis-related reactions did not directly correlate to the chosen heating temperatures. This correlation between optimal solid-state heating temperature and precursor stability extends Tamman's rule from intermetallics to oxide systems, suggesting the importance of reaction kinetics in determining synthesis conditions. Heating times are shown to be strongly correlated with the chosen experimental procedures and instrument setups, which may be indicative of human bias in the dataset. Using these predictive features, we constructed machine-learning models with good performance and general applicability to predict the conditions required to synthesize diverse chemical systems. Codes and data used in this work can be found at: https://github.com/CederGroupHub/s4

arXiv.org e-Print Archive

PubMed Central

eScholarship - University of California

Recommended from our members

Author Correction: Text-mined dataset of inorganic materials synthesis recipes.

Author: Botari Tiago
Ceder Gerbrand
He Tanjin
Huo Haoyan
Kononova Olga
Rong Ziqin
Sun Wenhao
Tshitoyan Vahe
Publication venue: eScholarship, University of California
Publication date: 01/11/2019
Field of study

An amendment to this paper has been published and can be accessed via a link at the top of the paper

eScholarship - University of California

Precursor recommendation for inorganic synthesis by machine learning materials similarity from scientific literature

Author: He Tanjin
Publication venue: Dryad
Publication date: 01/01/2023
Field of study

Synthesis prediction is a key accelerator for the rapid design of advanced materials. However, determining synthesis variables such as the choice of precursor materials is challenging for inorganic materials because the sequence of reactions during heating is not well understood. In this work, we use a knowledge base of 29,900 solid-state synthesis recipes, text-mined from the scientific literature, to automatically learn which precursors to recommend for the synthesis of a novel target material. The data-driven approach learns chemical similarity of materials and refers the synthesis of a new target to precedent synthesis procedures of similar materials, mimicking human synthesis design. When proposing five precursor sets for each of 2,654 unseen test target materials, the recommendation strategy achieves a success rate of at least 82%. Our approach captures decades of heuristic synthesis data in a mathematical form, making it accessible for use in recommendation engines and autonomous laboratories.Funding provided by: U.S. Department of EnergyCrossref Funder Registry ID: http://dx.doi.org/10.13039/100000015Award Number: DE-AC02-05-CH11231, D2S2 program KCD2S2Funding provided by: National Science FoundationCrossref Funder Registry ID: http://dx.doi.org/10.13039/100000001Award Number: DMR-192237

Ezid

ZENODO

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Recommended from our members

Machine Learning Inorganic Solid-state Synthesis from Materials Science Literature

Author: He Tanjin
Publication venue: eScholarship, University of California
Publication date: 01/01/2023
Field of study

Solid-state synthesis prediction is a key accelerator for the rapid design of advanced inorganic materials. However, determining synthesis variables such as the choice of precursor materials is challenging for inorganic materials because the sequence of reactions during heating is not well understood. To achieve predictive synthesis for the desired material, one potential approach is to learn synthesis design patterns from a large volume of experimental synthesis procedures. Nevertheless, a comprehensive, large-scale database of structured synthesis procedures for inorganic materials does not exist. Provided the ability of converting unstructured text to structured information, the decades of solid-state chemistry literature constitutes a treasure trove of synthesis data. Therefore, this study aims at: (1) developing natural language processing (NLP) algorithms to text mine a large-scale inorganic synthesis dataset from materials science literature, and (2) developing machine learning algorithms for precursor selection in solid-state synthesis based on the text-mined dataset.Although many general-purpose NLP methods exist, text mining for inorganic synthesis requires dedicated development of models for information retrieval (Chapter 2). During the development of a text-mining pipeline, one major problem is the difficulty of identifying which materials from a synthesis paragraph are precursors or are target materials. In this study, we developed a two-step Chemical Named Entity Recognition (CNER) model to identify precursors and targets, based on information from the context around material entities. By integrating our information retrieval model for precursors and targets, and also the ones for other synthesis variables, we established a fully automated text-mining pipeline that extracts the structured data of synthesis procedures from the literature. Starting from 4,973,165 materials science papers, we applied our text-mining pipeline and successfully extracted 33,343 solid-state synthesis procedures. The quality of the text-mined synthesis dataset is validated by the high accuracy of 93% at the chemistry level, where each extracted reaction has the target and precursor materials consistent with the original literature report. This dataset for inorganic solid-state synthesis is currently the largest of its kind and paves the way toward the development of data-driven approaches for rational synthesis design.Using the extracted data, we conducted a meta-analysis to study the similarities and differences between precursors in the context of solid-state synthesis (Chapter 3). To quantify precursor similarity, we built a substitution model to calculate the viability of substituting one precursor with another while retaining the target. From a hierarchical clustering of the precursors, we demonstrate that the "chemical similarity" of precursors can be extracted from text data, without the need to include any explicit domain knowledge. Quantifying the similarity of precursors offers a reference for suggesting candidate reactants when researchers alter existing recipes by replacing precursors. The capability of creating alternative recipes constitutes an important step toward developing a predictive synthesis model.While the selection of alternative precursors is enabled by the similarity of precursors, it is limited to existing materials. To learn which precursors to recommend for the synthesis of a novel target material, we further developed a representation learning model to evaluate the similarity of targets (Chapter 4). The data-driven approach learns "chemical similarity" of target materials and refers the synthesis of a new target to precedent synthesis procedures of similar target materials, mimicking human synthesis design. When proposing five precursor sets for each of 2,654 unseen test target materials, our recommendation strategy achieves a success rate of at least 82%. Our approach captures decades of heuristic synthesis data in a mathematical form, making it accessible for use in recommendation engines and autonomous laboratories. Overall, this study contributes a valuable large-scale synthesis dataset and interpretable precursor selection algorithms to the materials science community, representing a step forward in the prediction of solid-state synthesis

eScholarship - University of California

Recommended from our members

On the controlling mechanism of the upper turnover states in the NTC regime

Author: Farooq Aamir
He Tanjin
He Xin
Ji Weiqi
Law Chung K
Zhao Peng
Publication venue
Publication date: 01/02/2016
Field of study

Using n-butane, n-heptane and iso-octane as representative fuels exhibiting NTC (negative temperature coefficient) behavior, comprehensive computational studies with detailed mechanisms and theoretical analysis were performed to investigate the upper stationary point, denoted as turnover states, on the NTC curve near the higher temperature regime, where the ignition delay τ exhibits a local maximum. It is found that the global behavior of the turnover states exhibits distinctive thermodynamic and kinetic characteristics under different pressures, in that the ignition delay at the turnover states shows an Arrhenius dependence on the temperature T and an approximate inverse quadratic power law dependence on the pressure P. These global behaviors imply that the temperature and pressure of the turnover states are not independent and can be correlated by Arrhenius dependence, as ln P ∞ 1/T. Further theoretical analyses demonstrate that such turnover states result from the competition between the low-temperature chain branching reactions and the decomposition of the intermediate species, and therefore correspond to a critical value, α, of the ratio of OH production from low-temperature chemistry. In addition, the ignition delay at the turnover state can be well correlated by the analytical expression derived by Peters et al., with the further demonstration that the pressure dependence of the turnover ignition delay mainly result from the H2O2 decomposition reaction. Comparison of the present results with the literature experimental data of n-heptane ignition delay time shows very good agreement

Princeton University Open Access Repository

Crossref

Theoretical Analysis of the Effect of CC Double Bonds on the Low-Temperature Reactivity of Alkenylperoxy Radicals

Author: Tanjin He (2891264)
Xiaoqing You (1351686)
Yawei Chi (2891261)
Publication venue
Publication date
Field of study

Biodiesel contains a large proportion of unsaturated fatty acid methyl esters. Its combustion characteristics, especially its ignition behavior at low temperatures, have been greatly affected by these CC double bonds. In this work, we performed a theoretical analysis of the effect of CC double bonds on the low-temperature reactivity of alkenylperoxy radicals, the key intermediates from the low-temperature combustion of biodiesel. To understand how double bonds affect the fate of peroxy radicals, we selected three representative peroxy radicals from heptane, heptene, and heptadiene having zero, one, and two double CC bonds, respectively, for study. The potential energy surfaces were explored at the CBS-QB3 level, and the reaction rate constants were computed using canonical/variational transition state theories. We have found that the double bond is responsible for the very different bond dissociation energies of the various types of C–H bonds, which in turn affect significantly the reaction kinetics of alkenylperoxy radicals

FigShare

Synthetic accessibility and stability rules of NASICONs

Author: Bartel Christopher J.
Ceder Gerbrand
He Tanjin
Huo Haoyan
Kim Haegyeom
Lacivita Valentina
Ouyang Bin
Wang Jingyang
Wang Yan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 09/09/2021
Field of study

In this paper we develop the stability rules for NASICON structured materials, as an example of compounds with complex bond topology and composition. By applying machine learning to the ab-initio computed phase stability of 3881 potential NASICONs we can extract a simple two-dimensional descriptor that is extremely good at separating stable from unstable NASICONS. This machine-learned "tolerance factor" contains information on the Na content, the radii and electronegativities of the elements, and the Madelung energy. We test the predictive capability of this approach by selecting six predicted NASICON compositions. Five out of the six resulted in a phase pure NASICON while the sixth composition led to a NASICON that coexisted with other phases, validating the efficacy of this approach. This work not only provide tools to understand synthetic accessibility of NASICON-type materials, but also demonstrate an efficient paradigm for discovering new materials with complicate composition and atomic structure

arXiv.org e-Print Archive

eScholarship - University of California

Opportunities and challenges of text mining in materials research

Author: Ceder Gerbrand
He Tanjin
Huo Haoyan
Kononova Olga
Olivetti Elsa A
Trewartha Amalie
Publication venue: 'Elsevier BV'
Publication date: 18/05/2022
Field of study

© 2021 The Author(s) Research publications are the major repository of scientific knowledge. However, their unstructured and highly heterogenous format creates a significant obstacle to large-scale analysis of the information contained within. Recent progress in natural language processing (NLP) has provided a variety of tools for high-quality information extraction from unstructured text. These tools are primarily trained on non-technical text and struggle to produce accurate results when applied to scientific text, involving specific technical terminology. During the last years, significant efforts in information retrieval have been made for biomedical and biochemical publications. For materials science, text mining (TM) methodology is still at the dawn of its development. In this review, we survey the recent progress in creating and applying TM and NLP approaches to materials science field. This review is directed at the broad class of researchers aiming to learn the fundamentals of TM as applied to the materials science publications

DSpace@MIT

Recommended from our members

Opportunities and challenges of text mining in aterials research.

Author: Ceder Gerbrand
He Tanjin
Huo Haoyan
Kononova Olga
Olivetti Elsa A
Trewartha Amalie
Publication venue: eScholarship, University of California
Publication date: 01/03/2021
Field of study

Research publications are the major repository of scientific knowledge. However, their unstructured and highly heterogenous format creates a significant obstacle to large-scale analysis of the information contained within. Recent progress in natural language processing (NLP) has provided a variety of tools for high-quality information extraction from unstructured text. These tools are primarily trained on non-technical text and struggle to produce accurate results when applied to scientific text, involving specific technical terminology. During the last years, significant efforts in information retrieval have been made for biomedical and biochemical publications. For materials science, text mining (TM) methodology is still at the dawn of its development. In this review, we survey the recent progress in creating and applying TM and NLP approaches to materials science field. This review is directed at the broad class of researchers aiming to learn the fundamentals of TM as applied to the materials science publications

eScholarship - University of California