5,007 research outputs found

    Insightful classification of crystal structures using deep learning

    Full text link
    Computational methods that automatically extract knowledge from data are critical for enabling data-driven materials science. A reliable identification of lattice symmetry is a crucial first step for materials characterization and analytics. Current methods require a user-specified threshold, and are unable to detect average symmetries for defective structures. Here, we propose a machine-learning-based approach to automatically classify structures by crystal symmetry. First, we represent crystals by calculating a diffraction image, then construct a deep-learning neural-network model for classification. Our approach is able to correctly classify a dataset comprising more than 100 000 simulated crystal structures, including heavily defective ones. The internal operations of the neural network are unraveled through attentive response maps, demonstrating that it uses the same landmarks a materials scientist would use, although never explicitly instructed to do so. Our study paves the way for crystal-structure recognition of - possibly noisy and incomplete - three-dimensional structural data in big-data materials science.Comment: Nature Communications, in press (2018

    Machine learning for ultrafast X-ray diffraction patterns on large-scale GPU clusters

    Full text link
    The classical method of determining the atomic structure of complex molecules by analyzing diffraction patterns is currently undergoing drastic developments. Modern techniques for producing extremely bright and coherent X-ray lasers allow a beam of streaming particles to be intercepted and hit by an ultrashort high energy X-ray beam. Through machine learning methods the data thus collected can be transformed into a three-dimensional volumetric intensity map of the particle itself. The computational complexity associated with this problem is very high such that clusters of data parallel accelerators are required. We have implemented a distributed and highly efficient algorithm for inversion of large collections of diffraction patterns targeting clusters of hundreds of GPUs. With the expected enormous amount of diffraction data to be produced in the foreseeable future, this is the required scale to approach real time processing of data at the beam site. Using both real and synthetic data we look at the scaling properties of the application and discuss the overall computational viability of this exciting and novel imaging technique

    Advanced Imaging Techniques for Point-Measurement Analysis of Pharmaceutical Materials

    Get PDF
    Drugs are an essential element protecting human lives from many diseases such as cancer, diabetes, and cardiovascular disorders. One of the highlights in drug development in recent years is the establishment of rational drug design: a collection of various multi-disciplinary approaches that at the core, focus on designing molecules with specific properties for identified targets and biomolecules with known functional roles and structural information. The candidate molecules will then go through a series of examinations to characterize their physiochemical properties, and an iterative process is used to improve the design of the drug to achieve desirable attributes. The time consuming and highly expensive nature of drug development constantly calls for new analytical techniques that have increasingly higher throughput, faster analysis speed, richer chemical and structural information, and lower risk and cost. Conventional analytical methods for pharmaceutical materials, such as X-ray diffraction analysis and Raman spectroscopy, often suffer from prolonged measurement time. In many cases, the identification of regions of interest within the sample is non-trivial in itself. Nonlinear optical imaging techniques, including second harmonic generation (SHG) microscopy and two-photon excited ultraviolet fluorescence (TPE-UVF) microscopy were developed as fast, real-time, and non-destructive methods for selective identification and characterization of crystalline materials present in pharmaceutical samples. These techniques were integrated with synchrotron X-ray diffraction analysis and Raman spectroscopy to significantly reduce the overall measurement time of these structure characterization techniques. In the meanwhile, with the now increased speed of measurement, the amount of experimental data acquired per unit time has also drastically increased. The rate at which data are analyzed, digested, and interpreted is becoming the bottleneck in data-driving decision-making. Novel electronics that only collect data at the most information-rich time points were employed to significantly increase the signal-to-noise ratio (SNR) during data acquisition, reducing the total amount of data needed for material characterization. Advanced sampling algorithms to reduce the total amount of measurements required for perfect data space reconstruction, automated programs for data acquisition and analysis, and efficient data analysis algorithms based on machine learning were developed for accelerated data processing for nonlinear optical imaging analysis, Raman spectra processing, and X-ray diffraction indexing

    PeakNet: Bragg peak finding in X-ray crystallography experiments with U-Net

    Full text link
    Serial crystallography at X-ray free electron laser (XFEL) sources has experienced tremendous progress in achieving high data rate in recent times. While this development offers potential to enable novel scientific investigations, such as imaging molecular events at logarithmic timescales, it also poses challenges in regards to real-time data analysis, which involves some degree of data reduction to only save those features or images pertaining to the science on disks. If data reduction is not effective, it could directly result in a substantial increase in facility budgetary requirements, or even hinder the utilization of ultra-high repetition imaging techniques making data analysis unwieldy. Furthermore, an additional challenge involves providing real-time feedback to users derived from real-time data analysis. In the context of serial crystallography, the initial and critical step in real-time data analysis is finding X-ray Bragg peaks from diffraction images. To tackle this challenge, we present PeakNet, a Bragg peak finder that utilizes neural networks and runs about four times faster than Psocake peak finder, while delivering significantly better indexing rates and comparable number of indexed events. We formulated the task of peak finding into a semantic segmentation problem, which is implemented as a classical U-Net architecture. A key advantage of PeakNet is its ability to scale linearly with respect to data volume, making it well-suited for real-time serial crystallography data analysis at high data rates

    Multivariate Analysis Applications in X-ray Diffraction

    Get PDF
    : Multivariate analysis (MA) is becoming a fundamental tool for processing in an efficient way the large amount of data collected in X-ray diffraction experiments. Multi-wedge data collections can increase the data quality in case of tiny protein crystals; in situ or operando setups allow investigating changes on powder samples occurring during repeated fast measurements; pump and probe experiments at X-ray free-electron laser (XFEL) sources supply structural characterization of fast photo-excitation processes. In all these cases, MA can facilitate the extraction of relevant information hidden in data, disclosing the possibility of automatic data processing even in absence of a priori structural knowledge. MA methods recently used in the field of X-ray diffraction are here reviewed and described, giving hints about theoretical background and possible applications. The use of MA in the framework of the modulated enhanced diffraction technique is described in detail

    A deep convolutional neural network for real-time full profile analysis of big powder diffraction data

    Get PDF
    We present Parameter Quantification Network (PQ-Net), a regression deep convolutional neural network providing quantitative analysis of powder X-ray diffraction patterns from multi-phase systems. The network is tested against simulated and experimental datasets of increasing complexity with the last one being an X-ray diffraction computed tomography dataset of a multi-phase Ni-Pd/CeO2-ZrO2/Al2O3 catalytic material system consisting of ca. 20,000 diffraction patterns. It is shown that the network predicts accurate scale factor, lattice parameter and crystallite size maps for all phases, which are comparable to those obtained through full profile analysis using the Rietveld method, also providing a reliable uncertainty measure on the results. The main advantage of PQ-Net is its ability to yield these results orders of magnitude faster showing its potential as a tool for real-time diffraction data analysis during in situ/operando experiments
    • …
    corecore