33 research outputs found

    Infrared Spectral Analysis for Prediction of Functional Groups Based on Feature-Aggregated Deep Learning

    No full text
    Infrared (IR) spectroscopy is a powerful and versatile tool for analyzing functional groups in organic compounds. A complex and time-consuming interpretation of massive unknown spectra usually requires knowledge of chemistry and spectroscopy. This paper presents a new deep learning method for transforming IR spectral features into intuitive imagelike feature maps and prediction of major functional groups. We obtained 8272 gas-phase IR spectra from the NIST Chemistry WebBook. Feature maps are constructed using the intrinsic correlation of spectral data, and prediction models are developed based on convolutional neural networks. Twenty-one major functional groups for each molecule are successfully identified using binary and multilabel models without expert guidance and feature selection. The multilabel classification model can produce all prediction results simultaneously for rapid characterization. Further analysis of the detailed substructures indicates that our model is capable of obtaining abundant structural information from IR spectra for a comprehensive investigation. The interpretation of our model reveals that the peaks of most interest are similar to those often considered by spectroscopists. In addition to demonstrating great potential for spectral identification, our method may contribute to the development of automated analyses in many fields

    Virtual screening prediction of new potential organocatalysts for direct aldol reactions

    No full text
    A support vector machine (SVM)-based virtual screening method is demonstrated as a rapid computational tool for the prediction of potential asymmetric organocatalysts for the direct aldol reaction. Our models show good accuracy at cross-validation and independent testing. Structure analyses of screening hits from the PubChem database revealed several new classes of compounds, including beta-amino acids, diamines and hydrazides, as potential chiral organocatalysts.</p

    Virtual screening prediction of new potential organocatalysts for direct aldol reactions

    No full text
    A support vector machine (SVM)-based virtual screening method is demonstrated as a rapid computational tool for the prediction of potential asymmetric organocatalysts for the direct aldol reaction. Our models show good accuracy at cross-validation and independent testing. Structure analyses of screening hits from the PubChem database revealed several new classes of compounds, including beta-amino acids, diamines and hydrazides, as potential chiral organocatalysts.</p

    Effect of Selection of Molecular Descriptors on the Prediction of Blood−Brain Barrier Penetrating and Nonpenetrating Agents by Statistical Learning Methods

    No full text
    The ability or inability of a drug to penetrate into the brain is a key consideration in drug design. Drugs for treating central nervous system (CNS) disorders need to be able to penetrate the blood−brain barrier (BBB). BBB nonpenetration is desirable for non-CNS-targeting drugs to minimize potential CNS-related side effects. Computational methods have been employed for the prediction of BBB-penetrating (BBB+) and -nonpenetrating (BBB−) agents at impressive accuracies of 75∼92% and 60∼80%, respectively. However, the majority of these studies give a substantially lower BBB− accuracy, and thus overall accuracy, than the BBB+ accuracy. This work examined whether proper selection of molecular descriptors can improve both the BBB− and the overall accuracies of statistical learning methods. The methods tested include logistic regression, linear discriminate analysis, k nearest neighbor, C4.5 decision tree, probabilistic neural network, and support vector machine. Molecular descriptors were selected by using a feature selection method, recursive feature elimination (RFE). Results by using 415 BBB+ and BBB− agents show that RFE substantially improves both the BBB− and the overall accuracy for all of the methods studied. This suggests that statistical learning methods combined with proper feature selection is potentially useful for facilitating a more balanced and improved prediction of BBB+ and BBB− agents

    The targets and potency-enhancing synergistic molecular modes of the anticancer combination of Tetraarsenic tetrasulfide, Indirubin, and Tanshinone IIA (anticancer synergism reported).

    No full text
    <p>The detailed descriptions of the relevant molecular interaction profiles are in <a href="http://www.plosone.org/article/info:doi/10.1371/journal.pone.0049969#pone.0049969.s007" target="_blank">Table S7</a>.</p

    Synergism level of 124 synergistic NP combinations.

    No full text
    <p>VSS, SS, S, MS, sS: very strong, strong, normal, moderate, slight synergism, NA: nearly additive, SA, MA: slight, moderate antagonism.</p

    Expression profiles of the primary targets and some of the potency-enhancing secondary targets of the selected natural product combinations in specific patient groups.

    No full text
    <p>Expression profiles of the primary targets and some of the potency-enhancing secondary targets of the selected natural product combinations in specific patient groups.</p
    corecore