11 research outputs found

    Curated dataset of asphaltene structures

    Get PDF
    Asphaltenes, a distinct class of molecules found in crude oil, exhibit insolubility in nonpolar solvents like n-heptane but are soluble in aromatic solvents such as toluene and benzene. Understanding asphaltenes is crucial in the petroleum industry due to their detrimental effects on oil processing, resulting in significant economic losses and production disruptions. While no singular structure defines asphaltenes, two major molecular architectures, namely archipelago and continental models, have gained wide acceptance for their consistency with various experimental investigations and subsequent use in computational studies. The archipelago model comprises two or more polyaromatic hydrocarbon entities interconnected via aliphatic side chains. In contrast, the island or continental model features a unified polyaromatic hydrocarbon moiety with 4 to 10 fused aromatic rings, averaging around 7 rings. To establish a comprehensive collection, we meticulously curated over 250 asphaltene structures derived from previous experimental and computational studies in this field. Our curation process involved an extensive literature survey, conversion of figures from publications into molecular structure files, careful verification of conversion accuracy, and structure editing to ensure alignment with molecular formulas. Our database provides digital structure files and optimized geometries for both predominant structural motifs. The optimization procedure commenced with the PM6 semi-empirical method, followed by further optimization utilizing density functional theory employing the B3LYP functional and the 6-31+G(d,p) basis set. Furthermore, we compiled a range of structural and electronic features for these molecules, serving as a valuable foundation for employing machine learning algorithms to investigate asphaltenes. This work provides a ready to use structural database of asphaltenes and sets the stage for future research endeavours in this domain

    Machine learning to identify structural motifs in asphaltenes

    Get PDF
    Asphaltenes are organic compounds that aggregate in crude oil with two dominant molecular architectures: archipelago and continental. Continental architectures possess a single uniform island structure composed of aromatic rings in contrast to archipelago architectures with aromatic cores interconnected through aliphatic chains. The structural composition of asphaltenes varies globally due to geographical differences, posing challenges in their classification due to a lack of uniformity. This study is the first known exploration of using image-based supervised machine learning, particularly the ResNet-50 neural network, for the binary classification of asphaltenes into continental and archipelago motifs. 255 continental and archipelago models underwent structural augmentations to create a sample size of 1,530 asphaltene structures that is robust enough for accurate results in both the training and testing portions of the machine learning. These augmentations included the repeated addition of carbons until a complete pentane chain was added to a specified carbon on each asphaltene structure. Using Mathematica, supervised ResNet-50 image-based classification was used on both original and augmented structure datasets to classify as either archipelago or continental. The classification was also implemented using topological similarity searching for association between atoms and the distance between them for further molecule identification. This study demonstrates the surprising effectiveness of image-based classification compared to traditional topological feature-based methods. Our results reveal that deep learning techniques, especially image-based approaches, provide novel and insightful ways to differentiate complex molecular structures like asphaltenes, challenging the traditional reliance on topological features alone. This research opens new avenues in chemical analysis and molecular characterization, highlighting the potential of machine learning in complex molecular systems

    AI is a viable alternative to high throughput screening: a 318-target study

    Get PDF
    : High throughput screening (HTS) is routinely used to identify bioactive small molecules. This requires physical compounds, which limits coverage of accessible chemical space. Computational approaches combined with vast on-demand chemical libraries can access far greater chemical space, provided that the predictive accuracy is sufficient to identify useful molecules. Through the largest and most diverse virtual HTS campaign reported to date, comprising 318 individual projects, we demonstrate that our AtomNet® convolutional neural network successfully finds novel hits across every major therapeutic area and protein class. We address historical limitations of computational screening by demonstrating success for target proteins without known binders, high-quality X-ray crystal structures, or manual cherry-picking of compounds. We show that the molecules selected by the AtomNet® model are novel drug-like scaffolds rather than minor modifications to known bioactive compounds. Our empirical results suggest that computational methods can substantially replace HTS as the first step of small-molecule drug discovery

    Curated dataset of asphaltene structures

    No full text
    Asphaltenes, a distinct class of molecules found in crude oil, exhibit insolubility in nonpolar solvents like n-heptane but are soluble in aromatic solvents such as toluene and benzene. Understanding asphaltenes is crucial in the petroleum industry due to their detrimental effects on oil processing, resulting in significant economic losses and production disruptions. While no singular structure defines asphaltenes, two major molecular architectures, namely archipelago and continental models, have gained wide acceptance for their consistency with various experimental investigations and subsequent use in computational studies.The archipelago model comprises two or more polyaromatic hydrocarbon entities interconnected via aliphatic side chains. In contrast, the island or continental model features a unified polyaromatic hydrocarbon moiety with 4 to 10 fused aromatic rings, averaging around 7 rings. To establish a comprehensive collection, we meticulously curated over 250 asphaltene structures derived from previous experimental and computational studies in this field. Our curation process involved an extensive literature survey, conversion of figures from publications into molecular structure files, careful verification of conversion accuracy, and structure editing to ensure alignment with molecular formulas. Our database provides digital structure files and optimized geometries for both predominant structural motifs. The optimization procedure commenced with the PM6 semi-empirical method, followed by further optimization utilizing density functional theory employing the B3LYP functional and the 6-31+G(d,p) basis set. Furthermore, we compiled a range of structural and electronic features for these molecules, serving as a valuable foundation for employing machine learning algorithms to investigate asphaltenes. This work provides a ready to use structural database of asphaltenes and sets the stage for future research endeavours in this domain

    TRIQS/tprf: Version 3.3.0

    No full text
    <p>TPRF version 3.3.0 is a compatibility release for TRIQS version 3.3.0.</p> <p>We thank all contributors: Thomas Hahn, Alexander Hampel, Henri Menke, Hugo U. R. Strand, Yann in 't Veld, Nils Wentzell</p> <p>A detailed list of changes is provided in the <a href="https://github.com/TRIQS/tprf/blob/3.3.x/doc/ChangeLog.md">ChangeLog</a>.</p&gt

    TRIQS/tprf: Version 3.2.1

    No full text
    TPRF version 3.2.1 is a patch release that fixes an issue with the conda package building pipeline. We thank all contributors: Hugo U. R. Strand, Yann in 't Veld, Nils Wentzell Find below an itemized list of changes in this release. Documentation Correct Ubuntu Version in install instructions Fixed typos in GW indices in documentation Hartree-Fock Stablizie chemical potential finder that broke the conda package pipeline. Discrete Lehmann Representation (DLR) support Fix helper function for Pade analytical contiuation on momentum dependent DLR Green's function

    Dictionnaire des intellectuel.les au Québec

    No full text
    Qui connaît vraiment les intellectuel.les hors du cercle restreint des historiens et des littéraires ? Quelle mémoire avons-nous de celles et ceux qui, au Québec, eurent recours à la parole comme « mode d'action » ? Qui, comme Hubert Aquin, entreprirent et entreprennent encore de « comprendre dangereusement » la culture et la société de leur époque, remuant idées et images, bousculant pouvoirs et doxa ? Ce dictionnaire est conçu pour combler les lacunes d'une mémoire collective quelque peu défaillante, mais aussi pour donner envie de lire ou de relire les textes de ces femmes et hommes passionnés par les idées, qui ont contribué - et qui contribuent toujours - à bâtir la société québécoise. On y trouvera les noms de celles et ceux qui, depuis trois siècles, interviennent sur la place publique et soulèvent des questions d'intérêt civique et politique à propos d'enjeux collectifs importants ; de celles et ceux qui promeuvent ou incarnent la liberté de parole et la défendent contre différents pouvoirs et structures organisationnelles

    Dictionnaire

    No full text
    ACTION FRANÇAISE (1917-1928) / ACTION NATIONALE (1933- ) Lancée une dizaine d’années après l’affaire Dreyfus, L’Action française (AF) de Montréal existe toujours sous le titre L’Action nationale (AN). C’est dire la place de la revue pour l’histoire des intellectuels, d’autant plus qu’au-delà de La Revue canadienne (1864-1922) et des journaux nationalistes du début du siècle – Le Nationaliste (1903) d’Olivar Asselin*, L’Action (1911-1916) de Jules Fournier*, Le Devoir (1910) d’Henri Bourassa* ..

    Dictionnaire

    No full text
    corecore