107 research outputs found

    Towards Foundational Models for Molecular Learning on Large-Scale Multi-Task Datasets

    Recently, pre-trained foundation models have enabled significant advancements in multiple fields. In molecular machine learning, however, where datasets are often hand-curated, and hence typically small, the lack of datasets with labeled features, and codebases to manage those datasets, has hindered the development of foundation models. In this work, we present seven novel datasets categorized by size into three distinct categories: ToyMix, LargeMix and UltraLarge. These datasets push the boundaries in both the scale and the diversity of supervised labels for molecular learning. They cover nearly 100 million molecules and over 3000 sparsely defined tasks, totaling more than 13 billion individual labels of both quantum and biological nature. In comparison, our datasets contain 300 times more data points than the widely used OGB-LSC PCQM4Mv2 dataset, and 13 times more than the quantum-only QM1B dataset. In addition, to support the development of foundational models based on our proposed datasets, we present the Graphium graph machine learning library, which simplifies the process of building and training molecular machine learning models for multi-task and multi-level molecular datasets. Finally, we present a range of baseline results as a starting point of multi-task and multi-level training on these datasets. Empirically, we observe that performance on low-resource biological datasets shows improvement by also training on large amounts of quantum data. This indicates that there may be potential in multi-task and multi-level training of a foundation model and fine-tuning it to resource-constrained downstream tasks.

    Measurement of the W±Z boson pair-production cross section in pp collisions at √s=13 TeV with the ATLAS detector


    Measurement of the inelastic proton-proton cross section at √s=13 TeV with the ATLAS detector at the LHC

    This Letter presents a measurement of the inelastic proton-proton cross section using 60 μb⁻¹ of pp collisions at a center-of-mass energy √s of 13 TeV with the ATLAS detector at the LHC. Inelastic interactions are selected using rings of plastic scintillators in the forward region (2.0710 −6 , where M_X is the larger invariant mass of the two hadronic systems separated by the largest rapidity gap in the event. In this ξ range the scintillators are highly efficient. For diffractive events this corresponds to cases where at least one proton dissociates to a system with M_X > 13 GeV. The measured cross section is compared with a range of theoretical predictions. When extrapolated to the full phase space, a cross section of 78.1 ± 2.9 mb is measured, consistent with the inelastic cross section increasing with center-of-mass energy.

    Measurements of integrated and differential cross sections for isolated photon pair production in pp collisions at √s=8 TeV with the ATLAS detector

    A measurement of the production cross section for two isolated photons in proton-proton collisions at a center-of-mass energy of √s=8 TeV is presented. The results are based on an integrated luminosity of 20.2 fb⁻¹ recorded by the ATLAS detector at the Large Hadron Collider. The measurement considers photons with pseudorapidities satisfying |ηγ|40GeV and EγT,2>30 GeV for the two leading photons ordered in transverse energy produced in the interaction. The background due to hadronic jets and electrons is subtracted using data-driven techniques. The fiducial cross sections are corrected for detector effects and measured differentially as a function of six kinematic observables. The measured cross section integrated within the fiducial volume is 16.8 ± 0.8 pb. The data are compared to fixed-order QCD calculations at next-to-leading-order and next-to-next-to-leading-order accuracy as well as next-to-leading-order computations including resummation of initial-state gluon radiation at next-to-next-to-leading logarithm or matched to a parton shower, with relative uncertainties varying from 5% to 20%.

    Search for dark matter at √s=13 TeV in final states containing an energetic photon and large missing transverse momentum with the ATLAS detector

    Results of a search for physics beyond the Standard Model in events containing an energetic photon and large missing transverse momentum with the ATLAS detector at the Large Hadron Collider are reported. As the number of events observed in data, corresponding to an integrated luminosity of 36.1 fb⁻¹ of proton–proton collisions at a centre-of-mass energy of 13 TeV, is in agreement with the Standard Model expectations, model-independent limits are set on the fiducial cross section for the production of events in this final state. Exclusion limits are also placed in models where dark-matter candidates are pair-produced. For dark-matter production via an axial-vector or a vector mediator in the s-channel, this search excludes mediator masses below 750–1200 GeV for dark-matter candidate masses below 230–480 GeV at 95% confidence level, depending on the couplings. In an effective theory of dark-matter production, the limits restrict the value of the suppression scale M∗ to be above 790 GeV at 95% confidence level. A limit is also reported on the production of a high-mass scalar resonance by processes beyond the Standard Model, in which the resonance decays to Zγ and the Z boson subsequently decays into neutrinos.

    Measurement of the photon identification efficiencies with the ATLAS detector using LHC Run-1 data

    © 2016, CERN for the benefit of the ATLAS collaboration. The algorithms used by the ATLAS Collaboration to reconstruct and identify prompt photons are described. Measurements of the photon identification efficiencies are reported, using 4.9 fb⁻¹ of pp collision data collected at the LHC at √s=7 TeV and 20.3 fb⁻¹ at √s=8 TeV. The efficiencies are measured separately for converted and unconverted photons, in four different pseudorapidity regions, for transverse momenta between 10 GeV and 1.5 TeV. The results from the combination of three data-driven techniques are compared to the predictions from a simulation of the detector response, after correcting the electromagnetic shower momenta in the simulation for the average differences observed with respect to data. Data-to-simulation efficiency ratios used as correction factors in physics measurements are determined to account for the small residual efficiency differences. These factors are measured with uncertainties between 0.5% and 10% in 7 TeV data and between 0.5% and 5.6% in 8 TeV data, depending on the photon transverse momentum and pseudorapidity.

    Measurement of single top-quark production in association with a W boson in the single-lepton channel at √s=8 TeV with the ATLAS detector

    The production cross-section of a top quark in association with a W boson is measured using proton–proton collisions at √s=8 TeV. The dataset corresponds to an integrated luminosity of 20.2 fb⁻¹, and was collected in 2012 by the ATLAS detector at the Large Hadron Collider at CERN. The analysis is performed in the single-lepton channel. Events are selected by requiring one isolated lepton (electron or muon) and at least three jets. A neural network is trained to separate the tW signal from the dominant tt̄ background. The cross-section is extracted from a binned profile maximum-likelihood fit to a two-dimensional discriminant built from the neural-network output and the invariant mass of the hadronically decaying W boson. The measured cross-section is σ_tW = 26 ± 7 pb, in good agreement with the Standard Model expectation.

    Measurements of Higgs bosons decaying to bottom quarks from vector boson fusion production with the ATLAS experiment at √s=13 TeV

    The paper presents a measurement of the Standard Model Higgs boson decaying to b-quark pairs in the vector boson fusion (VBF) production mode. A sample corresponding to 126 fb⁻¹ of √s=13 TeV proton–proton collision data, collected with the ATLAS experiment at the Large Hadron Collider, is analyzed utilizing an adversarial neural network for event classification. The signal strength, defined as the ratio of the measured signal yield to that predicted by the Standard Model for VBF Higgs production, is measured to be 0.95 +0.38/−0.36, corresponding to an observed (expected) significance of 2.6 (2.8) standard deviations from the background-only hypothesis. The results are additionally combined with an analysis of Higgs bosons decaying to b-quarks, produced via VBF in association with a photon.