51 research outputs found

    Large expert-curated database for benchmarking document similarity detection in biomedical literature search

    Document recommendation systems for locating relevant literature have mostly relied on methods developed a decade ago. This is largely due to the lack of a large offline gold-standard benchmark of relevant documents that cover a variety of research fields such that newly developed literature search techniques can be compared, improved and translated into practice. To overcome this bottleneck, we have established the RElevant LIterature SearcH consortium consisting of more than 1500 scientists from 84 countries, who have collectively annotated the relevance of over 180 000 PubMed-listed articles with regard to their respective seed (input) article(s). The majority of annotations were contributed by highly experienced, original authors of the seed articles. The collected data cover 76% of all unique PubMed Medical Subject Headings descriptors. No systematic biases were observed across different experience levels, research fields or time spent on annotations. More importantly, annotations of the same document pairs contributed by different scientists were highly concordant. We further show that the three representative baseline methods used to generate recommended articles for evaluation (Okapi Best Matching 25, Term Frequency-Inverse Document Frequency and PubMed Related Articles) had similar overall performances. Additionally, we found that these methods each tend to produce distinct collections of recommended articles, suggesting that a hybrid method may be required to completely capture all relevant articles. The established database server located at https://relishdb.ict.griffith.edu.au is freely available for the downloading of annotation data and the blind testing of new methods. We expect that this benchmark will be useful for stimulating the development of new powerful techniques for title and title/abstract-based search engines for relevant articles in biomedical research.

    Measurements of Higgs boson production cross sections and couplings in the diphoton decay channel at √s = 13 TeV

    Measurements of Higgs boson production cross sections and couplings in events where the Higgs boson decays into a pair of photons are reported. Events are selected from a sample of proton-proton collisions at √s = 13 TeV collected by the CMS detector at the LHC from 2016 to 2018, corresponding to an integrated luminosity of 137 fb⁻¹. Analysis categories enriched in Higgs boson events produced via gluon fusion, vector boson fusion, vector boson associated production, and production associated with top quarks are constructed. The total Higgs boson signal strength, relative to the standard model (SM) prediction, is measured to be 1.12 ± 0.09. Other properties of the Higgs boson are measured, including SM signal strength modifiers, production cross sections, and its couplings to other particles. These include the most precise measurements of gluon fusion and vector boson fusion Higgs boson production in several different kinematic regions, the first measurement of Higgs boson production in association with a top quark pair in five regions of the Higgs boson transverse momentum, and an upper limit on the rate of Higgs boson production in association with a single top quark. All results are found to be in agreement with the SM expectations.

    Measurement of the top quark mass using events with a single reconstructed top quark in pp collisions at √s = 13 TeV

    A measurement of the top quark mass is performed using a data sample enriched with single top quark events produced in the t channel. The study is based on proton-proton collision data, corresponding to an integrated luminosity of 35.9 fb⁻¹, recorded at √s = 13 TeV by the CMS experiment at the LHC in 2016. Candidate events are selected by requiring an isolated high-momentum lepton (muon or electron) and exactly two jets, of which one is identified as originating from a bottom quark. Multivariate discriminants are designed to separate the signal from the background. Optimized thresholds are placed on the discriminant outputs to obtain an event sample with high signal purity. The top quark mass is found to be 172.13 +0.76/−0.77 GeV, where the uncertainty includes both the statistical and systematic components, reaching sub-GeV precision for the first time in this event topology. The masses of the top quark and antiquark are also determined separately using the lepton charge in the final state, from which the mass ratio and difference are determined to be 0.9952 +0.0079/−0.0104 and 0.83 +1.79/−1.35 GeV, respectively. The results are consistent with CPT invariance.

    Search for a heavy resonance decaying to a top quark and a W boson at √s = 13 TeV in the fully hadronic final state

    A search for a heavy resonance decaying to a top quark and a W boson in the fully hadronic final state is presented. The analysis is performed using data from proton-proton collisions at a center-of-mass energy of 13 TeV, corresponding to an integrated luminosity of 137 fb⁻¹ recorded by the CMS experiment at the LHC. The search is focused on heavy resonances, where the decay products of each top quark or W boson are expected to be reconstructed as a single, large-radius jet with a distinct substructure. The production of an excited bottom quark, b*, is used as a benchmark when setting limits on the cross section for a heavy resonance decaying to a top quark and a W boson. The hypotheses of b* quarks with left-handed, right-handed, and vector-like chiralities are excluded at 95% confidence level for masses below 2.6, 2.8, and 3.1 TeV, respectively. These are the most stringent limits on the b* quark mass to date, extending the previous best limits by almost a factor of two.

    A chromosome scale tomato genome built from complementary PacBio and Nanopore sequences alone reveals extensive linkage drag during breeding

    The assembly and scaffolding of plant crop genomes facilitate the characterization of genetically diverse cultivated and wild germplasm. The cultivated tomato (Solanum lycopersicum) has been improved through the introgression of genetic material from related wild species, including resistance to pandemic strains of tobacco mosaic virus (TMV) from Solanum peruvianum. Here we applied PacBio HiFi and ONT Nanopore sequencing to develop independent, highly contiguous and complementary assemblies of an inbred TMV-resistant tomato variety. We show specific examples of how HiFi and ONT datasets can complement one another to improve assembly contiguity. We merged the HiFi and ONT assemblies to generate a long-read-only assembly where all 12 chromosomes were represented as 12 contiguous sequences (N50 = 68.5 Mbp). This chromosome scale assembly did not require scaffolding using an orthogonal data type. The merged assembly was validated by chromosome conformation capture data and is highly consistent with previous tomato genome assemblies that made use of genetic maps and Hi-C for scaffolding. Our long-read-only assembly reveals that a complex series of structural variants linked to the TMV resistance gene likely contributed to linkage drag of a 64.1-Mbp region of the S. peruvianum genome during tomato breeding. Through marker studies and ONT-based comprehensive haplotyping we show that this minimal introgression region is present in six cultivated tomato hybrid varieties developed in three commercial breeding programs. Our results suggest that complementary long read technologies can facilitate the rapid generation of near-complete genome sequences
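
The N50 figure quoted in this abstract is a contiguity statistic. A minimal sketch of how it is usually computed, assuming the common definition (the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the assembly). The function name `n50` and the toy lengths are illustrative assumptions and are not taken from the paper.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths.

    Assumes the common definition: sort contigs from longest to shortest
    and return the length at which the running total first reaches half
    of the total assembly size.
    """
    if not lengths:
        raise ValueError("need at least one contig length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical contig lengths in Mbp (not data from the paper).
toy_lengths = [80, 70, 50, 40, 30, 20]
print(n50(toy_lengths))  # prints 70
```

In the toy example the total is 290, so half is 145; the two longest sequences (80 + 70 = 150) are the first to reach that threshold, which gives an N50 of 70.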

    Precision luminosity measurement in proton-proton collisions at √s = 13 TeV in 2015 and 2016 at CMS

    The measurement of the luminosity recorded by the CMS detector installed at LHC interaction point 5, using proton-proton collisions at √s = 13 TeV in 2015 and 2016, is reported. The absolute luminosity scale is measured for individual bunch crossings using beam-separation scans (the van der Meer method), with a relative precision of 1.3 and 1.0% in 2015 and 2016, respectively. The dominant sources of uncertainty are related to residual differences between the measured beam positions and the ones provided by the operational settings of the LHC magnets, the factorizability of the proton bunch spatial density functions in the coordinates transverse to the beam direction, and the modeling of the effect of electromagnetic interactions among protons in the colliding bunches. When applying the van der Meer calibration to the entire run periods, the integrated luminosities when CMS was fully operational are 2.27 and 36.3 fb⁻¹ in 2015 and 2016, with a relative precision of 1.6 and 1.2%, respectively. These are among the most precise luminosity measurements at bunched-beam hadron colliders.

    Measurements of angular distance and momentum ratio distributions in three-jet and Z plus two-jet final states in pp collisions

    Collinear (small-angle) and large-angle, as well as soft and hard radiations are investigated in three-jet and Z + two-jet events collected in proton-proton collisions at the LHC. The normalized production cross sections are measured as a function of the ratio of transverse momenta of two jets and their angular separation. The measurements in the three-jet and Z + two-jet events are based on data collected at a center-of-mass energy of 8 TeV, corresponding to an integrated luminosity of 19.8 fb⁻¹. The Z + two-jet events are reconstructed in the dimuon decay channel of the Z boson. The three-jet measurement is extended to include √s = 13 TeV data corresponding to an integrated luminosity of 2.3 fb⁻¹. The results are compared to predictions from event generators that include parton showers, multiple parton interactions, and hadronization. The collinear and soft regions are in general well described by parton showers, whereas the regions of large angular separation are often best described by calculations using higher-order matrix elements.

    Measurement of inclusive and differential cross sections for single top quark production in association with a W boson in proton-proton collisions at √s = 13 TeV

    Measurements of the inclusive and normalised differential cross sections are presented for the production of single top quarks in association with a W boson in proton-proton collisions at a centre-of-mass energy of 13 TeV. The data used were recorded with the CMS detector at the LHC during 2016–2018, and correspond to an integrated luminosity of 138 fb⁻¹. Events containing one electron and one muon in the final state are analysed. For the inclusive measurement, a multivariate discriminant exploiting the kinematic properties of the events is used to separate the signal from the dominant tt̄ background. A cross section of 79.2 ± 0.9 (stat) +7.7/−8.0 (syst) ± 1.2 (lumi) pb is obtained, consistent with the predictions of the standard model. For the differential measurements, a fiducial region is defined according to the detector acceptance, and the requirement of exactly one jet coming from the fragmentation of a bottom quark. The resulting distributions are unfolded to particle level and agree with the predictions at next-to-leading order in perturbative quantum chromodynamics.

    Search for pair production of vector-like quarks in leptonic final states in proton-proton collisions at √s = 13 TeV

    A search is presented for vector-like T and B quark-antiquark pairs produced in proton-proton collisions at a center-of-mass energy of 13 TeV. Data were collected by the CMS experiment at the CERN LHC in 2016-2018, with an integrated luminosity of 138 fb⁻¹. Events are separated into single-lepton, same-sign charge dilepton, and multilepton channels. In the analysis of the single-lepton channel a multilayer neural network and jet identification techniques are employed to select signal events, while the same-sign dilepton and multilepton channels rely on the high-energy signature of the signal to distinguish it from standard model backgrounds. The data are consistent with standard model background predictions, and the production of vector-like quark pairs is excluded at 95% confidence level for T quark masses up to 1.54 TeV and B quark masses up to 1.56 TeV, depending on the branching fractions assumed, with maximal sensitivity to decay modes that include multiple top quarks. The limits obtained in this search are the strongest limits to date for TT̄ production, excluding masses below 1.48 TeV for all decays to third generation quarks, and are the strongest limits to date for BB̄ production with B quark decays to tW.

    Measurement of the production cross section for a W boson in association with a charm quark in proton-proton collisions at √s = 13 TeV

    The strange quark content of the proton is probed through the measurement of the production cross section for a W boson and a charm (c) quark in proton-proton collisions at a center-of-mass energy of 13 TeV. The analysis uses a data sample corresponding to a total integrated luminosity of 138 fb⁻¹ collected with the CMS detector at the LHC. The W bosons are identified through their leptonic decays to an electron or a muon, and a neutrino. Charm jets are tagged using the presence of a muon or a secondary vertex inside the jet. The W+c production cross section and the cross section ratio R_c^± = σ(W⁺ + c̄)/σ(W⁻ + c) are measured inclusively and differentially as functions of the transverse momentum and the pseudorapidity of the lepton originating from the W boson decay. The precision of the measurements is improved with respect to previous studies, reaching 1% in R_c^±. The measurements are compared with theoretical predictions up to next-to-next-to-leading order in perturbative quantum chromodynamics.