1,009 research outputs found

    Semantically-Shifted Incremental Adapter-Tuning is A Continual ViTransformer

    Full text link
    Class-incremental learning (CIL) aims to enable models to continuously learn new classes while overcoming catastrophic forgetting. The introduction of pre-trained models has brought new tuning paradigms to CIL. In this paper, we revisit different parameter-efficient tuning (PET) methods within the context of continual learning. We observe that adapter tuning demonstrates superiority over prompt-based methods, even without parameter expansion in each learning session. Motivated by this, we propose incrementally tuning the shared adapter without imposing parameter update constraints, enhancing the learning capacity of the backbone. Additionally, we employ feature sampling from stored prototypes to retrain a unified classifier, further improving its performance. We estimate the semantic shift of old prototypes without access to past samples and update stored prototypes session by session. Our proposed method eliminates model expansion and avoids retaining any image samples. It surpasses previous pre-trained model-based CIL methods and demonstrates remarkable continual learning capabilities. Experimental results on five CIL benchmarks validate the effectiveness of our approach, achieving state-of-the-art (SOTA) performance.Comment: To appear at CVPR 202

    Enhancing the General Agent Capabilities of Low-Parameter LLMs through Tuning and Multi-Branch Reasoning

    Full text link
    Open-source pre-trained Large Language Models (LLMs) exhibit strong language understanding and generation capabilities, making them highly successful in a variety of tasks. However, when used as agents for dealing with complex problems in the real world, their performance is far inferior to large commercial models such as ChatGPT and GPT-4. As intelligent agents, LLMs need to have the capabilities of task planning, long-term memory, and the ability to leverage external tools to achieve satisfactory performance. Various methods have been proposed to enhance the agent capabilities of LLMs. On the one hand, methods involve constructing agent-specific data and fine-tuning the models. On the other hand, some methods focus on designing prompts that effectively activate the reasoning abilities of the LLMs. We explore both strategies on the 7B and 13B models. We propose a comprehensive method for constructing agent-specific data using GPT-4. Through supervised fine-tuning with constructed data, we find that for these models with a relatively small number of parameters, supervised fine-tuning can significantly reduce hallucination outputs and formatting errors in agent tasks. Furthermore, techniques such as multi-path reasoning and task decomposition can effectively decrease problem complexity and enhance the performance of LLMs as agents. We evaluate our method on five agent tasks of AgentBench and achieve satisfactory results.Comment: To appear at NAACL 202

    Ancient mitochondrial genomes reveal extensive genetic influence of the steppe pastoralists in Western Xinjiang

    Get PDF
    The population prehistory of Xinjiang has been a hot topic among geneticists, linguists, and archaeologists. Current ancient DNA studies in Xinjiang exclusively suggest an admixture model for the populations in Xinjiang since the early Bronze Age. However, almost all of these studies focused on the northern and eastern parts of Xinjiang; the prehistoric demographic processes that occurred in western Xinjiang have been seldomly reported. By analyzing complete mitochondrial sequences from the Xiabandi (XBD) cemetery (3,500–3,300 BP), the up-to-date earliest cemetery excavated in western Xinjiang, we show that all the XBD mitochondrial sequences fall within two different West Eurasian mitochondrial DNA (mtDNA) pools, indicating that the migrants into western Xinjiang from west Eurasians were a consequence of the early expansion of the middle and late Bronze Age steppe pastoralists (Steppe_MLBA), admixed with the indigenous populations from Central Asia. Our study provides genetic links for an early existence of the Indo-Iranian language in southwestern Xinjiang and suggests that the existence of Andronovo culture in western Xinjiang involved not only the dispersal of ideas but also population movement.Introduction Materials and methods - Archaeological Background, Sampling, and Sequencing - Sequence Mapping and Mitochondrial DNA Haplogroup Determination - Analysis of Xiabandi Mitochondrial DNA Genomes Results - Mitochondrial DNA Authentication and Contamination Assessment - Major Bronze Age Steppe Pastoralist Origin of the Xiabandi Mitochondrial Haplogroups - Expansion of the Bronze Age Steppe Pastoralists as a Dynamic Process to Form the Genetic Landscape of Xiabandi Individuals Discussion Conclusion

    Longer telomere length in peripheral white blood cells is associated with risk of lung cancer and the rs2736100 (CLPTM1L-TERT) polymorphism in a prospective cohort study among women in China.

    Get PDF
    A recent genome-wide association study of lung cancer among never-smoking females in Asia demonstrated that the rs2736100 polymorphism in the TERT-CLPTM1L locus on chromosome 5p15.33 was strongly and significantly associated with risk of adenocarcinoma of the lung. The telomerase gene TERT is a reverse transcriptase that is critical for telomere replication and stabilization by controlling telomere length. We previously found that longer telomere length measured in peripheral white blood cell DNA was associated with increased risk of lung cancer in a prospective cohort study of smoking males in Finland. To follow up on this finding, we carried out a nested case-control study of 215 female lung cancer cases and 215 female controls, 94% of whom were never-smokers, in the prospective Shanghai Women's Health Study cohort. There was a dose-response relationship between tertiles of telomere length and risk of lung cancer (odds ratio (OR), 95% confidence interval [CI]: 1.0, 1.4 [0.8-2.5], and 2.2 [1.2-4.0], respectively; P trend = 0.003). Further, the association was unchanged by the length of time from blood collection to case diagnosis. In addition, the rs2736100 G allele, which we previously have shown to be associated with risk of lung cancer in this cohort, was significantly associated with longer telomere length in these same study subjects (P trend = 0.030). Our findings suggest that individuals with longer telomere length in peripheral white blood cells may have an increased risk of lung cancer, but require replication in additional prospective cohorts and populations

    Search for new particles in events with energetic jets and large missing transverse momentum in proton-proton collisions at root s=13 TeV

    Get PDF
    A search is presented for new particles produced at the LHC in proton-proton collisions at root s = 13 TeV, using events with energetic jets and large missing transverse momentum. The analysis is based on a data sample corresponding to an integrated luminosity of 101 fb(-1), collected in 2017-2018 with the CMS detector. Machine learning techniques are used to define separate categories for events with narrow jets from initial-state radiation and events with large-radius jets consistent with a hadronic decay of a W or Z boson. A statistical combination is made with an earlier search based on a data sample of 36 fb(-1), collected in 2016. No significant excess of events is observed with respect to the standard model background expectation determined from control samples in data. The results are interpreted in terms of limits on the branching fraction of an invisible decay of the Higgs boson, as well as constraints on simplified models of dark matter, on first-generation scalar leptoquarks decaying to quarks and neutrinos, and on models with large extra dimensions. Several of the new limits, specifically for spin-1 dark matter mediators, pseudoscalar mediators, colored mediators, and leptoquarks, are the most restrictive to date.Peer reviewe

    Combined searches for the production of supersymmetric top quark partners in proton-proton collisions at root s=13 TeV

    Get PDF
    A combination of searches for top squark pair production using proton-proton collision data at a center-of-mass energy of 13 TeV at the CERN LHC, corresponding to an integrated luminosity of 137 fb(-1) collected by the CMS experiment, is presented. Signatures with at least 2 jets and large missing transverse momentum are categorized into events with 0, 1, or 2 leptons. New results for regions of parameter space where the kinematical properties of top squark pair production and top quark pair production are very similar are presented. Depending on themodel, the combined result excludes a top squarkmass up to 1325 GeV for amassless neutralino, and a neutralinomass up to 700 GeV for a top squarkmass of 1150 GeV. Top squarks with masses from 145 to 295 GeV, for neutralino masses from 0 to 100 GeV, with a mass difference between the top squark and the neutralino in a window of 30 GeV around the mass of the top quark, are excluded for the first time with CMS data. The results of theses searches are also interpreted in an alternative signal model of dark matter production via a spin-0 mediator in association with a top quark pair. Upper limits are set on the cross section for mediator particle masses of up to 420 GeV

    Probing effective field theory operators in the associated production of top quarks with a Z boson in multilepton final states at root s=13 TeV

    Get PDF
    Peer reviewe

    Measurement of the W gamma Production Cross Section in Proton-Proton Collisions at root s=13 TeV and Constraints on Effective Field Theory Coefficients

    Get PDF
    A fiducial cross section for W gamma production in proton-proton collisions is measured at a center-of-mass energy of 13 TeV in 137 fb(-1) of data collected using the CMS detector at the LHC. The W -> e nu and mu nu decay modes are used in a maximum-likelihood fit to the lepton-photon invariant mass distribution to extract the combined cross section. The measured cross section is compared with theoretical expectations at next-to-leading order in quantum chromodynamics. In addition, 95% confidence level intervals are reported for anomalous triple-gauge couplings within the framework of effective field theory.Peer reviewe

    Search for top squark production in fully hadronic final states in proton-proton collisions at root s=13 TeV

    Get PDF
    A search for production of the supersymmetric partners of the top quark, top squarks, is presented. The search is based on proton-proton collision events containing multiple jets, no leptons, and large transverse momentum imbalance. The data were collected with the CMS detector at the CERN LHC at a center-of-mass energy of 13 TeV, and correspond to an integrated luminosity of 137 fb(-1). The targeted signal production scenarios are direct and gluino-mediated top squark production, including scenarios in which the top squark and neutralino masses are nearly degenerate. The search utilizes novel algorithms based on deep neural networks that identify hadronically decaying top quarks and W bosons, which are expected in many of the targeted signal models. No statistically significant excess of events is observed relative to the expectation from the standard model, and limits on the top squark production cross section are obtained in the context of simplified supersymmetric models for various production and decay modes. Exclusion limits as high as 1310 GeVare established at the 95% confidence level on the mass of the top squark for direct top squark production models, and as high as 2260 GeV on the mass of the gluino for gluino-mediated top squark production models. These results represent a significant improvement over the results of previous searches for supersymmetry by CMS in the same final state.Peer reviewe

    Observation of tW production in the single-lepton channel in pp collisions at root s=13 TeV

    Get PDF
    A measurement of the cross section of the associated production of a single top quark and a W boson in final states with a muon or electron and jets in proton-proton collisions at root s = 13 TeV is presented. The data correspond to an integrated luminosity of 36 fb(-1) collected with the CMS detector at the CERN LHC in 2016. A boosted decision tree is used to separate the tW signal from the dominant t (t) over bar background, whilst the subleading W+jets and multijet backgrounds are constrained using data-based estimates. This result is the first observation of the tW process in final states containing a muon or electron and jets, with a significance exceeding 5 standard deviations. The cross section is determined to be 89 +/- 4 (stat) +/- 12 (syst) pb, consistent with the standard model.Peer reviewe
    • 

    corecore