522 research outputs found

    FMViT: A multiple-frequency mixing Vision Transformer

    Full text link
    The transformer model has gained widespread adoption in computer vision tasks in recent times. However, due to the quadratic time and memory complexity of self-attention, which is proportional to the number of input tokens, most existing Vision Transformers (ViTs) encounter challenges in achieving efficient performance in practical industrial deployment scenarios, such as TensorRT and CoreML, where traditional CNNs excel. Although some recent attempts have been made to design CNN-Transformer hybrid architectures to tackle this problem, their overall performance has not met expectations. To tackle these challenges, we propose an efficient hybrid ViT architecture named FMViT. This approach enhances the model's expressive power by blending high-frequency features and low-frequency features with varying frequencies, enabling it to capture both local and global information effectively. Additionally, we introduce deploy-friendly mechanisms such as Convolutional Multigroup Reparameterization (gMLP), Lightweight Multi-head Self-Attention (RLMHSA), and Convolutional Fusion Block (CFB) to further improve the model's performance and reduce computational overhead. Our experiments demonstrate that FMViT surpasses existing CNNs, ViTs, and CNNTransformer hybrid architectures in terms of latency/accuracy trade-offs for various vision tasks. On the TensorRT platform, FMViT outperforms Resnet101 by 2.5% (83.3% vs. 80.8%) in top-1 accuracy on the ImageNet dataset while maintaining similar inference latency. Moreover, FMViT achieves comparable performance with EfficientNet-B5, but with a 43% improvement in inference speed. On CoreML, FMViT outperforms MobileOne by 2.6% in top-1 accuracy on the ImageNet dataset, with inference latency comparable to MobileOne (78.5% vs. 75.9%). Our code can be found at https://github.com/tany0699/FMViT

    A study of energy correction for the electron beam data in the BGO ECAL of the DAMPE

    Full text link
    The DArk Matter Particle Explorer (DAMPE) is an orbital experiment aiming at searching for dark matter indirectly by measuring the spectra of photons, electrons and positrons originating from deep space. The BGO electromagnetic calorimeter is one of the key sub-detectors of the DAMPE, which is designed for high energy measurement with a large dynamic range from 5 GeV to 10 TeV. In this paper, some methods for energy correction are discussed and tried, in order to reconstruct the primary energy of the incident electrons. Different methods are chosen for the appropriate energy ranges. The results of Geant4 simulation and beam test data (at CERN) are presented

    Simple synthetic data reduces sycophancy in large language models

    Full text link
    Sycophancy is an undesirable behavior where models tailor their responses to follow a human user's view even when that view is not objectively correct (e.g., adapting liberal views once a user reveals that they are liberal). In this paper, we study the prevalence of sycophancy in language models and propose a simple synthetic-data intervention to reduce this behavior. First, on a set of three sycophancy tasks (Perez et al., 2022) where models are asked for an opinion on statements with no correct answers (e.g., politics), we observe that both model scaling and instruction tuning significantly increase sycophancy for PaLM models up to 540B parameters. Second, we extend sycophancy evaluations to simple addition statements that are objectively incorrect, finding that despite knowing that these statements are wrong, language models will still agree with them if the user does as well. To reduce sycophancy, we present a straightforward synthetic-data intervention that takes public NLP tasks and encourages models to be robust to user opinions on these tasks. Adding these data in a lightweight finetuning step can significantly reduce sycophantic behavior on held-out prompts. Code for generating synthetic data for intervention can be found at https://github.com/google/sycophancy-intervention

    A Chemically Competent Thiosulfuranyl Radical on the Escherichia coli Class III Ribonucleotide Reductase

    Get PDF
    The class III ribonucleotide reductases (RNRs) are glycyl radical (G•) enzymes that provide the balanced pool of deoxynucleotides required for DNA synthesis and repair in many facultative and obligate anaerobic bacteria and archaea. Unlike the class I and II RNRs, where reducing equivalents for the reaction are delivered by a redoxin (thioredoxin, glutaredoxin, or NrdH) via a pair of conserved active site cysteines, the class III RNRs examined to date use formate as the reductant. Here, we report that reaction of the Escherichia coli class III RNR with CTP (substrate) and ATP (allosteric effector) in the absence of formate leads to loss of the G• concomitant with stoichiometric formation of a new radical species and a “trapped” cytidine derivative that can break down to cytosine. Addition of formate to the new species results in recovery of 80% of the G• and reduction of the cytidine derivative, proposed to be 3′-keto-deoxycytidine, to dCTP and a small amount of cytosine. The structure of the new radical has been identified by 9.5 and 140 GHz EPR spectroscopy on isotopically labeled varieties of the protein to be a thiosulfuranyl radical [RSSR[subscript 2]]•, composed of a cysteine thiyl radical stabilized by an interaction with a methionine residue. The presence of a stable radical species on the reaction pathway rationalizes the previously reported [[superscript 3]H]-(k[subscript cat]/K[subscript M]) isotope effect of 2.3 with [[superscript 3]H]-formate, requiring formate to exchange between the active site and solution during nucleotide reduction. Analogies with the disulfide anion radical proposed to provide the reducing equivalent to the 3′-keto-deoxycytidine intermediate by the class I and II RNRs provide further evidence for the involvement of thiyl radicals in the reductive half-reaction catalyzed by all RNRs.NWO of the Netherlands (Rubicon Fellowship)Singapore. Agency for Science, Technology and ResearchNational Institutes of Health (U.S.) (Grant GM29595)National Institutes of Health (U.S.) (Grant EB-002804)National Institutes of Health (U.S.) (Grant EB-002026

    Ethyl 2-hydr­oxy-5-oxo-4-phenyl-2,3,4,5-tetra­hydro­pyrano[3,2-c]chromene-2-carboxyl­ate

    Get PDF
    The main structural unit of the title compoud, C21H18O6, is a fused three-ring group consisting of coumarin and tetra­hydro­pyrane ring systems. Two C atoms of the tetra­hydro­pyran ring are displaced by 0.295 (3) and −0.360 (2) Å from the mean plane of coumarin ring. The dihedral angle between the phenyl and coumarin rings is 73.94 (3)°. Inter­molecular O—H⋯O hydrogen bonds are present in the crystal structure

    A Ferredoxin Disulfide Reductase Delivers Electrons to the to the Methanosarcina barkeri Class III Ribonucleotide Reductase

    Get PDF
    Two subtypes of class III anaerobic ribonucleotide reductases (RNRs) studied so far couple the reduction of ribonucleotides to the oxidation of formate, or the oxidation of NADPH via thioredoxin and thioredoxin reductase. Certain methanogenic archaea contain a phylogenetically distinct third subtype of class III RNR, with distinct active-site residues. Here we report the cloning and recombinant expression of the Methanosarcina barkeri class III RNR and show that the electrons required for ribonucleotide reduction can be delivered by a [4Fe-4S] protein ferredoxin disulfide reductase, and a conserved thioredoxin-like protein NrdH present in the RNR operon. The diversity of class III RNRs reflects the diversity of electron carriers used in anaerobic metabolism.Singapore. Agency for Science, Technology and ResearchNational Institutes of Health (U.S.) (Grant GM081393
    corecore