522 research outputs found
FMViT: A multiple-frequency mixing Vision Transformer
The transformer model has gained widespread adoption in computer vision tasks
in recent times. However, due to the quadratic time and memory complexity of
self-attention, which is proportional to the number of input tokens, most
existing Vision Transformers (ViTs) encounter challenges in achieving efficient
performance in practical industrial deployment scenarios, such as TensorRT and
CoreML, where traditional CNNs excel. Although some recent attempts have been
made to design CNN-Transformer hybrid architectures to tackle this problem,
their overall performance has not met expectations. To tackle these challenges,
we propose an efficient hybrid ViT architecture named FMViT. This approach
enhances the model's expressive power by blending high-frequency features and
low-frequency features with varying frequencies, enabling it to capture both
local and global information effectively. Additionally, we introduce
deploy-friendly mechanisms such as Convolutional Multigroup Reparameterization
(gMLP), Lightweight Multi-head Self-Attention (RLMHSA), and Convolutional
Fusion Block (CFB) to further improve the model's performance and reduce
computational overhead. Our experiments demonstrate that FMViT surpasses
existing CNNs, ViTs, and CNNTransformer hybrid architectures in terms of
latency/accuracy trade-offs for various vision tasks. On the TensorRT platform,
FMViT outperforms Resnet101 by 2.5% (83.3% vs. 80.8%) in top-1 accuracy on the
ImageNet dataset while maintaining similar inference latency. Moreover, FMViT
achieves comparable performance with EfficientNet-B5, but with a 43%
improvement in inference speed. On CoreML, FMViT outperforms MobileOne by 2.6%
in top-1 accuracy on the ImageNet dataset, with inference latency comparable to
MobileOne (78.5% vs. 75.9%). Our code can be found at
https://github.com/tany0699/FMViT
A study of energy correction for the electron beam data in the BGO ECAL of the DAMPE
The DArk Matter Particle Explorer (DAMPE) is an orbital experiment aiming at
searching for dark matter indirectly by measuring the spectra of photons,
electrons and positrons originating from deep space. The BGO electromagnetic
calorimeter is one of the key sub-detectors of the DAMPE, which is designed for
high energy measurement with a large dynamic range from 5 GeV to 10 TeV. In
this paper, some methods for energy correction are discussed and tried, in
order to reconstruct the primary energy of the incident electrons. Different
methods are chosen for the appropriate energy ranges. The results of Geant4
simulation and beam test data (at CERN) are presented
Simple synthetic data reduces sycophancy in large language models
Sycophancy is an undesirable behavior where models tailor their responses to
follow a human user's view even when that view is not objectively correct
(e.g., adapting liberal views once a user reveals that they are liberal). In
this paper, we study the prevalence of sycophancy in language models and
propose a simple synthetic-data intervention to reduce this behavior.
First, on a set of three sycophancy tasks (Perez et al., 2022) where models
are asked for an opinion on statements with no correct answers (e.g.,
politics), we observe that both model scaling and instruction tuning
significantly increase sycophancy for PaLM models up to 540B parameters.
Second, we extend sycophancy evaluations to simple addition statements that are
objectively incorrect, finding that despite knowing that these statements are
wrong, language models will still agree with them if the user does as well.
To reduce sycophancy, we present a straightforward synthetic-data
intervention that takes public NLP tasks and encourages models to be robust to
user opinions on these tasks. Adding these data in a lightweight finetuning
step can significantly reduce sycophantic behavior on held-out prompts. Code
for generating synthetic data for intervention can be found at
https://github.com/google/sycophancy-intervention
A Chemically Competent Thiosulfuranyl Radical on the Escherichia coli Class III Ribonucleotide Reductase
The class III ribonucleotide reductases (RNRs) are glycyl radical (G•) enzymes that provide the balanced pool of deoxynucleotides required for DNA synthesis and repair in many facultative and obligate anaerobic bacteria and archaea. Unlike the class I and II RNRs, where reducing equivalents for the reaction are delivered by a redoxin (thioredoxin, glutaredoxin, or NrdH) via a pair of conserved active site cysteines, the class III RNRs examined to date use formate as the reductant. Here, we report that reaction of the Escherichia coli class III RNR with CTP (substrate) and ATP (allosteric effector) in the absence of formate leads to loss of the G• concomitant with stoichiometric formation of a new radical species and a “trapped” cytidine derivative that can break down to cytosine. Addition of formate to the new species results in recovery of 80% of the G• and reduction of the cytidine derivative, proposed to be 3′-keto-deoxycytidine, to dCTP and a small amount of cytosine. The structure of the new radical has been identified by 9.5 and 140 GHz EPR spectroscopy on isotopically labeled varieties of the protein to be a thiosulfuranyl radical [RSSR[subscript 2]]•, composed of a cysteine thiyl radical stabilized by an interaction with a methionine residue. The presence of a stable radical species on the reaction pathway rationalizes the previously reported [[superscript 3]H]-(k[subscript cat]/K[subscript M]) isotope effect of 2.3 with [[superscript 3]H]-formate, requiring formate to exchange between the active site and solution during nucleotide reduction. Analogies with the disulfide anion radical proposed to provide the reducing equivalent to the 3′-keto-deoxycytidine intermediate by the class I and II RNRs provide further evidence for the involvement of thiyl radicals in the reductive half-reaction catalyzed by all RNRs.NWO of the Netherlands (Rubicon Fellowship)Singapore. Agency for Science, Technology and ResearchNational Institutes of Health (U.S.) (Grant GM29595)National Institutes of Health (U.S.) (Grant EB-002804)National Institutes of Health (U.S.) (Grant EB-002026
Ethyl 2-hydroxy-5-oxo-4-phenyl-2,3,4,5-tetrahydropyrano[3,2-c]chromene-2-carboxylate
The main structural unit of the title compoud, C21H18O6, is a fused three-ring group consisting of coumarin and tetrahydropyrane ring systems. Two C atoms of the tetrahydropyran ring are displaced by 0.295 (3) and −0.360 (2) Å from the mean plane of coumarin ring. The dihedral angle between the phenyl and coumarin rings is 73.94 (3)°. Intermolecular O—H⋯O hydrogen bonds are present in the crystal structure
A Ferredoxin Disulfide Reductase Delivers Electrons to the to the Methanosarcina barkeri Class III Ribonucleotide Reductase
Two subtypes of class III anaerobic ribonucleotide reductases (RNRs) studied so far couple the reduction of ribonucleotides to the oxidation of formate, or the oxidation of NADPH via thioredoxin and thioredoxin reductase. Certain methanogenic archaea contain a phylogenetically distinct third subtype of class III RNR, with distinct active-site residues. Here we report the cloning and recombinant expression of the Methanosarcina barkeri class III RNR and show that the electrons required for ribonucleotide reduction can be delivered by a [4Fe-4S] protein ferredoxin disulfide reductase, and a conserved thioredoxin-like protein NrdH present in the RNR operon. The diversity of class III RNRs reflects the diversity of electron carriers used in anaerobic metabolism.Singapore. Agency for Science, Technology and ResearchNational Institutes of Health (U.S.) (Grant GM081393
(4R)-Ethyl 4-(4-chlorophenyl)-2-hydroxy-5-oxo-2,3,4,5-tetrahydropyrano[3,2-c]chromene-2-carboxylate. Corrigendum
Corrigendum to Acta Cryst. (2010), E66, o217
- …