Search CORE

522 research outputs found

FMViT: A multiple-frequency mixing Vision Transformer

Author: Geng Yifeng
Tan Wei
Xie Xuansong
Publication venue
Publication date: 09/11/2023
Field of study

The transformer model has gained widespread adoption in computer vision tasks in recent times. However, due to the quadratic time and memory complexity of self-attention, which is proportional to the number of input tokens, most existing Vision Transformers (ViTs) encounter challenges in achieving efficient performance in practical industrial deployment scenarios, such as TensorRT and CoreML, where traditional CNNs excel. Although some recent attempts have been made to design CNN-Transformer hybrid architectures to tackle this problem, their overall performance has not met expectations. To tackle these challenges, we propose an efficient hybrid ViT architecture named FMViT. This approach enhances the model's expressive power by blending high-frequency features and low-frequency features with varying frequencies, enabling it to capture both local and global information effectively. Additionally, we introduce deploy-friendly mechanisms such as Convolutional Multigroup Reparameterization (gMLP), Lightweight Multi-head Self-Attention (RLMHSA), and Convolutional Fusion Block (CFB) to further improve the model's performance and reduce computational overhead. Our experiments demonstrate that FMViT surpasses existing CNNs, ViTs, and CNNTransformer hybrid architectures in terms of latency/accuracy trade-offs for various vision tasks. On the TensorRT platform, FMViT outperforms Resnet101 by 2.5% (83.3% vs. 80.8%) in top-1 accuracy on the ImageNet dataset while maintaining similar inference latency. Moreover, FMViT achieves comparable performance with EfficientNet-B5, but with a 43% improvement in inference speed. On CoreML, FMViT outperforms MobileOne by 2.6% in top-1 accuracy on the ImageNet dataset, with inference latency comparable to MobileOne (78.5% vs. 75.9%). Our code can be found at https://github.com/tany0699/FMViT

arXiv.org e-Print Archive

A study of energy correction for the electron beam data in the BGO ECAL of the DAMPE

Author: Huang Guangshun
Li Zhiying
Wang Chi
Wang Xiaolian
Wei Yifeng
Wen Sicheng
Xu Zizong
Zhang Yunlong
Zhang Zhiyong
Publication venue
Publication date: 10/11/2015
Field of study

The DArk Matter Particle Explorer (DAMPE) is an orbital experiment aiming at searching for dark matter indirectly by measuring the spectra of photons, electrons and positrons originating from deep space. The BGO electromagnetic calorimeter is one of the key sub-detectors of the DAMPE, which is designed for high energy measurement with a large dynamic range from 5 GeV to 10 TeV. In this paper, some methods for energy correction are discussed and tried, in order to reconstruct the primary energy of the incident electrons. Different methods are chosen for the appropriate energy ranges. The results of Geant4 simulation and beam test data (at CERN) are presented

arXiv.org e-Print Archive

CERN Document Server

Simple synthetic data reduces sycophancy in large language models

Author: Huang Da
Le Quoc V.
Lu Yifeng
Wei Jerry
Zhou Denny
Publication venue
Publication date: 07/08/2023
Field of study

Sycophancy is an undesirable behavior where models tailor their responses to follow a human user's view even when that view is not objectively correct (e.g., adapting liberal views once a user reveals that they are liberal). In this paper, we study the prevalence of sycophancy in language models and propose a simple synthetic-data intervention to reduce this behavior. First, on a set of three sycophancy tasks (Perez et al., 2022) where models are asked for an opinion on statements with no correct answers (e.g., politics), we observe that both model scaling and instruction tuning significantly increase sycophancy for PaLM models up to 540B parameters. Second, we extend sycophancy evaluations to simple addition statements that are objectively incorrect, finding that despite knowing that these statements are wrong, language models will still agree with them if the user does as well. To reduce sycophancy, we present a straightforward synthetic-data intervention that takes public NLP tasks and encourages models to be robust to user opinions on these tasks. Adding these data in a lightweight finetuning step can significantly reduce sycophantic behavior on held-out prompts. Code for generating synthetic data for intervention can be found at https://github.com/google/sycophancy-intervention

arXiv.org e-Print Archive

A Chemically Competent Thiosulfuranyl Radical on the Escherichia coli Class III Ribonucleotide Reductase

Author: Chen Jiahao
Griffin Robert Guy
Mathies Guinevere
Stubbe JoAnne
Wei Yifeng
Yokoyama Kenichi
Publication venue: 'American Chemical Society (ACS)'
Publication date: 01/01/2014
Field of study

The class III ribonucleotide reductases (RNRs) are glycyl radical (G•) enzymes that provide the balanced pool of deoxynucleotides required for DNA synthesis and repair in many facultative and obligate anaerobic bacteria and archaea. Unlike the class I and II RNRs, where reducing equivalents for the reaction are delivered by a redoxin (thioredoxin, glutaredoxin, or NrdH) via a pair of conserved active site cysteines, the class III RNRs examined to date use formate as the reductant. Here, we report that reaction of the Escherichia coli class III RNR with CTP (substrate) and ATP (allosteric effector) in the absence of formate leads to loss of the G• concomitant with stoichiometric formation of a new radical species and a “trapped” cytidine derivative that can break down to cytosine. Addition of formate to the new species results in recovery of 80% of the G• and reduction of the cytidine derivative, proposed to be 3′-keto-deoxycytidine, to dCTP and a small amount of cytosine. The structure of the new radical has been identified by 9.5 and 140 GHz EPR spectroscopy on isotopically labeled varieties of the protein to be a thiosulfuranyl radical [RSSR[subscript 2]]•, composed of a cysteine thiyl radical stabilized by an interaction with a methionine residue. The presence of a stable radical species on the reaction pathway rationalizes the previously reported [[superscript 3]H]-(k[subscript cat]/K[subscript M]) isotope effect of 2.3 with [[superscript 3]H]-formate, requiring formate to exchange between the active site and solution during nucleotide reduction. Analogies with the disulfide anion radical proposed to provide the reducing equivalent to the 3′-keto-deoxycytidine intermediate by the class I and II RNRs provide further evidence for the involvement of thiyl radicals in the reductive half-reaction catalyzed by all RNRs.NWO of the Netherlands (Rubicon Fellowship)Singapore. Agency for Science, Technology and ResearchNational Institutes of Health (U.S.) (Grant GM29595)National Institutes of Health (U.S.) (Grant EB-002804)National Institutes of Health (U.S.) (Grant EB-002026

KOPS - The Institutional Repository of the University of Konstanz

The Francis Crick Institute

Ethyl 2-hydroxy-5-oxo-4-phenyl-2,3,4,5-tetrahydropyrano[3,2-c]chromene-2-carboxylate

Author: Bailin Li
Farrugia
Guangcun Zhang
Sheldrick
Vaijayanthi
Wei Zhang
Yifeng Wang
Publication venue: International Union of Crystallography
Publication date
Field of study

The main structural unit of the title compoud, C21H18O6, is a fused three-ring group consisting of coumarin and tetrahydropyrane ring systems. Two C atoms of the tetrahydropyran ring are displaced by 0.295 (3) and −0.360 (2) Å from the mean plane of coumarin ring. The dihedral angle between the phenyl and coumarin rings is 73.94 (3)°. Intermolecular O—H⋯O hydrogen bonds are present in the crystal structure

Crossref

PubMed Central

A Ferredoxin Disulfide Reductase Delivers Electrons to the to the Methanosarcina barkeri Class III Ribonucleotide Reductase

Author: Elliott Sean J.
Ferry James G.
Li Bin
Prakash Divya
Stubbe JoAnne
Wei Yifeng
Publication venue: 'American Chemical Society (ACS)'
Publication date: 01/10/2015
Field of study

Two subtypes of class III anaerobic ribonucleotide reductases (RNRs) studied so far couple the reduction of ribonucleotides to the oxidation of formate, or the oxidation of NADPH via thioredoxin and thioredoxin reductase. Certain methanogenic archaea contain a phylogenetically distinct third subtype of class III RNR, with distinct active-site residues. Here we report the cloning and recombinant expression of the Methanosarcina barkeri class III RNR and show that the electrons required for ribonucleotide reduction can be delivered by a [4Fe-4S] protein ferredoxin disulfide reductase, and a conserved thioredoxin-like protein NrdH present in the RNR operon. The diversity of class III RNRs reflects the diversity of electron carriers used in anaerobic metabolism.Singapore. Agency for Science, Technology and ResearchNational Institutes of Health (U.S.) (Grant GM081393

DSpace@MIT

Crossref

PubMed Central

(4R)-Ethyl 4-(4-chlorophenyl)-2-hydroxy-5-oxo-2,3,4,5-tetrahydropyrano[3,2-c]chromene-2-carboxylate. Corrigendum

Author: Farrugia
Farrugia
Flack
Fylaktakidou
Guangcun Zhang
Hoult
Sheldrick
Wei Zhang
Xiangsheng Xu
Yifeng Wang
Zhang
Publication venue: International Union of Crystallography
Publication date: 01/01/2010
Field of study

Corrigendum to Acta Cryst. (2010), E66, o217

Crossref

Directory of Open Access Journals

PubMed Central