Search CORE

1,192 research outputs found

Learning Where To Look -- Generative {NAS} is Surprisingly Efficient

Author: Jung S.
Keuper M.
Lukasik J.
Publication venue
Publication date: 01/01/2022
Field of study

Preserving In-Context Learning ability in Large Language Model Fine-tuning

Author: Dhillon Inderjit S
Hsieh Cho-Jui
Kumar Sanjiv
Li Daliang
Lukasik Michal
Si Si
Wang Yihan
Yu Felix
Publication venue
Publication date: 01/11/2022
Field of study

Pretrained large language models (LLMs) are strong in-context learners that are able to perform few-shot learning without changing model parameters. However, as we show, fine-tuning an LLM on any specific task generally destroys its in-context ability. We discover an important cause of this loss, format specialization, where the model overfits to the format of the fine-tuned task and is unable to output anything beyond this format. We further show that format specialization happens at the beginning of fine-tuning. To solve this problem, we propose Prompt Tuning with MOdel Tuning (ProMoT), a simple yet effective two-stage fine-tuning framework that preserves in-context abilities of the pretrained model. ProMoT first trains a soft prompt for the fine-tuning target task, and then fine-tunes the model itself with this soft prompt attached. ProMoT offloads task-specific formats into the soft prompt that can be removed when doing other in-context tasks. We fine-tune mT5 XXL with ProMoT on natural language inference (NLI) and English-French translation and evaluate the in-context abilities of the resulting models on 8 different NLP tasks. ProMoT achieves similar performance on the fine-tuned tasks compared with vanilla fine-tuning, but with much less reduction of in-context learning performances across the board. More importantly, ProMoT shows remarkable generalization ability on tasks that have different formats, e.g. fine-tuning on a NLI binary classification task improves the model's in-context ability to do summarization (+0.53 Rouge-2 score compared to the pretrained model), making ProMoT a promising method to build general purpose capabilities such as grounding and reasoning into LLMs with small but high quality datasets. When extended to sequential or multi-task training, ProMoT can achieve even better out-of-domain generalization performance

arXiv.org e-Print Archive

On the elliptical flow in asymmetric collisions and nuclear equation of state

Author: A Bohnet
AB Larionov
C Alt
Ch Hartnack
GF Bertsch
H Ch Hartnack
H Sorge
H Stöcker
H Stöcker
J Aichelin
J Lukasik
JY Ollitrault
LG Moretto
M Gyulassy
MB Tsang
R Popescu
S Voloshin
SUNEEL KUMAR
TZ Yan
VARINDERJIT KAUR
Y Zhang
YM Zheng
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 30/09/2010
Field of study

We here present the results of elliptical flow for the collision of different asymmetric nuclei (10Ne20 +13 Al27, 18Ar40 +21 Sc45, 30Zn64 +28 Ni58, 36Kr86 +41 Nb93) by using the Quantum Molecular Dynamics (QMD) model. General features of elliptical flow are investigated with the help of theoretical simulations. The simulations are performed at different beam energies between 40 and 105 MeV/nucleon. A significant change can be seen from in-plane to out-of-plane elliptical flow of different fragments with incident energy. A comparison with experimental data is also made. Further, we predict, for the first time that, elliptical flow for different kind of fragments follow power law dependence ? C(Atot)? for asymmetric systems

arXiv.org e-Print Archive

Crossref

The prominent role of the heaviest fragment in multifragmentation and phase transition for hot nuclei

Author: Bonnet E.
Borderie Bernard
Bougault R.
Chbihi A.
Dayras R.
Frankland J. D.
Gagnon-Moisan F.
Galichet E.
Guinet D.
Gulminelli F.
Lautesse P.
Lukasik J.
Mercier D.
Neindre N. Le
Parlog M.
Piantelli S.
Raduta Ad. R.
Rivet M. F.
Rosato E.
Roy R.
Tamain B.
Vigilante M.
Wieleczko J. P.
Publication venue: 'World Scientific Pub Co Pte Lt'
Publication date: 23/08/2009
Field of study

The role played by the heaviest fragment in partitions of multifragmenting hot nuclei is emphasized. Its size/charge distribution (mean value, fluctuations and shape) gives information on properties of fragmenting nuclei and on the associated phase transition.Comment: 11 pages, Proceedings of IWND09, August 23-25, Shanghai (China

arXiv.org e-Print Archive

HAL - Normandie Université

HAL-IN2P3

Archivio della ricerca - Università degli studi di Napoli Federico II

HAL-CEA

Hal-Diderot

Coincidence measurement of residues and light particles in the reaction 56Fe+p at 1 GeV per nucleon with SPALADIN

Author: Aumann T.
Bacri C. O.
Benlliure J.
Bianchin S.
Boudard A.
Brzychczyk J.
Böhmer M.
Casarejos E.
Combet M.
Donadille L.
Ducret J. E.
Fernandez-Ordoñez M.
Fèvre A. Le
Gentil E. Le
Gernhäuser R.
Johansson H.
Kezzar K.
Kurtukian-Nieto T.
Lafriakh A.
Lavaud F.
Leray S.
Lukasik J.
Lynen U.
Lühning J.
Müller W. F.
Pawlowski P.
Pietri S.
Rejmund F.
Schwarz C.
Sfienti C.
Simon H.
Trautmann W.
Volant C.
Yordanov O.
Publication venue: 'American Physical Society (APS)'
Publication date: 15/11/2007
Field of study

The spallation of

^{56}

Fe in collisions with hydrogen at 1 A GeV has been studied in inverse kinematics with the large-aperture setup SPALADIN at GSI. Coincidences of residues with low-center-of-mass kinetic energy light particles and fragments have been measured allowing the decomposition of the total reaction cross-section into the different possible de-excitation channels. Detailed information on the evolution of these de-excitation channels with excitation energy has also been obtained. The comparison of the data with predictions of several de-excitation models coupled to the INCL4 intra-nuclear cascade model shows that only GEMINI can reasonably account for the bulk of collected results, indicating that in a light system with no compression and little angular momentum, multifragmentation might not be necessary to explain the data.Comment: 4 pages, 5 figures, revised version accepted in Phys. Rev. Let

arXiv.org e-Print Archive

Chalmers Publication Library

GSI Repository

Z-dependent Barriers in Multifragmentation from Poissonian Reducibility and Thermal Scaling

Author: A. S. Botvina
B. Borderie
C. P. Montoya
D. R. Bowman
G. J. Wozniak
J. F. Dempsey
J. F. Lecolley
J. Lukasik
J. Toke
J. Toke
K. Tso
L. Beaulieu
L. G. Moretto
L. G. Moretto
L. G. Moretto
L. G. Moretto
L. G. Moretto
L. G. Moretto
L. Phair
L. Phair
L. Phair
M. B. Tsang
Y. Larochelle
Publication venue: 'American Physical Society (APS)'
Publication date: 01/01/1998
Field of study

We explore the natural limit of binomial reducibility in nuclear multifragmentation by constructing excitation functions for intermediate mass fragments (IMF) of a given element Z. The resulting multiplicity distributions for each window of transverse energy are Poissonian. Thermal scaling is observed in the linear Arrhenius plots made from the average multiplicity of each element. ``Emission barriers'' are extracted from the slopes of the Arrhenius plots and their possible origin is discussed.Comment: 15 pages including 4 .ps figures. Submitted to Phys. Rev. Letters. Also available at http://csa5.lbl.gov/moretto

arXiv.org e-Print Archive

CiteSeerX

Crossref

eScholarship - University of California

UNT Digital Library

Rapidity distribution as a probe for elliptical flow at intermediate energies

Interplay between the spectator and participant matter in heavy-ion collisions is investigated within isospin dependent quantum molecular dynamics (IQMD) model in term of rapidity distribution of light charged particles. The effect of different types and size rapidity distributions is studied in elliptical flow. The elliptical flow patterns show important role of the nearby spectator matter on the participant zone. This role is further explained on the basis of passing time of the spectator and expansion time of the participant zone. The transition from the in-plane to out-of-plane is observed only when the mid-rapidity region is included in the rapidity bin, otherwise no transition occurs. The transition energy is found to be highly sensitive towards the size of the rapidity bin, while weakly on the type of the rapidity distribution. The theoretical results are also compared with the experimental findings and are found in good agreement.Comment: 8 figure

arXiv.org e-Print Archive

Crossref

Directory of Open Access Journals

Tracing the Evolution of Temperature in Near Fermi Energy Heavy Ion Collisions

Author: A. Botvina
A. Keksis
A. Makeev
A. Martinez-Davalos
A. Menchaca-Rocha
A. Ono
A. Ruangma
D. Fabris
D. V. Shetty
E. Fioretto
E. M. Winchester
E. Martin
F. Hubert
G. Nebbia
G. Prete
G. Souliotis
G. Viesti
J. B. Natowitz
J. Cibor
J. Cibor
J. Lukasik
J. Wang
K. Hagel
L. Qin
M. A. Preston
M. Cinausero
M. Lunardon
M. Murray
M. Veselsky
N. Marie
P. Staszel
R. Alfarro
R. Wada
S. Albergo
S. J. Yennello
S. Kowalski
T. Keutgen
T. Materna
V. Rizzi
W. Zipper
Y. El Masri
Y. G. Ma
Z. Majka
Publication venue: 'American Physical Society (APS)'
Publication date: 01/01/2004
Field of study

The kinetic energy variation of emitted light clusters has been employed as a clock to explore the time evolution of the temperature for thermalizing composite systems produced in the reactions of 26A, 35A and 47A MeV

^{64}

Zn with

^{58}

Ni,

^{92}

Mo and

^{197}

Au. For each system investigated, the double isotope ratio temperature curve exhibits a high maximum apparent temperature, in the range of 10-25 MeV, at high ejectile velocity. These maximum values increase with increasing projectile energy and decrease with increasing target mass. The time at which the maximum in the temperature curve is reached ranges from 80 to 130 fm/c after contact. For each different target, the subsequent cooling curves for all three projectile energies are quite similar. Temperatures comparable to those of limiting temperature systematics are reached 30 to 40 fm/c after the times corresponding to the maxima, at a time when AMD-V transport model calculations predict entry into the final evaporative or fragmentation stage of de-excitation of the hot composite systems. Evidence for the establishment of thermal and chemical equilibrium is discussed.Comment: 9 pages, 5 figure

arXiv.org e-Print Archive

Crossref

OAKTrust Digital Repository (Texas A&M Univ)

HAL Descartes

CERN Document Server

Archivio istituzionale della ricerca - Università di Padova

Pion radii in nonlocal chiral quark model

Author: Auger G
Begemann-Blaich M L
Bellaize N
Bittiger R
Bocage F
Borderie B
Botvina A S
Bougault R
Bouriquet B
Charvet J L
Chbihi A
Dayras R
Durand D
Frankland J D
Galíchet E
Gourio D
Guinet D
Hudan S
Imme G
Lautesse P
Lavaud F
Lefèvre A
Legrain R
Lukasik J
Lynen U
López O
Müller W F J
Nalpas L
Orth H
Plagnol E
Raciti G
Rosato E
Saija A
Schwarz C
Seidel W
Sfienti C
Tamain B
Trautmann W
Trzcinski A
Turzó K
Vient E
Vigilante M
Volant C
Zwieglinski B
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 27/11/2003
Field of study

The electromagnetic radius of the charged pion and the transition radius of the neutral pion are calculated in the framework of the nonlocal chiral quark model. It is shown in this model that the contributions of vector mesons to the pion radii are noticeably suppressed in comparison with a similar contribution in the local Nambu--Jona-Lasinio model. The form-factor for the process gamma*pi+pi- is calculated for the -1 GeV^2<q^2<1.6 GeV^2. Our results are in satisfactory agreement with experimental data.Comment: 7 pages, 7 figure

arXiv.org e-Print Archive

HAL - Normandie Université

Crossref

HAL-IN2P3

Archivio della ricerca - Università degli studi di Napoli Federico II

EDP Sciences OAI-PMH repository (1.2.0)

HAL-CEA

CERN Document Server

GSI Repository