Search CORE

719 research outputs found

Forbidden Facts: An Investigation of Competing Objectives in Llama-2

Author: Hariharan Kaivalya
Shavit Nir
Wang Miles
Wang Tony T.
Publication venue
Publication date: 31/12/2023
Field of study

LLMs often face competing pressures (for example helpfulness vs. harmlessness). To understand how models resolve such conflicts, we study Llama-2-chat models on the forbidden fact task. Specifically, we instruct Llama-2 to truthfully complete a factual recall statement while forbidding it from saying the correct answer. This often makes the model give incorrect answers. We decompose Llama-2 into 1000+ components, and rank each one with respect to how useful it is for forbidding the correct answer. We find that in aggregate, around 35 components are enough to reliably implement the full suppression behavior. However, these components are fairly heterogeneous and many operate using faulty heuristics. We discover that one of these heuristics can be exploited via a manually designed adversarial attack which we call The California Attack. Our results highlight some roadblocks standing in the way of being able to successfully interpret advanced ML systems. Project website available at https://forbiddenfacts.github.io .Comment: Accepted to the ATTRIB and SoLaR workshops at NeurIPS 2023; (v3: clarified experimental details

arXiv.org e-Print Archive

Cliff-Learning

Author: Rosenfeld Jonathan S.
Shavit Nir
Wang Tony T.
Zablotchi Igor
Publication venue
Publication date: 14/02/2023
Field of study

We study the data-scaling of transfer learning from foundation models in the low-downstream-data regime. We observe an intriguing phenomenon which we call cliff-learning. Cliff-learning refers to regions of data-scaling laws where performance improves at a faster than power law rate (i.e. regions of concavity on a log-log scaling plot). We conduct an in-depth investigation of foundation-model cliff-learning and study toy models of the phenomenon. We observe that the degree of cliff-learning reflects the degree of compatibility between the priors of a learning algorithm and the task being learned.Comment: 13 page

arXiv.org e-Print Archive

Eigen electric moments of magnetic-dipolar modes in quasi-2D ferrite disk particles

Author: E. O. Kamenetskii
E.O. Kamenetskii
E.O. Kamenetskii
E.O. Kamenetskii
E.O. Kamenetskii
J. Miltat
J.F. Dillon Jr.
K.Yu. Guslienko
M. Sigalov
R. Shavit
T. Shinjo
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 20/08/2007
Field of study

A property associated with a vortex structure becomes evident from an analysis of confinement phenomena of magnetic oscillations in a quasi-2D ferrite disk with a dominating role of magnetic-dipolar (non-exchange-interaction) spectra. The vortices are guaranteed by the chiral edge states of magnetic-dipolar modes which result in appearance of eigen electric moments oriented normally to the disk plane. Due to the eigen-electric-moment properties, a ferrite disk placed in a microwave cavity is strongly affected by the cavity RF electric field with a clear evidence for multi-resonance oscillations. For different cavity parameters, one may observe the "resonance absorption" and "resonance repulsion" behaviors

arXiv.org e-Print Archive

Crossref

Batalin-Vilkovisky Integrals in Finite Dimensions

Author: G. Zavattaro
G. Zavattaro
J. Vitek
J.A. Bergstra
M. Odersky
M.J. Butler
N. Busi
N. Shavit
R. Amadio
R. Bruni
R. Gorrieri
R. Milner
R. Nicola De
S. Bhiri
T. Chothia
T. Harris
V. Danos
V. Danos
Publication venue
Publication date: 24/10/2006
Field of study

The Batalin-Vilkovisky method (BV) is the most powerful method to analyze functional integrals with (infinite-dimensional) gauge symmetries presently known. It has been invented to fix gauges associated with symmetries that do not close off-shell. Homological Perturbation Theory is introduced and used to develop the integration theory behind BV and to describe the BV quantization of a Lagrangian system with symmetries. Localization (illustrated in terms of Duistermaat-Heckman localization) as well as anomalous symmetries are discussed in the framework of BV.Comment: 35 page

arXiv.org e-Print Archive

CiteSeerX

Crossref

HAL AMU

Type Inference for Deadlock Detection in a Multithreaded Polymorphic Typed Assembly Language

Author: Alastair R. Beresford
Andrew Birrell
Bryan Cantrill
Chandrasekhar Boyapati
Cormac Flanagan
David Cunningham
E. G. Coffman
Francisco Martins
Greg Morrisett
Guillaume Brat
Gérard Boudol
Kohei Suenaga
Leonidas I. Kontothanassis
Nir Shavit
Rahul Agarwal
Rahul Agarwal
Simon Gay
Thomas Ball
Tiago Cogumbreiro
Tiago Cogumbreiro
Vasco T. Vasconcelos
Vasco T. Vasconcelos
Publication venue: 'Open Publishing Association'
Publication date: 12/01/2010
Field of study

We previously developed a polymorphic type system and a type checker for a multithreaded lock-based polymorphic typed assembly language (MIL) that ensures that well-typed programs do not encounter race conditions. This paper extends such work by taking into consideration deadlocks. The extended type system verifies that locks are acquired in the proper order. Towards this end we require a language with annotations that specify the locking order. Rather than asking the programmer (or the compiler's backend) to specifically annotate each newly introduced lock, we present an algorithm to infer the annotations. The result is a type checker whose input language is non-decorated as before, but that further checks that programs are exempt from deadlocks

arXiv.org e-Print Archive

CiteSeerX

Crossref

Directory of Open Access Journals

Higher education and unemployment in Europe : an analysis of the academic subject and national effects

Author: A Woodley
C Crouch
DB Audretsch
G Jones
H Ehlert
H Schomburg
H Siebert
HG Bloemen
Ilias Livanos
Imanol Núñez
International Labour Organization
J Gines
JJ Paul
M Gangl
M Spence
MP Moreau
O Kivinen
OECD
OECD
R Lucas
R Moscati
RH Topel
S Nickell
S Nickell
T Heinze
T Plumper
U Teichler
U Teichler
United Nations
Y Shavit
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/04/2010
Field of study

This paper examines the impact of an academic degree and field of study on short and long-term unemployment across Europe (EU15). Labour Force Survey (LFS) data on over half a million individuals are utilised for that purpose. The harmonized LFS classification of level of education and field of study overcomes past problems of comparability across Europe. The study analyses (i) the effect of an academic degree at a European level, (ii) the specific effect of 14 academic subjects and (iii) country specific effects. The results indicate that an academic degree is more effective on reducing the likelihood of short-term than long-term unemployment. This general pattern even though it is observed for most of the academic subjects its levels show significant variation across disciplines and countries

Crossref

Warwick Research Archives Portal Repository

Laboratory investigation of lateral dispersion within dense arrays of randomly distributed cylinders at transitional Reynolds number

Author: Corrsin S.
Finnigan J. J.
Heidi M. Nepf
Hill R. J.
Kobashi D.
Kundu P. K.
Lienhard J. H.
Lightbody A. F.
Masuoka T.
Schwarzenbach R. P.
Shavit U.
Takatsu Y.
Tanino Y.
Yukie Tanino
Publication venue: 'AIP Publishing'
Publication date: 01/03/2009
Field of study

Published versio

Aberdeen University Research

Crossref

DSpace@MIT

Spiral - Imperial College Digital Repository

An abort-aware model of transactional programming

Author: D. Peled
G. Ramalingam
K. Etessami
N. Shavit
P. Bernstein
R. Alur
R. Alur
R. Alur
T. Ball
V. Kahlon
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008
Field of study

Abstract There has been a lot of recent research on transaction-based concurrent programming, aimed at offering an easier concurrent programming paradigm that enables programmers to better exploit the parallelism of modern multi-processor machines, such as multi-core microprocessors. We introduce Transactional State Machines (TSMs) as an abstract finite-data model of transactional shared-memory concurrent programs. TSMs are a variant of concurrent boolean programs (or concurrent extended recursive state machines) augmented with additional constructs for specifying potentially nested transactions. Namely, some procedures (or code segments) can be marked as transactions and are meant to be executed “atomically”, and there are also explicit commit and abort operations for transactions. The TSM model is non-blocking and allows interleaved executions where multiple processes can simultaneously be executing inside transactions. It also allows nested transactions, transactions which may never terminate, and transactions which may be aborted explicitly, or aborted automatically by the run-time environment due to memory conflicts. We show that concurrent executions of TSMs satisfy a correctness criterion closely related to serializability, which we call stutter-serializability, with respect to shared memory. We initiate a study of model checking problems for TSMs. Model checking arbitrary TSMs is easily seen to be undecidable, but we show it is decidable in the following case: when recursion The work of K. Etessami was done partly while visiting Microsof

CiteSeerX

Crossref

Edinburgh Research Explorer

An Abort-Aware Model of Transactional Programming

Author: D. Peled
G. Ramalingam
K. Etessami
N. Shavit
P. Bernstein
R. Alur
R. Alur
R. Alur
T. Ball
V. Kahlon
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008
Field of study

Abstract. There has been a lot of recent research on transaction-based concurrent programming, aimed at offering an easier concurrent programming paradigm that enables programmers to better exploit the parallelism of modern multi-processor machines, such as multi-core microprocessors. We introduce Transactional State Machines (TSMs) as an abstract finite-data model of transactional shared-memory concurrent programs. TSMs are a variant of concurrent boolean programs (or concurrent extended recursive state machines) augmented with additional constructs for specifying potentially nested transactions. Namely, some procedures (or code segments) can be marked as transactions and are meant to be executed “atomically”, and there are also explicit commit and abort operations for transactions. The TSM model is non-blocking and allows interleaved executions where multiple processes can simultaneously be executing inside transactions. It also allows nested transactions, transactions which may never terminate, and transactions which may be aborted explicitly, or aborted automatically by the run-time environment due to memory conflicts. We show that concurrent executions of TSMs satisfy a correctness criterion closely related to serializability, which we call stutter-serializability, with respect to shared memory. We initiate a study of model checking problems for TSMs. Model checking arbitrary TSMs is easily seen to be undecidable, but we show it is decidable in the following case: when recursion is exclusively used inside transactions in all (but one) of the processes, we show that model checking such TSMs against all stutterinvariant ω-regular properties of shared memory is decidable.

CiteSeerX

Crossref

Edinburgh Research Explorer

Design of Personal Health Libraries for People Returning from Incarceration in the United States

Author: Brandt Cynthia A
Campbell Britton Meredith
Fooladi Hadi
Foumakoye Marisol
Harikrishnan Vignesh
Levi Amanda
Mccall Terika
Peng Mary
Puglisi Lisa B
Saunders Monya
Shavit Shira
Swaminath Meera
Teng Sarah
Wang Emily A
Wang Karen H
Workman T Elizabeth
Yin Ying
Zeng-Treitler Qing
Zhou Kristal
Publication venue
Publication date: 03/01/2024
Field of study

ScholarSpace at University of Hawai'i at Manoa