Search CORE

1,346 research outputs found

Word Representation Models for Morphologically Rich Languages in Neural Machine Translation

Author: Cohn Trevor
Haffari Gholamreza
He Xuanli
Vylomova Ekaterina
Publication venue
Publication date: 14/06/2016
Field of study

Dealing with the complex word forms in morphologically rich languages is an open problem in language processing, and is particularly important in translation. In contrast to most modern neural systems of translation, which discard the identity for rare words, in this paper we propose several architectures for learning word representations from character and morpheme level word decompositions. We incorporate these representations in a novel machine translation model which jointly learns word alignments and translations via a hard attention mechanism. Evaluating on translating from several morphologically rich languages into English, we show consistent improvements over strong baseline methods, of between 1 and 1.5 BLEU points

arXiv.org e-Print Archive

Crossref

Monash University Research Portal

Fast Ensemble Smoothing

Author: A Gelb
AE Bryson Jr
B Moore John
CH Bishop
D Entekhabi
D Stammer
Dennis McLaughlin
DT Pham
E Lorenz
G Evensen
G Evensen
G Evensen
HE Rauch
JL Anderson
JS Whitaker
L Nerger
Sai Ravela
SE Cohn
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 31/03/2006
Field of study

Smoothing is essential to many oceanographic, meteorological and hydrological applications. The interval smoothing problem updates all desired states within a time interval using all available observations. The fixed-lag smoothing problem updates only a fixed number of states prior to the observation at current time. The fixed-lag smoothing problem is, in general, thought to be computationally faster than a fixed-interval smoother, and can be an appropriate approximation for long interval-smoothing problems. In this paper, we use an ensemble-based approach to fixed-interval and fixed-lag smoothing, and synthesize two algorithms. The first algorithm produces a linear time solution to the interval smoothing problem with a fixed factor, and the second one produces a fixed-lag solution that is independent of the lag length. Identical-twin experiments conducted with the Lorenz-95 model show that for lag lengths approximately equal to the error doubling time, or for long intervals the proposed methods can provide significant computational savings. These results suggest that ensemble methods yield both fixed-interval and fixed-lag smoothing solutions that cost little additional effort over filtering and model propagation, in the sense that in practical ensemble application the additional increment is a small fraction of either filtering or model propagation costs. We also show that fixed-interval smoothing can perform as fast as fixed-lag smoothing and may be advantageous when memory is not an issue

arXiv.org e-Print Archive

Crossref

Measuring the Behavioural Component of the S&P 500 and its Relationship to Financial Stress and Aggregated Earnings Surprises

Author: Andreou
Ataullah
Baker
Ball
Barberis
Barberis
Benartzi
Bergman
Berkman
Billio
Black
Brown
Caballero
Cohn
Cready
Delgado-García
Delis
DeMiguel
Durbin
Engle
Fama
Fama
Fishburn
Grossman
Hakkio
Hamilton
He
Healy
Hirshleifer
Hribar
Kahneman
Kahneman
Kindleberger
Kliesen
Kothari
Lamont
Latham
Lee
Long
Malmendier
Maule
Merton
Mian
Miller
Mitchell
Morgenstern
Seybert
Sharpe
Shiller
Shleifer
Siegel
Starmer
Tibiletti
Tversky
Vinogradov
Zakamouline
Zakamouline
Zeeman
Zolotoy
Zona
Publication venue: John Wiley & Sons Ltd, 9600 Garsington Road, Oxford OX4 2DQ, UK and 350 Main Street, Malden, MA, 02148, USA
Publication date: 30/07/2018
Field of study

Scholars in management and economics have shown increasing interest in isolating the behavioural dimension of market evolution. Indeed, by improving forecast accuracy and precision, this exercise would certainly help firms to anticipate economic fluctuations, thus leading to more profitable business and investment strategies. Yet, how to extract the behavioural component from real market data remains an open question. By using monthly data on the returns of the constituents of the S&P 500 index, we propose a Bayesian methodology to measure the extent to which market data conform to what is predicted by prospect theory (the behavioural perspective), relative to the (standard) subjective expected utility theory baseline.We document a significant behavioural component that reaches its peaks during recession periods and is correlated to measures of financial volatility, market sentiment and financial stress with expected sign. Moreover, the behavioural component decreases around macroeconomic corporate earnings news, while it reacts positively to the number of surprising announcements

Crossref

ZENODO

Archivio istituzionale della ricerca - Università degli Studi di Venezia Ca' Foscari

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Archivio istituzionale della ricerca - Università di Padova

IMBERT: Making BERT Immune to Insertion-based Backdoor Attacks

Author: Cohn Trevor
He Xuanli
Rubinstein Benjamin
Wang Jun
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/07/2023
Field of study

Backdoor attacks are an insidious security threat against machine learning models. Adversaries can manipulate the predictions of compromised models by inserting triggers into the training phase. Various backdoor attacks have been devised which can achieve nearly perfect attack success without affecting model predictions for clean inputs. Means of mitigating such vulnerabilities are underdeveloped, especially in natural language processing. To fill this gap, we introduce IMBERT, which uses either gradients or self-attention scores derived from victim models to self-defend against backdoor attacks at inference time. Our empirical studies demonstrate that IMBERT can effectively identify up to 98.5% of inserted triggers. Thus, it significantly reduces the attack success rate while attaining competitive accuracy on the clean dataset across widespread insertion-based attacks compared to two baselines. Finally, we show that our approach is model-agnostic, and can be easily ported to several pre-trained transformer models

UCL Discovery

Colossal dielectric constants in transition-metal oxides

Author: A. Loidl
A. Moreo
A. Pimenov
A. Reller
A. Seeger
A. Tselev
A.A. Bokov
A.I. Ritus
A.K. Jonscher
A.P. Kampf
A.P. Ramirez
A.R. Long
B. Kundys
B. Renner
B. Shri Parkash
B. Shri Parkash
B.A. Bender
B.E. Vugmeister
B.I. Shklovskii
C. Aebischer
C.-F. Yang
C.C. Homes
C.C. Wang
C.D. Batista
C.H. Chen
C.H. Du
C.M. Rey
Ch. Kant
D. Reznik
D. Starešinić
D. Viehland
D.C. Sinclair
E. Dagotto
F. Rivadulla
F. Schrettle
G. Chern
G. Chern
G. Deng
G. Deng
G. Grüner
G. Zang
G.A. Samara
G.E. Pike
G.P. Mazzara
H. Sillescu
H.F. Hess
I.P. Raevski
J. Dumas
J. Halblützel
J. Mira
J. Rivas
J. Sebald
J. Sichelschmidt
J. Yang
J.B. Shi
J.J. Liu
J.J. Liu
J.L. Cohn
J.L. García-Muñoz
J.L. Zhang
J.M. Tranquada
J.M. Tranquada
J.M. Tranquada
J.M. Tranquada
K. Ishizaka
K.A. Müller
K.W. Wagner
L. He
L. Zhang
L.E. Cross
M. Capizzi
M. Filippi
M. Fujimoto
M. Li
M. Li
M. Pollak
M. Uehara
M.A. Subramanian
M.A. Subramanian
M.C. Ferrarelli
M.C. Ferrarelli
M.D. Ediger
M.H. Cohen
M.J. Verkerk
N. Biškup
N. Ikeda
O. Bidault
P. Fiorenza
P. Fiorenza
P. Lunkenheimer
P. Lunkenheimer
P. Lunkenheimer
P. Lunkenheimer
P. Lunkenheimer
P. Lunkenheimer
P. Lunkenheimer
P. Lunkenheimer
P. Lunkenheimer
P. Lunkenheimer
P. Monceau
P. Simon
P.B. Littlewood
R. Viana
R.J. Cava
R.K. Grubbs
R.L. Nigro
R.M. Fleming
R.N. Bhatt
S. Krohns
S. Krohns
S. Krohns
S. Krohns
S. Krohns
S. Krohns
S. Mercone
S. Riegg
S. Yamanouchi
S.-Y. Chung
S.-Y. Chung
S.G. Ebbinghaus
S.H. Lee
S.H. Lee
S.R. Elliott
S.V. Kalinin
T. Götzfried
T. Park
T. Portengen
T. Van Dijk
T.B. Adams
T.B. Adams
T.G. Castner
U. Schneider
V. Bobnar
V. Bobnar
V. Brize
V. Sachan
V. Westphal
V.J. Emery
W. Kobayashi
W.L. McMillan
X.L. Zhou
X.Q. Liu
Y. Lin
Y. Yamada
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

Many transition-metal oxides show very large ("colossal") magnitudes of the dielectric constant and thus have immense potential for applications in modern microelectronics and for the development of new capacitance-based energy-storage devices. In the present work, we thoroughly discuss the mechanisms that can lead to colossal values of the dielectric constant, especially emphasising effects generated by external and internal interfaces, including electronic phase separation. In addition, we provide a detailed overview and discussion of the dielectric properties of CaCu3Ti4O12 and related systems, which is today's most investigated material with colossal dielectric constant. Also a variety of further transition-metal oxides with large dielectric constants are treated in detail, among them the system La2-xSrxNiO4 where electronic phase separation may play a role in the generation of a colossal dielectric constant.Comment: 31 pages, 18 figures, submitted to Eur. Phys. J. for publication in the Special Topics volume "Cooperative Phenomena in Solids: Metal-Insulator Transitions and Ordering of Microscopic Degrees of Freedom

arXiv.org e-Print Archive

OPUS Augsburg

Crossref

EDP Sciences OAI-PMH repository (1.2.0)

Rethinking Round-Trip Translation for Machine Translation Evaluation

Author: Cohn Trevor
He Xuanli
Xu Qiongkai
Zhuo Terry Yue
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/07/2023
Field of study

Automatic evaluation methods for translation often require model training, and thus the availability of parallel corpora limits their applicability to low-resource settings. Round-trip translation is a potential workaround, which can reframe bilingual evaluation into a much simpler monolingual task. Early results from the era of statistical machine translation (SMT) raised fundamental concerns about the utility of this approach, based on poor correlation with human translation quality judgments. In this paper, we revisit this technique with modern neural translation (NMT) and show that round-trip translation does allow for accurate automatic evaluation without the need for reference translations. These opposite findings can be explained through the copy mechanism in SMT that is absent in NMT. We demonstrate that round-trip translation benefits multiple machine translation evaluation tasks: i) predicting forward translation scores; ii) improving the performance of a quality estimation model; and iii) identifying adversarial competitors in shared tasks via cross-system verification

UCL Discovery

3-(4-Nitrophenyl)-N-phenyloxirane-2-carboxamide

Author: Bhatia
Farrugia
Long He
Meth-Cohn
Righi
Sheldrick
Thijs
Publication venue: International Union of Crystallography
Publication date: 01/08/2009
Field of study

The molecule of the title compound, C15H12N2O4, adopts a syn conformation with the terminal benzene rings located on the same sides of the central epoxide ring. The epoxide ring makes dihedral angles of 71.08 (18) and 60.83 (17)° with the two benzene rings. Weak intermolecular C—H⋯O hydrogen bonding is present in the crystal structure

Crossref

Directory of Open Access Journals

PubMed Central

Mitigating Backdoor Poisoning Attacks through the Lens of Spurious Correlation

Author: Cohn Trevor
He Xuanli
Rubinstein Benjamin
Wang Jun
Xu Qiongkai
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2023
Field of study

Modern NLP models are often trained over large untrusted datasets, raising the potential for a malicious adversary to compromise model behaviour. For instance, backdoors can be implanted through crafting training instances with a specific textual trigger and a target label. This paper posits that backdoor poisoning attacks exhibit a spurious correlation between simple text features and classification labels, and accordingly, proposes methods for mitigating spurious correlation as means of defence. Our empirical study reveals that the malicious triggers are highly correlated to their target labels; therefore such correlations are extremely distinguishable compared to those scores of benign features, and can be used to filter out potentially problematic instances. Compared with several existing defences, our defence method significantly reduces attack success rates across backdoor attacks, and in the case of insertion-based attacks, our method provides a near-perfect defence

UCL Discovery

Surgical treatment of tricuspid regurgitation after mitral valve surgery: a retrospective study in China

Author: A Matsunaga
AA Mangoni
BC Chang
C Izumi
CM Duran
G Bianchi
GD Dreyfus
Guo-Wei He
JH Rogers
JM Bernal
LH Cohn
MJ Antunes
PM McCarthy
RK Ghanta
SG Raja
SK Singh
T Goto
Tie-Nan Chen
V Chan
Wan-Li Lu
Wen-Bin Jing
Xiang-Rong Kong
Xiao-Cheng Liu
YJ Kim
Zhi-Peng Guo
Zong-Xiao Li
Publication venue: BMC
Publication date: 01/01/2012
Field of study

Abstract Background Functional tricuspid regurgitation (TR) occurs in patients with rheumatic mitral valve disease even after mitral valve surgery. The aim of this study was to analyze surgical results of TR after previous successful mitral valve surgery. Methods From September 1996 to September 2008, 45 patients with TR after previous mitral valve replacement underwent second operation for TR. In those, 43 patients (95.6%) had right heart failure symptoms (edema of lower extremities, ascites, hepatic congestion, etc.) and 40 patients (88.9%) had atrial fibrillation. Twenty-six patients (57.8%) were in New York Heart Association (NYHA) functional class III, and 19 (42.2%) in class IV. Previous operations included: 41 for mechanical mitral valve replacement (91.1%), 4 for bioprosthetic mitral valve replacement (8.9%), and 7 for tricuspid annuloplasty (15.6%). Results The tricuspid valves were repaired with Kay's (7 cases, 15.6%) or De Vega technique (4 cases, 8.9%). Tricuspid valve replacement was performed in 34 cases (75.6%). One patient (2.2%) died. Postoperative low cardiac output (LCO) occurred in 5 patients and treated successfully. Postoperative echocardiography showed obvious reduction of right atrium and ventricle. The anterioposterior diameter of the right ventricle decreased to 25.5 ± 7.1 mm from 33.7 ± 6.2 mm preoperatively (P < 0. 05). Conclusion TR after mitral valve replacement in rheumatic heart disease is a serious clinical problem. If it occurs or progresses late after mitral valve surgery, tricuspid valve annuloplasty or replacement may be performed with satisfactory results. Due to the serious consequence of untreated TR, aggressive treatment of existing TR during mitral valve surgery is recommended.</p

CiteSeerX

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

SEEP: Training Dynamics Grounds Latent Representation Search for Mitigating Backdoor Poisoning Attacks

Author: Cohn Trevor
He Xuanli
Rubinstein Benjamin I. P.
Wang Jun
Xu Qiongkai
Publication venue
Publication date: 19/05/2024
Field of study

Modern NLP models are often trained on public datasets drawn from diverse sources, rendering them vulnerable to data poisoning attacks. These attacks can manipulate the model's behavior in ways engineered by the attacker. One such tactic involves the implantation of backdoors, achieved by poisoning specific training instances with a textual trigger and a target class label. Several strategies have been proposed to mitigate the risks associated with backdoor attacks by identifying and removing suspected poisoned examples. However, we observe that these strategies fail to offer effective protection against several advanced backdoor attacks. To remedy this deficiency, we propose a novel defensive mechanism that first exploits training dynamics to identify poisoned samples with high precision, followed by a label propagation step to improve recall and thus remove the majority of poisoned instances. Compared with recent advanced defense methods, our method considerably reduces the success rates of several backdoor attacks while maintaining high classification accuracy on clean test sets.Comment: accepted to TAC

arXiv.org e-Print Archive