7 research outputs found

    Retrospective Loss: Looking Back to Improve Training of Deep Neural Networks

    Deep neural networks (DNNs) are powerful learning machines that have enabled breakthroughs in several domains. In this work, we introduce a new retrospective loss to improve the training of deep neural network models by utilizing the prior experience available in past model states during training. Minimizing the retrospective loss, along with the task-specific loss, pushes the parameter state at the current training step towards the optimal parameter state while pulling it away from the parameter state at a previous training step. Although a simple idea, we analyze the method and conduct comprehensive sets of experiments across domains (images, speech, text, and graphs) to show that the proposed loss results in improved performance across input domains, tasks, and architectures. Comment: Accepted at KDD 2020; the first two authors contributed equally.
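    As a rough illustration of the idea in this abstract, the sketch below adds a retrospective-style term to an ordinary training step. It is a minimal PyTorch sketch, not the paper's exact formulation: the function names, the L1 distances, the weights kappa and lam, and the use of a one-hot target as a stand-in for the "optimal" output are all assumptions made for illustration.

```python
import torch
import torch.nn.functional as F

def retrospective_term(current_out, past_out, target, kappa=2.0):
    # Pull the current output toward the target (a proxy for the optimal state)
    # and push it away from the output of an earlier snapshot of the model.
    pull = F.l1_loss(current_out, target)
    push = F.l1_loss(current_out, past_out)
    return kappa * pull - push

def training_step(model, past_model, x, y, optimizer, lam=0.5):
    # `past_model` is assumed to be a frozen copy of `model` saved a few steps earlier.
    optimizer.zero_grad()
    out = model(x)
    with torch.no_grad():
        past_out = past_model(x)
    task_loss = F.cross_entropy(out, y)
    target = F.one_hot(y, out.shape[-1]).float()   # stand-in for the optimal output
    loss = task_loss + lam * retrospective_term(out, past_out, target)
    loss.backward()
    optimizer.step()
    return loss.item()
```

    In a fuller implementation the past snapshot would be refreshed periodically and the weighting schedule would follow the paper; the sketch only shows how the retrospective term combines with the task-specific loss.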

    DeepSearch: A Simple and Effective Blackbox Attack for Deep Neural Networks

    Although deep neural networks have been very successful in image-classification tasks, they are prone to adversarial attacks. A wide variety of techniques for generating adversarial inputs has emerged, including black- and whitebox attacks on neural networks. In this paper, we present DeepSearch, a novel fuzzing-based, query-efficient, blackbox attack for image classifiers. Despite its simplicity, DeepSearch is shown to be more effective at finding adversarial inputs than state-of-the-art blackbox approaches, and it additionally generates the most subtle adversarial inputs in comparison to these approaches.
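    The abstract does not spell out the search procedure, so the following is only a generic, heavily simplified blackbox query loop, not the DeepSearch algorithm: it repeatedly perturbs the image within an L-infinity ball and keeps queries that lower the classifier's confidence in the true class. The names predict, eps, and iters are assumptions made for illustration.

```python
import numpy as np

def blackbox_attack(predict, x, true_label, eps=0.03, iters=200, seed=0):
    # `predict(image)` is assumed to return a vector of class probabilities.
    rng = np.random.default_rng(seed)
    x_adv = x.copy()
    best = predict(x_adv)[true_label]
    for _ in range(iters):
        # Random signed perturbation, kept inside the L-infinity ball around x.
        candidate = np.clip(x_adv + rng.choice([-eps, eps], size=x.shape), x - eps, x + eps)
        candidate = np.clip(candidate, 0.0, 1.0)       # stay in the valid pixel range
        probs = predict(candidate)
        if probs[true_label] < best:                   # keep queries that help
            x_adv, best = candidate, probs[true_label]
            if np.argmax(probs) != true_label:         # misclassified: attack succeeded
                break
    return x_adv
```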

    Motivational Drivers for Serial Position Effects in High-Stakes Legal Decisions

    Experts and employees in many domains make multiple similar but independent decisions in sequence. Often, the serial position of the case in the sequence influences the decision. Explanations for these serial position effects focus on the role of decision makers’ fatigue, but these effects emerge even when fatigue is unlikely. Here, we suggest that serial position effects can emerge from decision makers’ motivation to be, or to appear to be, consistent. For example, to avoid having inconsistencies revealed, decisions may become more favorable towards the side that is more likely to put a decision under scrutiny. As a context, we focus on the legal domain, in which many high-stakes decisions are made in sequence and in which there are clear institutional processes of decision scrutiny. We analyze two field datasets: 386,109 US immigration judges’ decisions on asylum requests and 20,796 jury decisions in the 18th-century London criminal court. We distinguish between five mechanisms that can drive serial position effects and examine their predictions in these settings. We find that, consistent with motivation-based explanations of serial position effects but inconsistent with fatigue-based explanations, decisions become more lenient as a function of serial position, and the effect persists over breaks. We further find, as predicted by motivational accounts, that the leniency effect is stronger among more experienced decision makers. By elucidating the different drivers of serial position effects, our investigation clarifies why they are common, when they are expected, and how to reduce them.

    Cycle-consistent Conditional Adversarial Transfer Networks

    Domain adaptation investigates the problem of cross-domain knowledge transfer where the labeled source domain and unlabeled target domain have distinct data distributions. Recently, adversarial training has been successfully applied to domain adaptation and achieved state-of-the-art performance. However, a fatal weakness remains in current adversarial models, arising from the equilibrium challenge of adversarial training. Specifically, although most existing methods are able to confuse the domain discriminator, they cannot guarantee that the source domain and target domain are sufficiently similar. In this paper, we propose a novel approach named cycle-consistent conditional adversarial transfer networks (3CATN) to handle this issue. Our approach takes care of domain alignment by leveraging adversarial training. Specifically, we condition the adversarial networks on the cross-covariance of learned features and classifier predictions to capture the multimodal structures of the data distributions. However, since the classifier predictions are not certain, strongly conditioning on them is risky when they are inaccurate. We therefore further propose that truly domain-invariant features should be translatable from one domain to the other. To this end, we introduce two feature translation losses and one cycle-consistency loss into the conditional adversarial domain adaptation networks. Extensive experiments on both classical and large-scale datasets verify that our model outperforms previous state-of-the-art methods with significant improvements.
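    Two of the ingredients described above can be sketched in a few lines of PyTorch. This is an illustrative reading of the abstract, not the paper's implementation: src_to_tgt and tgt_to_src are hypothetical feature-translation networks, and the full model additionally includes the domain discriminator, the classification loss, and the two feature translation losses.

```python
import torch
import torch.nn.functional as F

def multilinear_condition(features, logits):
    # Condition the domain discriminator on the outer product of the learned
    # features and the (softmaxed) classifier predictions, so that multimodal
    # structure in the predictions is reflected in the adversarial signal.
    p = logits.softmax(dim=1)                                  # (batch, num_classes)
    outer = torch.bmm(p.unsqueeze(2), features.unsqueeze(1))   # (batch, num_classes, feat_dim)
    return outer.flatten(1)                                     # fed to the discriminator

def cycle_consistency_loss(src_features, src_to_tgt, tgt_to_src):
    # Translate source features to the target domain and back, and penalize
    # the round-trip reconstruction error (one direction shown for brevity).
    reconstructed = tgt_to_src(src_to_tgt(src_features))
    return F.l1_loss(reconstructed, src_features)
```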

    A Survey of Unsupervised Deep Domain Adaptation
