8 research outputs found

Causal Interpretation of Self-Attention in Pre-Trained Transformers

Author: Gurwicz Yaniv
Nisimov Shami
Rohekar Raanan Y.
Publication date: 31/10/2023

We propose a causal interpretation of self-attention in the Transformer neural network architecture. We interpret self-attention as a mechanism that estimates a structural equation model for a given input sequence of symbols (tokens). The structural equation model can be interpreted, in turn, as a causal structure over the input symbols under the specific context of the input sequence. Importantly, this interpretation remains valid in the presence of latent confounders. Following this interpretation, we estimate conditional independence relations between input symbols by calculating partial correlations between their corresponding representations in the deepest attention layer. This enables learning the causal structure over an input sequence using existing constraint-based algorithms. In this sense, existing pre-trained Transformers can be utilized for zero-shot causal-discovery. We demonstrate this method by providing causal explanations for the outcomes of Transformers in two tasks: sentiment classification (NLP) and recommendation.

Comment: 37th Conference on Neural Information Processing Systems (NeurIPS 2023). arXiv admin note: text overlap with arXiv:2210.1062

arXiv.org e-Print Archive

From Temporal to Contemporaneous Iterative Causal Discovery in the Presence of Latent Confounders

Author: Gurwicz Yaniv
Nisimov Shami
Novik Gal
Rohekar Raanan Y.
Publication date: 01/06/2023

We present a constraint-based algorithm for learning causal structures from observational time-series data, in the presence of latent confounders. We assume a discrete-time, stationary structural vector autoregressive process, with both temporal and contemporaneous causal relations. One may ask if temporal and contemporaneous relations should be treated differently. The presented algorithm gradually refines a causal graph by learning long-term temporal relations before short-term ones, where contemporaneous relations are learned last. This ordering of causal relations to be learnt leads to a reduction in the required number of statistical tests. We validate this reduction empirically and demonstrate that it leads to higher accuracy for synthetic data and more plausible causal graphs for real-world data compared to state-of-the-art algorithms.

Comment: Proceedings of the 40-th International Conference on Machine Learning (ICML), 202

arXiv.org e-Print Archive

Application of foreground object patterns analysis for event detection in an innovative video surveillance system

Author: A Yilmaz
D Comaniciu
D Frejlichowski
Dariusz Frejlichowski
Katarzyna Gościewska
N Dalal
P Forczmański
P Viola
Paweł Forczmański
Radosław Hofman
Y Cheng
Y Gurwicz
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Integration of complex wavelet transform and Zernike moment for multi‐class classification

Author: A Khare
A Saju
CW Chong
CW Hsu
D Clonda
G Jemilda
GA Papakostas
HVR Mohana
I Daubechies
KR Castleman
L Wang
M Khare
M Sokolova
M Teague
M Zhenjiang
NG Pedrajas
R Rifkin
SK Hwang
T Serre
W Hu
Y Bin
Y Gurwicz
Y Lee
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Biometric recognition in surveillance scenarios: a survey

Author: A Bobick
A Dantcheva
A Jain
A Kasinski
A Milan
A Mittal
A Sodemann
A Talukder
AM Martinez
B Babenko
B He
B Wu
BK Horn
C Chen
C Cédras
C Stauffer
C Wren
CH Chan
CH Chen
CT Chu
CW Tan
D Comaniciu
D Gavrila
D Huttenlocher
D Reid
D Ross
D Weinland
DE Butler
E Saber
F Fleuret
F Jean
Fabio Narducci
GB Huang
H Ailisto
H Liu
H Proença
H Stern
H Zhou
HL Eng
Hugo Proença
I Haritaoglu
I Kim
I Tsochantaridis
J Aggarwal
J Berclaz
J Daugman
J Gu
J Han
J Klontz
J Matey
J Wright
J Yao
J Zhang
João Neves
JR Lyle
K Bashir
K Kim
L Maddalena
L Wang
M Breitenstein
M Goffredo
M Grgic
M McCahill
MA Hossain
MP Murray
MW Szeto
N Haering
N McFarlane
N Oliver
O Barnich
O Popoola
P Belhumeur
P Dollar
P KaewTrakulPong
P Tome
P Turaga
P Viola
Q Zhao
Q Zhou
R Bolle
R Gross
R Poppe
R Sanchez-Reillo
R Vezzani
R Wildes
RE Kalman
S Belongie
S Julier
S Li
S Munder
S Samangooei
S Zhou
SF Lin
Silvio Barra
SJ McKenna
T Ahonen
T Ojala
T Raty
T Zhao
TB Moeslund
TE Fortmann
U Park
V Blanz
V Krüger
W Hu
X Ji
X Mei
X Tan
X Zhang
Y Cheng
Y Gurwicz
Y Wu
Y Xu
Y Yao
YL Hou
Z Kalal
Z Liu
Publication venue: 'Springer Science and Business Media LLC'

Crossref