256 research outputs found
A General Theory of Equivariant CNNs on Homogeneous Spaces
We present a general theory of Group equivariant Convolutional Neural
Networks (G-CNNs) on homogeneous spaces such as Euclidean space and the sphere.
Feature maps in these networks represent fields on a homogeneous base space,
and layers are equivariant maps between spaces of fields. The theory enables a
systematic classification of all existing G-CNNs in terms of their symmetry
group, base space, and field type. We also consider a fundamental question:
what is the most general kind of equivariant linear map between feature spaces
(fields) of given types? Following Mackey, we show that such maps correspond
one-to-one with convolutions using equivariant kernels, and characterize the
space of such kernels
Coordinate Independent Convolutional Networks -- Isometry and Gauge Equivariant Convolutions on Riemannian Manifolds
Motivated by the vast success of deep convolutional networks, there is a
great interest in generalizing convolutions to non-Euclidean manifolds. A major
complication in comparison to flat spaces is that it is unclear in which
alignment a convolution kernel should be applied on a manifold. The underlying
reason for this ambiguity is that general manifolds do not come with a
canonical choice of reference frames (gauge). Kernels and features therefore
have to be expressed relative to arbitrary coordinates. We argue that the
particular choice of coordinatization should not affect a network's inference
-- it should be coordinate independent. A simultaneous demand for coordinate
independence and weight sharing is shown to result in a requirement on the
network to be equivariant under local gauge transformations (changes of local
reference frames). The ambiguity of reference frames depends thereby on the
G-structure of the manifold, such that the necessary level of gauge
equivariance is prescribed by the corresponding structure group G. Coordinate
independent convolutions are proven to be equivariant w.r.t. those isometries
that are symmetries of the G-structure. The resulting theory is formulated in a
coordinate free fashion in terms of fiber bundles. To exemplify the design of
coordinate independent convolutions, we implement a convolutional network on
the M\"obius strip. The generality of our differential geometric formulation of
convolutional networks is demonstrated by an extensive literature review which
explains a large number of Euclidean CNNs, spherical CNNs and CNNs on general
surfaces as specific instances of coordinate independent convolutions.Comment: The implementation of orientation independent M\"obius convolutions
is publicly available at https://github.com/mauriceweiler/MobiusCNN
Gauge Equivariant Convolutional Networks and the Icosahedral CNN
The principle of equivariance to symmetry transformations enables a
theoretically grounded approach to neural network architecture design.
Equivariant networks have shown excellent performance and data efficiency on
vision and medical imaging problems that exhibit symmetries. Here we show how
this principle can be extended beyond global symmetries to local gauge
transformations. This enables the development of a very general class of
convolutional neural networks on manifolds that depend only on the intrinsic
geometry, and which includes many popular methods from equivariant and
geometric deep learning. We implement gauge equivariant CNNs for signals
defined on the surface of the icosahedron, which provides a reasonable
approximation of the sphere. By choosing to work with this very regular
manifold, we are able to implement the gauge equivariant convolution using a
single conv2d call, making it a highly scalable and practical alternative to
Spherical CNNs. Using this method, we demonstrate substantial improvements over
previous methods on the task of segmenting omnidirectional images and global
climate patterns.Comment: Proceedings of the International Conference on Machine Learning
(ICML), 201
General -Equivariant Steerable CNNs
The big empirical success of group equivariant networks has led in recent
years to the sprouting of a great variety of equivariant network architectures.
A particular focus has thereby been on rotation and reflection equivariant CNNs
for planar images. Here we give a general description of -equivariant
convolutions in the framework of Steerable CNNs. The theory of Steerable CNNs
thereby yields constraints on the convolution kernels which depend on group
representations describing the transformation laws of feature spaces. We show
that these constraints for arbitrary group representations can be reduced to
constraints under irreducible representations. A general solution of the kernel
space constraint is given for arbitrary representations of the Euclidean group
and its subgroups. We implement a wide range of previously proposed and
entirely new equivariant network architectures and extensively compare their
performances. -steerable convolutions are further shown to yield
remarkable gains on CIFAR-10, CIFAR-100 and STL-10 when used as a drop-in
replacement for non-equivariant convolutions.Comment: Conference on Neural Information Processing Systems (NeurIPS), 201
Gravitational radiation from gamma-ray bursts as observational opportunities for LIGO and VIRGO
Gamma-ray bursts are believed to originate in core-collapse of massive stars.
This produces an active nucleus containing a rapidly rotating Kerr black hole
surrounded by a uniformly magnetized torus represented by two counter-oriented
current rings. We quantify black hole spin-interactions with the torus and
charged particles along open magnetic flux-tubes subtended by the event
horizon. A major output of Egw=4e53 erg is radiated in gravitational waves of
frequency fgw=500 Hz by a quadrupole mass-moment in the torus. Consistent with
GRB-SNe, we find (i) Ts=90s (tens of s, Kouveliotou et al. 1993), (ii)
aspherical SNe of kinetic energy Esn=2e51 erg (2e51 erg in SN1998bw, Hoeflich
et al. 1999) and (iii) GRB-energies Egamma=2e50 erg (3e50erg in Frail et al.
2001). GRB-SNe occur perhaps about once a year within D=100Mpc. Correlating
LIGO/Virgo detectors enables searches for nearby events and their spectral
closure density 6e-9 around 250Hz in the stochastic background radiation in
gravitational waves. At current sensitivity, LIGO-Hanford may place an upper
bound around 150MSolar in GRB030329. Detection of Egw thus provides a method
for identifying Kerr black holes by calorimetry.Comment: to appear in PRD, 49
Artificial Intelligence for Science in Quantum, Atomistic, and Continuum Systems
Advances in artificial intelligence (AI) are fueling a new paradigm of
discoveries in natural sciences. Today, AI has started to advance natural
sciences by improving, accelerating, and enabling our understanding of natural
phenomena at a wide range of spatial and temporal scales, giving rise to a new
area of research known as AI for science (AI4Science). Being an emerging
research paradigm, AI4Science is unique in that it is an enormous and highly
interdisciplinary area. Thus, a unified and technical treatment of this field
is needed yet challenging. This work aims to provide a technically thorough
account of a subarea of AI4Science; namely, AI for quantum, atomistic, and
continuum systems. These areas aim at understanding the physical world from the
subatomic (wavefunctions and electron density), atomic (molecules, proteins,
materials, and interactions), to macro (fluids, climate, and subsurface) scales
and form an important subarea of AI4Science. A unique advantage of focusing on
these areas is that they largely share a common set of challenges, thereby
allowing a unified and foundational treatment. A key common challenge is how to
capture physics first principles, especially symmetries, in natural systems by
deep learning methods. We provide an in-depth yet intuitive account of
techniques to achieve equivariance to symmetry transformations. We also discuss
other common technical challenges, including explainability,
out-of-distribution generalization, knowledge transfer with foundation and
large language models, and uncertainty quantification. To facilitate learning
and education, we provide categorized lists of resources that we found to be
useful. We strive to be thorough and unified and hope this initial effort may
trigger more community interests and efforts to further advance AI4Science
- …