Search CORE

17 research outputs found

Parameter Sharing in Coagent Networks

Author: Zini Modjtaba Shokrian
Publication venue
Publication date: 28/01/2020
Field of study

In this paper, we aim to prove the theorem that generalizes the Coagent Network Policy Gradient Theorem (Kostas et. al., 2019) to the context where parameters are shared among the function approximators involved. This provides the theoretical foundation to use any pattern of parameter sharing and leverage the freedom in the graph structure of the network to possibility exploit relational bias in a given task. As another application, we will apply our result to give a more intuitive proof for the Hierarchical Option Critic Policy Gradient Theorem, first shown in (Riemer et. al., 2019)

arXiv.org e-Print Archive

The Smallest Interacting Universe

Author: Brown Adam R.
Freedman Michael
Zini Modjtaba Shokrian
Publication venue
Publication date: 01/08/2022
Field of study

The co-emergence of locality between the Hamiltonian and initial state of the universe is studied in a simple toy model. We hypothesize a fundamental loss functional for the combined Hamiltonian and quantum state and minimize it by gradient descent. This minimization yields a tensor product structure simultaneously respected by both the Hamiltonian and the state, suggesting that locality can emerge by a process analogous to spontaneous symmetry breaking. We discuss the relevance of this program to the arrow of time problem. In our toy model, we interpret the emergence of a tensor factorization as the appearance of individual degrees of freedom within a previously undifferentiated (raw) Hilbert space. Earlier work [5, 6] looked at the emergence of locality in Hamiltonians only, and found strong numerical confirmation of that raw Hilbert spaces of

\dim = n

are unstable and prefer to settle on tensor factorization when

n=pq

is not prime, and in [6] even primes were seen to "factor" after first shedding a small summand, e.g.

7=1+2\cdot 3

. This was found in the context of a rather general potential functional

F

on the space of metrics

\{g_{ij}\}

\mathfrak{su}(n)

, the Lie algebra of symmetries. This emergence of qunits through operator-level spontaneous symmetry breaking (SSB) may help us understand why the world seems to consist of myriad interacting degrees of freedom. But understanding why the universe has an initial Hamiltonian

H_0

with a many-body structure is of limited conceptual value unless the initial state,

|\psi_0\rangle

, is also structured by this tensor decomposition. Here we adapt

F

to become a functional on

\{g,|\psi_0\rangle\}=(\text{metrics})\times (\text{initial states})

, and find SSB now produces a conspiracy between

g

and

|\psi_0\rangle

, where they simultaneously attain low entropy by settling on the same qubit decomposition

arXiv.org e-Print Archive

Quantum computing with Octonions

Author: Freedman Michael
Shokrian-Zini Modjtaba
Wang Zhenghan
Publication venue
Publication date: 07/10/2019
Field of study

There are two schools of "measurement-only quantum computation". The first ([11]) using prepared entanglement (cluster states) and the second ([4]) using collections of anyons, which according to how they were produced, also have an entanglement pattern. We abstract the common principle behind both approaches and find the notion of a graph or even continuous family of equiangular projections. This notion is the leading character in the paper. The largest continuous family, in a sense made precise in Corollary 4.2, is associated with the octonions and this example leads to a universal computational scheme. Adiabatic quantum computation also fits into this rubric as a limiting case: nearby projections are nearly equiangular, so as a gapped ground state space is slowly varied the corrections to unitarity are small.Comment: Added some new results in section

arXiv.org e-Print Archive

eScholarship - University of California

Improved Bayesian Regret Bounds for Thompson Sampling in Reinforcement Learning

Author: Aggarwal Vaneet
Moradipari Ahmadreza
Pedramfar Mohammad
Zini Modjtaba Shokrian
Publication venue
Publication date: 30/10/2023
Field of study

In this paper, we prove the first Bayesian regret bounds for Thompson Sampling in reinforcement learning in a multitude of settings. We simplify the learning problem using a discrete set of surrogate environments, and present a refined analysis of the information ratio using posterior consistency. This leads to an upper bound of order

\widetilde{O}(H\sqrt{d_{l_1}T})

in the time inhomogeneous reinforcement learning problem where

H

is the episode length and

d_{l_1}

is the Kolmogorov

l_1-

dimension of the space of environments. We then find concrete bounds of

d_{l_1}

in a variety of settings, such as tabular, linear and finite mixtures, and discuss how how our results are either the first of their kind or improve the state-of-the-art.Comment: 37th Conference on Neural Information Processing Systems (NeurIPS 2023

arXiv.org e-Print Archive

Quantum simulation of battery materials using ionic pseudopotentials

Author: Arrazola Juan Miguel
Casares Pablo A. M.
Delgado Alain
Mueller Jonathan E.
Reis Roberto dos
Voigt Arne-Christian
Zini Modjtaba Shokrian
Publication venue
Publication date: 15/02/2023
Field of study

Ionic pseudopotentials are widely used in classical simulations of materials to model the effective potential due to the nucleus and the core electrons. Modeling fewer electrons explicitly results in a reduction in the number of plane waves needed to accurately represent the states of a system. In this work, we introduce a quantum algorithm that uses pseudopotentials to reduce the cost of simulating periodic materials on a quantum computer. We use a qubitization-based quantum phase estimation algorithm that employs a first-quantization representation of the Hamiltonian in a plane-wave basis. We address the challenge of incorporating the complexity of pseudopotentials into quantum simulations by developing highly-optimized compilation strategies for the qubitization of the Hamiltonian. This includes a linear combination of unitaries decomposition that leverages the form of separable pseudopotentials. Our strategies make use of quantum read-only memory subroutines as a more efficient alternative to quantum arithmetic. We estimate the computational cost of applying our algorithm to simulating lithium-excess cathode materials for batteries, where more accurate simulations are needed to inform strategies for gaining reversible access to the excess capacity they offer. We estimate the number of qubits and Toffoli gates required to perform sufficiently accurate simulations with our algorithm for three materials: lithium manganese oxide, lithium nickel-manganese oxide, and lithium manganese oxyfluoride. Our optimized compilation strategies result in a pseudopotential-based quantum algorithm with a total runtime four orders of magnitude lower than the previous state of the art for a fixed target accuracy

arXiv.org e-Print Archive