Quantum Logic Gate Synthesis as a Markov Decision Process

Alam, M. Sohaib

Quantum Logic Gate Synthesis as a Markov Decision Process

Authors: M. Sohaib Alam
Publication date: 27 December 2019
Publisher

Abstract

Reinforcement learning has witnessed recent applications to a variety of tasks in quantum programming. The underlying assumption is that those tasks could be modeled as Markov Decision Processes (MDPs). Here, we investigate the feasibility of this assumption by exploring its consequences for two of the simplest tasks in quantum programming: state preparation and gate compilation. By forming discrete MDPs, focusing exclusively on the single-qubit case, we solve for the optimal policy exactly through policy iteration. We find optimal paths that correspond to the shortest possible sequence of gates to prepare a state, or compile a gate, up to some target accuracy. As an example, we find sequences of H and T gates with length as small as 11 producing ~99% fidelity for states of the form (HT)^{n} |0> with values as large as n=10^{10}. This work provides strong evidence that reinforcement learning can be used for optimal state preparation and gate compilation for larger qubit spaces.Comment: 10 pages, 2 figures, 2 tables. Comments and feedback welcom

Similar works

Full text

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:1912.12002

Last time updated on 12/10/2020