Quantum Algorithms For Fixed Qubit Architectures
Quantum Algorithms For Fixed Qubit Architectures
2
Center for Theoretical Physics
Massachusetts Institute of Technology
Cambridge, MA 02139
Abstract
Gate model quantum computers with too many qubits to be simulated by available classical
computers are about to arrive. We present a strategy for programming these devices without
error correction or compilation. This means that the number of logical qubits is the same as
the number of qubits on the device. The hardware determines which pairs of qubits can be
addressed by unitary operators. The goal is to build quantum states that solve computational
problems such as maximizing a combinatorial objective function or minimizing a Hamiltonian.
These problems may not fit naturally on the physical layout of the qubits. Our algorithms use
a sequence of parameterized unitaries that sit on the qubit layout to produce quantum states
depending on those parameters. Measurements of the objective function (or Hamiltonian) guide
the choice of new parameters with the goal of moving the objective function up (or lowering
the energy). As an example we consider finding approximate solutions to MaxCut on 3-regular
graphs whereas the hardware is physical qubits laid out on a rectangular grid. We prove that
the lowest depth version of the Quantum Approximate Optimization Algorithm will achieve
an approximation ratio of at least 0.5293 on all large enough instances which beats random
guessing (0.5). We open up the algorithm to have different parameters for each single qubit X
rotation and for each ZZ interaction associated with the nearest neighbor interactions on the
grid. Small numerical experiments indicate that an enveloping classical algorithm can be used
to find the parameters which sit on the grid to optimize an objective function with a different
connectivity. We discuss strategies for finding good parameters but offer no evidence yet that
the proposed approach can beat the best classical algorithms. Ultimately the strength of this
approach will be determined by running on actual hardware.
1 Introduction
We are entering an era when various experimental groups are building gate model quantum comput-
ers. In the near future these devices will have tens of qubits and soon many more [1, 2, 3, 4, 5, 6, 7, 8]
A characteristic of hardware is that the qubits are positioned in space and the allowed direct two
qubit gates are constrained by the architecture. [1, 9, 10] However quantum algorithms are often
written as a sequence of one and two qubit gates where the two qubit gates are between any pair of
qubits. The algorithm designer can work in an idealized setting where all qubits are coupled to all
others. To be run on hardware, the idealized quantum code must be compiled so that a two qubit
1
gate between qubits which are physically separated is written as a sequence of gates involving a
path through the hardware that comprises only two qubit gates that are physically coupled. This
means that the depth of the compiled circuit can be much greater than the depth of the idealized
circuit. With limited coherence times this may be a drawback.
Recently there has been interest in using quantum computers for approximate combinatorial
optimization [11, 12] and for simulating quantum systems [13, 14, 15, 16]. In the first case the
quantum computer is used to produce a quantum state that is dominated by computational basis
states with a high value of some classical objective function. We can think of the objective function
C as a sum over individual terms:
X
C(z) = Cα (z) (1)
α
where each Cα acts on a small subset of the bits and has the value 1 for certain assignments of those
bits and the value 0 on the other assignments. In the second case the quantum computer is used to
produce a quantum state whose energy is near the ground state energy of a given Hamiltonian H.
In both cases the ingredients are an initial state, |initiali, and a sequence of unitary transformations
that act on the initial state to produce a quantum state that depends on the parameters that define
the sequence of unitaries. With L unitaries we can write
where θ denotes the collection θ1 , . . . , θL and each Ua depends on set of parameters θa . The goal
is then to choose the parameters, θ, so that the expected value of the objective function C
2
Pseudo Code for a Generic Variational Quantum Algorithm
Applied to Combinatorial Optimization
Input: A classical objective function C on n bit strings that can efficiently be evaluated on any
input string z
Resources:
1. An n qubit quantum computer that can produce states of the form (2)
2. A classical optimization routine that takes as input a sequence of parameters θ and associated
values fL (θ) and outputs a new value of θ
Goal: A string z that has a “large” value of C(z)
Procedure:
Pick a repetition number R
Pick a stopping criterion which may depend on the quality of the objective function value or
on the number of times the quantum computer has been called
Pick an initial state |initiali
Pick one of the n! ways of assigning the n bits associated with the objective function to the n
qubits on the hardware
Pick an initial set of parameters θ
while the stopping criterion is not satisfied
for R times
Run the Quantum Computer and make the state |θi
Measure in the computational basis: this produces a string z
Evaluate C(z)
end for
Average the R values of C to get an estimate fbL (θ) of fL (θ)
Use the previous values of θ and fbL (θ) to choose a new value of θ
end while
Output: The string z seen during the Procedure with the highest value of C(z)
unitaries that are given to us by the hardware. The hardware architecture determines which qubit
pairs can be acted on by the two qubit unitaries. And also the form of the two qubit unitaries should
be dictated by hardware limitations. However for most experimental implementations the range of
possible two qubit unitaries is large and typically any one qubit unitary can be implemented. Note
that when writing the objective function there is an implicit ordering of the qubits. The qubits
on the hardware can also be labelled from 1 to n, the number of qubits. We have the freedom to
decide how to match these two sets and we will attempt to make judicious choices.
Imagine that we have an n qubit quantum computer and can make the states of the form (2)
where we have applied L unitaries depending on the parameters θ. Here we give pseudo code
for how to run the quantum computer attempting to find a large value of C. The algorithm will
generate a sequence of parameters θ and associated estimates of fL (θ). We require an enveloping
classical algorithm that given such a sequence will pick new values of θ that tend to move fL (θ) to
higher values. For now assume that such an algorithm is at hand. It might involve gradient ascent
or some gradient independent method (see for example [19]).
3
array of qubits
1-qubit gates
2-qubit gates
measurement
circuit depth
Figure 1: Space-time volume of a quantum circuit computation. An array of qubits, here arranged
on a 7 by 7 grid, is initialized by preparing the quantum state of each qubit. Then a layer of 1-qubit
gates and a layer of 2-qubit gates act on the qubits. In our case we apply 1-qubit gates of the form
exp(−iβj Xj ) followed by a round of 2-qubit gates of the form exp(iγjk Zj Zk ). Note that due to
hardware restrictions it may be necessary to apply the 2-qubit gates in several rounds as depicted,
as it may not be possible for one qubit to participate in two gates simultaneously. For simplicity
the figure only shows one round of 1- and 2-qubit gates but many rounds can be applied. However
in pre-error corrected processors the circuit depth is limited by the imperfect fidelity of the gates.
The final step in the circuit consists of measuring the qubits.
4
H, it is natural to expect that a quantum computer will be more efficient. This is the situation in
quantum simulation and one can imagine the application of variational quantum circuits to chem-
istry, materials research, field theory or nuclear physics. In the case that we work with classical
data, for instance when we want to solve an optimization problem or prepare a probability dis-
tribution it is less obvious where the advantage of a quantum neural network would come. Our
hope is that the presence of entangling gates enables interference along the computational path
which allows the preparation of desirable output states with shallower circuits or less training effort
than would be possible with just classical gates. As work on quantum supremacy has shown we do
know that quantum circuits can prepare probability distributions that are out of reach for classical
machines [20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31]. Even though many approaches to quantum
machine learning have been proposed [32] quantum neural networks as discussed here are in a way
minimal in that they directly optimize the native gate library of a quantum processor.
which depends on the single parameter γ. Here we are discussing the original version of the QAOA
so the objective function appearing in U corresponds to the 3-regular graph under consideration.
There is also the operator
U (B, β) = exp(−iβB) (8)
where X
B= Xi and Xi = σxi , (9)
i
and the sum is over all qubits. The quantum computer is run to produce the state
5
where
1 X
|si = √ |zi . (11)
2n z
We include the C label in the ket to remind us that the state has an explicit dependence on the
objective function C.
The goal is to make
hγ, β, C| C |γ, β, Ci (12)
large. It was shown that there is a choice of γ and β such that the approximation ratio for this
problem is at least 0.6924 for all instances of 3-regular MaxCut. The approximation ratio is the
value of (12) divided by the maximum of C over all strings. The significance of this result is that
it holds for all instances and it beats random guessing which gives an approximation ratio of 0.5.
(Random guessing may seem like a low bar but until the introduction of the Goemans-Williamson
algorithm [33], random guessing was the only algorithm with a provable worst case performance
guarantee.) There are classical algorithms that give better performance guarantees than this quan-
tum algorithm. However this result is for the shallowest version of the quantum algorithm and
performance improves with circuit depth so perhaps with sufficient depth the quantum algorithm
will outperform the best classical.
In the discussion of the previous paragraph the objective function C appears in (12) both as
the operator being evaluated and through its use in building the state. Now we assume that the
hardware gives us a graph, to be specific a square grid, and we want the state to be constructed
using the same clauses living on the grid. Let
X
1
G(z) = 2 (1 − zi zj ) (13)
hijiin the grid
which makes a contribution to the expected value of the objective function C in (15). There are 4
cases to consider.
6
j j
or i j
i i
Figure 2.0: An example where Figure 2.1: Qubits i and j are one edge apart
qubits i and j are more than 2 edges apart.
j j
i or
i i
Figure 2.2: An example where qubits i and j Figure 2.3: Qubits i and j are
are one vertical and one horizontal edge apart two horizontal or two vertical edges apart
0. i and j are separated on the grid by more than two edges. As an example see Figure 2.0. In
this case the expected value of (16) is 0 which corresponds to a clause value of 0.5 as can be
seen from (5). The calculations for this case and the next three are in appendix A.
1. i and j are separated on the grid by one vertical edge or by one horizontal edges. See Figure
2.1. In this case the expected value in (16) is
F1 = 1
2 sin 4β sin γ cos3 γ. (17)
2. i and j are separated on the grid by one vertical edge and one horizontal edge. See Figure
2.2. Now the expected value in (12) is
3. i and j are separated on the grid by two vertical edges or two horizontal edge. See Figure
2.3. The expected value in (16) is
Suppose the number of edges from the 3-regular graph which end up in case 1 is m1 and similarly
define m2 and m3 . Summing (16) over all edges gives
m1 F1 + m2 F2 + m3 F3 . (20)
7
We want to choose γ and β to make this as positive as possible. Just using calculus we see that
the maximum occurs at
1 −1 m1
β = tan √ and γ = π/6 (21)
4 m0 λ
Note that for any m1 > 0 this is greater than 0. Now the expected value of the objective function
C in the state |γ, β, Gi is q
m 1 2 0 2 2 0
+ m1 λ + (m ) λ − m λ , (23)
2 2
where m is number of edges in the 3-regular graph, 3n/2. For the approximation ratio the denom-
inator is upper bounded by the number of clauses, m, so the approximation ratio is at least
q
1 1 2 0 2 2 0
+ m1 λ + (m ) λ − m λ . (24)
2 2m
To bound the approximation ratio away from 1/2 as n gets big we need m1 to scale with m. Assume
that the ordering of the vertices on the grid is fixed. We now need to label the vertices on the
3-regular graph so that the number of edges that end up being type 1 scales with n. We can achieve
this with a simple greedy approach.
The algorithm we now give will produce a way of assigning the vertices of the 3-regular graph to
the grid such that the number of type 1 edges in the grid is at least n/2. Start by choosing distinct
vertices two at a time, v1 , v2 then v3 , v4 , such that there is an edge in the 3-regular graph between
the vertices in each pair. At some point, after k pairs have been chosen no further connected pairs
can be found among {v2k+1 , ...vn }. (Possibly k = n/2 if there is a perfect matching and the process
finds it.) Thus the remaining (n − 2k) vertices each have 3 edges terminating in {v1 , . . . v2k }. Now
pick two vertices vi and vj with i ≤ 2k and j > 2k such that there is an edge between vi and vj .
After vi and vj are paired they are not considered for further pairing. Of the 3(n − 2k) edges shown
on the right in figure 3 at most 4 are no longer available once vi is paired with vj . (In addition to
the edge between vi and vj there are two more edges starting at vj and there may be one additional
edge between some vl , where l > 2k, and vi .) So this pairing can be carried out at least 3(n − 2k)/4
times. Adding the k pairs previously found we have
pairs which since k ≤ n/2 is at least n/2. We have identified a subgraph of all n vertices and at least
n/2 edges of the the original 3-regular graph. This subgraph consists of disconnected segments of
2, 3 or 4 vertices and isolated vertices. This subgraph can easily be superimposed (in many ways)
√ √
on a n by n grid. A simple way to embed the segments is to start in the upper left hand corner
of the grid and start placing the segments horizontally. When you reach the right border of the
grid, if your current segment does not fit on the horizontal row, make a bend so that it ends up on
the horizontal row one below the top. Now continue moving to the left and complete the zigzag
8
v1
v2k+1
v2
v3
v2k+2
v4
v5
v4
vn
v2k-1
v2k
Figure 3: A representation of the greedy algorithm used to achieve a pairing of the bits
pattern. All the segments can be fit on the grid in this way. This guarantees that we can pick m1
to be at least n/2 = m/3. (We don’t claim this is best possible.)
We now know that we can, with minimal computational cost, assign the vertices of the 3-regular
graph to the vertices of the grid such that m1 ≥ m/3. However the approximation ratio in (24)
decreases with m0 . This is because the contributions from type 2 and type 3 pairs (see (18) and
(19)) come with negative signs. So it might pay to arrange the assignment of the vertices on the
grid to minimize the number of type 2 and type 3 pairs. Even without this extra effort we know
that m0 ≤ 2m/3 since m1 + m2 + m3 ≤ m. Picking m1 = m/3 and m0 = 2m/3 we have that
√
the approximation ratio in (24) is at least 0.52938... The error made at the border is of order n
√
in (23) and 1/ n in (24) so for n large enough we have that the approximation ratio is at least
0.5293 for any 3-regular graph using the grid for the unitary that builds the quantum state as long
as we assign the vertices as just described. This discussion is for the lowest depth version of the
algorithm using only two angles β and γ.
Here is a small numerical example. We generated a 20 bit 3-regular graph and ran the original
QAOA at p = 1 to produce the state (10). Optimizing the two angles γ and β gave an approximation
ratio of 0.7519. We then switched to a 4 by 5 grid and assigned the vertices of the graph to the
grid using the procedure just given which produced 11 pairs. Optimizing the two angles in (15)
gave an approximation ratio of 0.6424. As expected the mismatch between the grid and the graph
lessened performance.
What we showed sets a baseline for the idea that we can use a unitary that lives on the grid to
9
solve a problem on an unrelated graph. Admittedly we lined the two up somewhat at a marginal
computation cost. This step was necessary because at the shallowest depth version of the algorithm,
qubits that are separated by more than two edges on the grid do not sense each other. See case
√
0 above. However as the depth increases to n, all qubits are entangled with each other and this
type of preprocessing may not be crucial.
so that each qubit has its own parameter βj which we can combine into β = (β1 , . . . βn ) and write
Y
US (β) = US (βj ) (27)
j
for the application of all the single qubit rotations at once. For each pair of qubits j and k connected
on the grid we introduce a parameter γjk and then
where γ has all of the individual γjk ’s combined. Now for a level-p circuit we introduce a set of p
β’s, β (1) , . . . β (p) and a set of p γ’s, γ (1) , . . . γ (p) and the state
E
(1)
γ , . . . γ (p) , β (1) , . . . β (p) , G = US (β (p) )UP (γ (p) ) · · · US (β (1) )UP (γ (1) ) |si . (30)
The goal now is to find the parameters γ (1) , . . . γ (p) , β (1) , . . . β (p) for which the expected value of
some objective function C is large. Since Xj and Zj Zk have eigenvalues ±1, it suffices to let every
γ and every β lie between 0 and π.
10
where w̄ is the bit flip of w. This means that we are forced into producing states that are quantum
entangled. We view this as a virtue since we want to stay away from purely classical computation.
We start by making the usual cat state
whose action is
U12 |0+i = |00i (34)
U12 |1+i = |11i
|0i+|1i
where as usual |+i = √
2
. This gives us
Now append a third qubit in the |+i state to the cat state in (35) and act with U23 on the second
and third qubits to get a 3 qubit cat state:
Using the grid states (30) we search for angles that maximize Cquad . The idea here is that the ob-
jective function is fairly simple and perhaps finding angles on the grid that maximize this objective
function is not hard. Again the unitary that produces the state is not associated with the objective
function explicitly.
11
√
previous section.) We chose p to be n which is 4. The 4 by 4 grid has 24 edges so at each level
there are 24 γ’s and 16 β’s for a total of 160 parameters. To run the simulation we needed a classical
optimizer to drive up the expected value of C in the the state (30). We used a home made Nelder
Mead routine [34] running Matlab on a laptop. For the initial simplex the routine tosses 161 points
in the [0, π]160 cube. The best of ten runs achieved an approximation ratio of 0.9399. We contrast
this with using an opened up set of parameters for the QAOA with the unitaries associated with
the edges of the 3-regular graph. Coincidentally a 3-regular graph with 16 bits has 24 edges so this
p = 4 search was also over 160 parameters. The best of ten gave an approximation ratio of 0.9534
which shows that the degradation in moving from the original graph to the grid was small, at least
in this example.
The simulation took advantage of the fact that with a classical computer simulating a quantum
computer, after building the state (30), we can simply calculate the expected value of C without
performing measurements. In the algorithm given as pseudo code, running on a quantum com-
puter there would be R repetitions of the building of the state (30) with fixed angles followed by
measurements producing R strings z and values C(z). Say R is 100 to get good statistics. We
tracked the probability of finding a string z which has C(z) equal to the global maximum of C.
We found that this probability went above 1/100 long before the expected value of C achieved the
stopping criterion. This will only be of potential asymptotic computation value if the standard
deviation of C in the state (30) scales with n. We know that with p fixed (that is not growing with
√ √
n) for 3-regular MaxCut the standard deviation grows only as n. However with p of order n
the argument in [11] no longer applies.
12
the 4 by 4 grid. Starting our Nelder Mead outer loop routine requires 161 sets of 160 angles. For
one of these we took the set of angles just found so that one of the starting sets of angles produces
a state with an expected value of C very close to 17. The Nelder Mead optimizer found angles that
produced a state with objective function value very near 20, that is, with an approximation ratio
very near 1. This is better performance than we typically observed with 161 random inputs to the
Nelder Mead optimizer. The takeaway message here is that the simulation moved beyond the value
of the warm start and did not get stuck in a local minimum.
For a combinatorial search problem such as MaxCut there are classical approximation algorithms
with worst case performance guarantees. For example without any restriction on the form of the
graph the Goemans Williamson algorithm guarantees an approximation ratio of 0.878. Improving
this to 0.878 + for any > 0 would imply that the Unique Games Conjecture is false [35]. This
offers a complexity theoretic context for perhaps understanding why no improvement on GW has
been found to date. However the quantum domain is less explored than the classical and it seems
worthwhile to try to run quantum computers at this type of boundary. We can start the quantum
algorithm at parameter values that give the GW ratio and hope to go uphill. The improvement in
the value of the cost value must scale with n to be significant. Of course with a fixed number of
qubits it will never be possible to prove an asymptotic result. Still we can run quantum computers
in this region and see what happens.
13
4.1 More or Fewer Parameters
We began our discussion of working on a fixed qubit architecture by looking at the problem MaxCut.
In this case the objective function operator is a sum of ZZ terms. In the original version of the
QAOA this operator appears in the unitaries that drive the evolution as well as single qubit X
rotations. We imported these types of operators to the grid architecture. At first we looked at
the lowest depth version of the algorithm and restricted all of the ZZ terms to have the same
parameter and each X rotation to be by the same angle. So we began with two parameters. We
then advocated opening up the search space to include different angles for each ZZ coupling and
for each X rotation. Moving to higher depth, these parameters can be different each time the
corresponding operator is applied. We can go further still and imagine any single qubit unitary
(3 parameters) and any two qubit unitary (15 parameters) being applied as long as the unitaries
are between qubits that are connected by the architecture and within the range of experimental
capabilities.
Opening up the search space has the advantage that it may allow us to access more favorable
computational pathways. But it has the disadvantage of possibly making discovering these advan-
tageous pathways more difficult. For example suppose we introduce single Y rotations. Acting with
the gate exp(±iY π4 ) on a single |+i state gives the computational basis state |0i or |1i depending
on the sign. This means that if we just use a depth one algorithm with only these gates acting
on the state |si we can produce any computational basis state. With these gates the solution to
any NP-hard problem is in the domain of a depth one circuit with n parameters. But this pro-
cedure uses only product states and is essentially equivalent to replacing bits by classical rotating
spins and presumably requires exponential time in worst case. So introducing these gates may not
facilitate finding good parameters in an opened up search space. Our hope, and it is really only
a hope, is that some form of restricted entangling quantum procedure will allow us to find good
approximate solutions to combinatorial search problems that evade our best classical algorithms.
Future algorithm designers will have to decide whether to use more or fewer parameters as part of
their search strategies.
14
evaluated by directly measuring in the computational basis and then classically computing C(z).
Note that the evaluation of C(z) on a conventional computer allows for greater freedom in the set of
objective functions than those that are tied to the connectivity of the hardware. As the algorithm
proceeds the quantum computer may produce a string with a high value of the objective function
and terminate early. It is not clear how this might carry over to the Hamiltonian case.
4.3 Error
There are two sources of error which need to be discussed. First there is control error where the
actual applied gates do not correspond perfectly to the desired gates. If this is due to systematic
error, the optimization strategy outlined above will not be sensitive to this kind of error. If you
are walking on terrain and want to go uphill it is not important to know your coordinates as long
as you can sense which way is up. However if the control error is different each time the quantum
computer is called, this may be more problematic. There can also be decoherence which leads to
degradation of the fidelity between the actual output state and the desired state. If the fidelity is
high we can be sure that the output objective function value will be close to the ideal. But it is
possible even with low fidelity that the output objective function value will be reliable (and it is
easily checked). It is also possible that the outer loop optimizer will avoid computational pathways
where performance is degraded due to decoherence. It may be possible to study the effects of error
by using simple objective functions and setting parameters to create states with calculable objective
function values and then comparing with experimental observation.
4.4 Numerics
Some of our numerical work was at 16 qubits working on a 4 by 4 grid at p = 4 building quantum
states that depend on 160 parameters. The enveloping classical outer loop was called thousands
of times to create a quantum state with a high value of the objective function. Running on an
actual quantum computer would require many measurements to produce good estimates of the
expected value of the objective function so the thousands of simulation calls might correspond
to hundreds of thousands of calls to a quantum computer. (This is an overestimate if an early
measurement produces a string with a sufficiently high value of the objective function.) And this
is all to find the maximum of a function that depends on 16 bits where brute force search over
216 inputs is almost instantaneous on a classical computer. So our numerics only demonstrate that
these strategies can work, not that they are better than classical. As we go to higher bit number
we presume that classical methods become unwieldly for computationally difficult problems. Still
we remain optimistic that the quantum neural network will have pathways towards good solutions
through routes unavailable to classical algorithms. Perhaps these can be used to solve problems
more efficiently than classical algorithms. Perhaps we will see signs of this running actual devices
at scores of qubits.
15
with the angles initially chosen to guarantee classical performance thresholds. Or we might wish to
start with restrictions on the search space to make the search more efficient and then use the result
of the restricted search to seed an opened up search for example as performed in coarse-to-fine
strategies. Another idea is to restrict a subset of the gates such that they correspond to known
quantum subroutines such as phase estimation [36] or amplitude amplification [37]. The success
of our proposed approach relies on having an outer loop classical optimizer tailored to generating
suitable gate parameters. An attractive choice to explore is deep reinforcement learning which
recently achieved impressive results [38, 39]. Of course ultimate success would be to find strategies
that guarantee certain performance levels in advance of running on the quantum computer. But as
actual devices become available we should run them to help discover these strategies or to give us
heuristics that convince us that quantum computers have power beyond classical.
5 Acknowledgements
E.F. and H.N. would like to thank Ryan Babbush, Aram Harrow, Sergio Boixo for stimulating
discussions throughout this project. We thank Dave Bacon, John Martinis and Masoud Mohseni
for comments on the manuscript and Charles Suggs for technical support. This work is supported
by the National Science Foundation under grant contract number CCF-1525130.
Appendix
Here we show how we evaluate (16) for the four cases pictured in figure 2. We can write (16) as
− 12 hs| exp (iγG) exp (iβB) Zi Zj exp (−iβB) exp (−iγG) |si . (A.1)
− 12 hs| exp (iγG) [Zi cos 2β + Yi sin 2β][Zj cos 2β + Yj sin 2β] exp(−iγG) |si . (A.2)
Inserting exp (−iγG) exp (iγG) in the middle we see that we need to evaluate
and the same thing with i replaced by j. G is a sum of products of pairs of Z operators and to
evaluate (A.3) we need only consider pairing of the form Zi Zk where hiki is an edge in the grid.
Note that k might be equal to j. Now
γ γ
exp −i Zi Zk Zi exp i Zi Zk = Zi
2 2
γ γ
exp −i Zi Zk Yi exp i Zi Zk = Yi cos γ − Xi Zk sin γ (A.4)
2 2
γ γ
exp −i Zi Zk Xi exp i Zi Zk = Xi cos γ + Yi Zk sin γ.
2 2
By repeated applications of (A.4) we can expand (A.3) out as well as (A.3) with i replaced by j.
We take the expectation of the product in the state |si. Since |si is an eigenstate of each X` with
16
eigenvalue 1 and hs| Y` |si = hs| Z` |si = 0 for each `, the only terms that survive are those which
are products of X` ’s with no unpaired Y` or Z` .
Consider case 0 which is depicted in Figure 2.0. Label the four vertices connected to i as 1,
2,3,4. In (A.3) the only relevant piece of G is − 21 [Zi Z1 + Zi Z2 + Zi Z3 + Zi Z4 ] which is a sum of
commuting terms. Consider the action of exp −i γ2 Zi Z1 using (A.4). Since hs| Z1 |si = 0 we need
only keep terms proportional to Zi and Yi . Repeating with 2, 3, and 4 we still only end up with
terms proportional to Zi and Yi , but the expected value of these terms in the state |si is 0. So we
see that for case 0, the expression (A.1) is 0.
We now turn to case 1. Vertex i has four neighbors, one of which is j and the other three are
not neighbors of j. Apply (A.4) three times to (A.3) with the three neighbors of i other than j.
This gives
Zi cos 2β + Yi sin 2β cos3 γ. (A.5)
since the expectation of Z` in |si is 0 for these three neighbors. Now apply Zi Zj using (A.4) to get
Taking the expectation in |si of the product of (A.6) and (A.7) gives
which is (17).
In Cases 2 and 3 there is no term Zi Zj in G but vertices i and j do have neighbors in common
(two in case 2 and one in case 3) and the non-zero contributions to the expectation come from
pairing the Zk from these common neighbors in the expressions of (A.4), which leads to (18) and
(19).
References
[1] R. Barends et al. “Superconducting quantum circuits at the surface code threshold for fault
tolerance”. Nature 508, 500-503 (2014).
[2] IBM. A new way of thinking: The IBM quantum experience. 2017. URL: http://www.
research.ibm.com/quantum.
[3] A. D. Córcoles et al. “Demonstration of a quantum error detection code using a square lattice
of four superconducting qubits”. Nature Communications 6, 6979 (2015).
[4] S. Debnath et al. “Demonstration of a small programmable quantum computer with atomic
qubits”. Nature 536, 63-66 (2016).
17
[5] D. Ristè et al. “Detecting bit-flip errors in a logical qubit using stabilizer measurements”.
Nature Communications 6, 6983 (2015).
[6] N. Ofek et al. “Extending the lifetime of a quantum bit with error correction in superconducting
circuits”. Nature 536, 441-445 (2016).
[7] T. Monz et al. “Realization of a scalable Shor algorithm”. Science 351, 1068-1070 (2016).
[8] F. Yan et al. “The flux qubit revisited to enhance coherence and reproducibility”. Nature
Communications 7, 12964 (2016).
[9] E. A. Sete, W. J. Zeng, and C. T. Rigetti. “A functional architecture for scalable quantum
computing”. 2016 IEEE International Conference on Rebooting Computing (ICRC). 2016, pp.
1-6.
[10] David Aasen et al. “Milestones Toward Majorana-Based Quantum Computing”. Phys. Rev. X
6, 031016 (2016).
[14] J. R. McClean et al. “The theory of variational hybrid quantum-classical algorithms”. New J.
Phys. 18, 023023 (2016).
[15] J. Romero et al. “Strategies for quantum computing molecular energies using the unitary
coupled cluster ansatz” (Jan. 2017). arXiv:1701.02691 [quant-ph].
[16] D. Wecker, M. B. Hastings, and M. Troyer. “Progress towards practical quantum variational
algorithms”. Phys. Rev. A 92, 042303 (Oct. 2015).
[17] L. K. Grover. “A fast quantum mechanical algorithm for database search” (May 1996).
arXiv:quant-ph/9605043.
[19] G. Giacomo Guerreschi and M. Smelyanskiy. “Practical optimization for hybrid quantum-
classical algorithms” (Jan. 2017). arXiv:1701.01450 [quant-ph].
[20] S. Aaronson. “Quantum Lower Bound for Recursive Fourier Sampling”. QIC 3, 165–174 (Mar.
2003).
[21] B. M. Terhal and D. P. DiVincenzo. “Adaptive Quantum Computation, Constant Depth Quan-
tum Circuits and Arthur-Merlin Games”. QIC 4, 134 (May 2004).
18
[22] S. Aaronson. “Quantum computing, postselection, and probabilistic polynomial-time”. Proc.
Roy. Soc. London A Ser A 461, 3473–3482 (2005).
[23] R.J. Bremner and D.J. Shepard. Proc. Roy. Soc. London A Ser A 467, 459 (2011).
[24] S. Aaronson and A. Arkhipov. “The Computational Complexity of Linear Optics”. Proceedings
of the Forty-third Annual ACM Symposium on Theory of Computing, STOC ’11. San Jose,
California, USA. ACM, pages 333–342.
[25] K. Fujii and T. Morimae. “Quantum Commuting Circuits and Complexity of Ising Partition
Functions” (Nov. 2013). arXiv:1311.2128 [quant-ph].
[28] R. Jozsa and M. Van Den Nest. QIC 14, 633 (2014).
[30] E. Farhi and A. W Harrow. “Quantum Supremacy through the Quantum Approximate Opti-
mization Algorithm” (2016). arxiv:1602.07674 (quant-ph).
[31] S. Boixo et. al. “Characterizing Quantum Supremacy in Near-Term Devices” (2016).
arXiv:1608.00263 [quant-ph].
[32] Steiger, Mohseni, Wiebe, and Troyer. “Review of quantum machine learning”. To appear soon
(2017).
[33] Michel X. Goemans and David P. Williamson. “Improved approximation algorithms for maxi-
mum cut and satisfiability problems using semidefinite programming”. J. ACM 42, 1115–1145
(1995).
[34] J. A. Nelder and R. Mead. “A Simplex Method for Function Minimization”. The Computer
Journal 7, 308 (1965).
[35] S. Khot, G. Kindler, E. Mossel, and R. ODonnell. “Optimal Inapproximability Results for
MAX-CUT and Other 2-Variable CSPs?” SIAM 37, 319-357 (2007). Eprint: http://dx.doi.
org/10.1137/S0097539705447372.
[36] K. M. Svore, M. B. Hastings, and M. Freedman. “Faster Phase Estimation”. ArXiv e-prints,
(2013). arXiv:1304.0741 [quant-ph]
[37] G. Brassard et al. “Quantum Amplitude Amplification and Estimation” (2000). arXiv:quant-
ph/0005055. 2000.
[38] D. Silver et al. “Mastering the game of Go with deep neural networks and tree search”. Nature,
529, 484489 (Jan. 2016). URL: http://dx.doi.org/10.1038/nature16961.
19
[39] V. Mnih et al. “Human-level control through deep reinforcement learning”. Nature 518, 529
533 (2015).
20