A Quantum-Classical Collaborative Training Architecture Based On Quantum State Fidelity
A Quantum-Classical Collaborative Training Architecture Based On Quantum State Fidelity
This is the author's version which has not been fully edited and
content may change prior to final publication. Citation information: DOI 10.1109/TQE.2024.3367234
A Quantum-Classical Collaborative
Training Architecture Based on Quantum
State Fidelity
RYAN L’ABBATE1 , ANTHONY D’ONOFRIO JR.1 , SAMUEL STEIN2 , SAMUEL YEN-CHI
CHEN3 , ANG LI2 , PIN-YU CHEN4 , JUNTAO CHEN1 , YING MAO1
1
Computer and Information Science Department, Fordham University, New York, USA. E-mail: {rlabbate, adonofrio10, jchen504, ymao41}@fordham.edu
2
Pacific Northwest National Laboratory, Richland, WA, USA. Email: {samuel.stein, ang.li}@pnnl.gov
3
Brookhaven National Laboratory, Upton, NY, USA. Email:{[email protected]}
4
IBM Research, Yorktown Heights, NY, USA. Email:{[email protected]}
Corresponding author: Ying Mao (e-mail: [email protected]).
This research was supported in part by the National Science Foundation (NSF) under grant agreements 2329020, 2335788, and 2301884. It
was also supported in part by the U.S. Department of Energy (DOE) through the Office of Advanced Scientific Computing Research’s
“Advanced Memory to Support Artificial Intelligence for Science”. PNNL is operated by Battelle for the DOE under Contract
DE-AC05-76RL01830.
ABSTRACT Recent advancements have highlighted the limitations of current quantum systems, particu-
larly the restricted number of qubits available on near-term quantum devices. This constraint greatly inhibits
the range of applications that can leverage quantum computers. Moreover, as the available qubits increase,
the computational complexity grows exponentially, posing additional challenges. Consequently, there is
an urgent need to use qubits efficiently and mitigate both present limitations and future complexities.
To address this, existing quantum applications attempt to integrate classical and quantum systems in a
hybrid framework. In this study, we concentrate on quantum deep learning and introduce a collaborative
classical-quantum architecture called co-TenQu. The classical component employs a tensor network for
compression and feature extraction, enabling higher-dimensional data to be encoded onto logical quantum
circuits with limited qubits. On the quantum side, we propose a quantum-state-fidelity-based evaluation
function to iteratively train the network through a feedback loop between the two sides. co-TenQu has
been implemented and evaluated with both simulators and the IBM-Q platform. Compared to state-of-
the-art approaches, co-TenQu enhances a classical deep neural network by up to 41.72% in a fair setting.
Additionally, it outperforms other quantum-based methods by up to 1.9 times and achieves similar accuracy
while utilizing 70.59% fewer qubits.
INDEX TERMS Quantum Deep Learning, Quantum-Classical Hybrid Systems, Collaborative Training
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. For more information, see https://creativecommons.org/licenses/by-nc-nd/4
This article has been accepted for publication in IEEE Transactions on Quantum Engineering. This is the author's version which has not been fully edited and
content may change prior to final publication. Citation information: DOI 10.1109/TQE.2024.3367234
L’Abbate et al.: A Quantum-Classical Collaborative Training Architecture Based on Quantum State Fidelity
substantially greater representational power compared to an sical tensor network (TN) into the feature extraction stage
equivalent number of classical bits. In 1998, the first quantum to facilitate dimensionality reduction. Specifically, the TN
computer capable of executing computations was developed serves as a trainable module designed to capture high-level
[2]. The IBM-Q Experience was introduced in 2016, granting abstractions of the input data, the output of which is sub-
developers access to state-of-the-art quantum resources [3]. sequently fed into a variational quantum circuit (VQC) for
In 2020, Google AI demonstrated that a 53-qubit quantum classification purposes. Furthermore, we employ a quantum-
computer could complete a task in 200 seconds that would state-fidelity based cost function to train the model directly
require a classical computer more than 10,000 years. This on qubits’ states. Our proposed solution presents signifi-
advantage of quantum computing over classical computing is cant advantages over existing techniques, such as Principal
frequently referred to as "quantum supremacy" [4]. Component Analysis (PCA), which lacks trainability, and
Researchers inspired by the concept of quantum conventional neural networks that require a considerable
supremacy are actively exploring methods to convert clas- number of parameters to be optimized or pre-trained. The
sical algorithms into their quantum versions, aiming to integration of our hybrid system enables more efficient data
achieve significant reductions in time complexity compared encoding, thereby enhancing the overall performance of the
to classical counterparts. Quantum speed-ups have already quantum machine learning pipeline. The main contributions
been demonstrated for Shor’s algorithm [5] , which addresses are summarized as follows.
prime factorization and discrete logarithms, and Grover’s
• We propose co-TenQu, a quantum-classical collabo-
algorithm, which tackles database searches [6]. Quantum
rative training architecture. On the classical part, it
computing can be applied to machine learning tasks by
employs tensor network layers for data pre-processing
employing variational quantum circuits—quantum circuits
and preparation. In the quantum part, it utilizes a pre-
with trainable parameters. Specific areas within classical
processed dataset to build circuits with fewer qubits to
learning, such as Deep Learning and Support Vector Ma-
reduce the overall qubit requirement and noise interfer-
chines, could potentially benefit from quantum computing
ence.
[7], [8]. Quantum speed-ups have been achieved for several
• We introduce a quantum state fidelity based cost func-
algorithms, including expectation maximization solving [9]
tion. Instead of converting back to classical states,
(where the algorithm’s speed has been increased to sub-
co-TenQu train the model directly on quantum states
linear time [10]), Support Vector Machines [11], and natural
aiming at accelerating the training process and improv-
language processing [12].
ing performance.
However, in the noisy intermediate-scale quantum (NISQ)
• We implement co-TenQu with popular quantum toolk-
era, the qubits are both limited in number and subject to
its, e.g., Qiskit and PennyLane, and compare it with
noise. For instance, IBM-Q provides only 5-7 qubit machines
state-of-the-art solutions in the literature, by up to 1.9x
to the public. Furthermore, as the qubit count increases, the
and 70.59% less quantum resources. Additionally, we
computational complexity of the system grows exponentially
conduct proof-of-concept experiments on 14 different
[13], which leads to a higher overall noise level in a quan-
IBM-Q quantum machines.
tum machine. In the context of deep learning, an increased
number of qubits may employ a greater number of gates,
potentially augmenting circuit depth and noise interference. II. RELATED WORK
Consequently, it is crucial to efficiently and reliably utilize Recent developments [19]–[23] in quantum computing
the representational power of qubits through effective en- show great potential to enhance current learning algorithms
coding, making quantum algorithms more feasible on both through utilization of the qubit, the unit of quantum infor-
current and NISQ quantum computers, while mitigating the mation. In this field, quantum neural networks (QNN) have
surge in computational complexity as the number of qubits emerged as a promising research area in quantum machine
increases. A potential solution to data encoding challenges learning [24], [25]. Due to the limited quantum resources
involves performing classical pre-processing of the data for available, most of the existing works focused on numerical
compression and/or feature extraction. One prevalent method analysis or datasets with lower dimensionalities [17], [26],
for dimension reduction is Principal Component Analysis [27], such as MNIST [28].
(PCA), as demonstrated in prior works [14]–[18]. However, Farhi et al. [29] introduced a QNN for binary classifica-
PCA may not possess the representational power necessary to tion, which utilizes quantum entanglement to enhance the
compress data accurately. More sophisticated methods, such model’s computational power. In addition, quantum circuit
as employing neural network layers, demand substantial pre- learning [30], [31] developed a quantum-classical hybrid
training and significantly increase the number of parameters algorithm. They employed an iterative optimization of the
requiring tuning. Therefore, there is a pressing need for parameters to circumvent the high-depth circuit. Moreover,
efficient data compression techniques tailored to quantum Stokes et al. [32] presented a novel method for gradient
machine learning. descent using quantum circuits, enabling the optimization
In this work, we introduce a novel classical-quantum col- of variational quantum circuits in a manner analogous to
laborative training architecture, which incorporates a clas- classical neural networks. However, these solutions focused
2 VOLUME 4, 2023
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. For more information, see https://creativecommons.org/licenses/by-nc-nd/4
This article has been accepted for publication in IEEE Transactions on Quantum Engineering. This is the author's version which has not been fully edited and
content may change prior to final publication. Citation information: DOI 10.1109/TQE.2024.3367234
L’Abbate et al.: A Quantum-Classical Collaborative Training Architecture Based on Quantum State Fidelity
on theoritical analysis and only numerical experiments were distribution for the circuit results. Calculations are performed
provided. by manipulating the probability distributions of qubits. 0 and
In NISQ era, QCNN [33] suggests a design for a quantum 1 can be represented in vector notation as seen in Equation 1.
convolutional neural network that uses O(log(N )) trainable Quantum systems are often described using ⟨bra| |ket⟩
parameters for N dimensional inputs and can be realized on notation, where ⟨bra| and |ket⟩ represent horizontal and ver-
near-term quantum computers. Additionally, QuCNN [34] tical quantum state vectors, respectively. Because a qubit is a
employs an entanglement based backpropagation for NISQ mixture of 0 and 1, qubit states are described mathematically
machines. Jiang et al. [35] proposed a co-design framework as a linear combination of |0⟩ and |1⟩ as seen in Equation 1
named QuantumFlow, which features quantum-friendly neu- and 2.
ral networks, a mapping tool to generate quantum circuits,
and an execution engine. However, QuantumFlow requires 1 0 α
|0⟩ = , |1⟩ = , |Ψ⟩ = (1)
local training of the network prior to mapping to quantum cir- 0 1 β
cuits, which leads to sensitivity to noise when implemented
on real quantum computers as opposed to simulations. |Ψ⟩ = α|0⟩ + β|1⟩ (2)
Expanding upon the use of quantum operations to perform
distance measurements, Stein et. al proposed the QuClassi This linear combination of qubit states is referred to as a
system: a hybrid quantum-classical system with a quantum- qubit’s statevector. |0⟩ and |1⟩ are orthonormal vectors in an
state-fidelity based loss function [14], [15]. QuClassi was eigenspace. In Equation 2, |Ψ⟩ represents the qubit state, a
able to provide improvements in accuracy compared to other probabilistic combination of |0⟩ and |1⟩.
contemporary quantum-based solutions such as TensorFlow The tensor product of qubit states can be used to describe
Quantum [36] and QuantumFlow. The QuClassi system the quantum states of multiple qubits. The tensor product
demonstrated success in both binary and multi-class clas- between the qubits shown in Equations 2 and 3 can be
sification. It used Principal Component Analysis (PCA) to described using Equation 4.
compress dataset classically. However, PCA fails to fully
utilize the classical resources by providing trainable layers. |Φ⟩ = γ|0⟩ + ω|1⟩ (3)
TN-VQC [37] proposed the use of tensor networks for feature
extraction and data compression to achieve higher classi-
fication accuracy for variational quantum circuits. Tensor |ΨΦ⟩ = |Ψ⟩ ⊗ |Φ⟩ = γα|00⟩ + ωα|01⟩ + γβ|10⟩ + ωβ|11⟩
networks do provide the advantage of having fewer pa- (4)
rameters compared to neural networks while still providing |0⟩ and |1⟩ represent opposite points of the sphere on
some trainability unlike PCA. TN-VQC employed a circuit the z axis. Measurements of qubit states can be taken with
architecture involving CNOT gates rather than CSWAP gates respect to any basis, but convention typically dictates that
like QuClassi. measurements are taken against the z-axis. However, the x-
This paper proposes co-TenQu, a hybrid quantum-classical axis, y-axis, or any pair of opposite points on the sphere could
architecture for deep neural networks. Comparing with exist- potentially be used as a basis of measurement. Quantum
ing literature, it utilizes a quantum-state fidelity based cost states are responsible for encoding data, and to perform op-
function to train the quantum section directly on qubits’ erations on quantum states quantum gates are used. Quantum
states. Additionally, tensor networks are employed to fully gates apply a transformation over a quantum state into some
exploit classical resources to compensate for the limitations new quantum state.
(e.g., low qubit count and noises) of quantum resources.
Through a collaborative training process, co-TenQu is able B. QUANTUM GATES
to outperform state-of-the-arts. Similar to classical data which is manipulated and encoded
using gates, quantum data is manipulated and encoded using
III. BACKGROUND quantum gates. Quantum gates can either perform a rotation
In this section, we present the background that is necessary about an axis or perform an operation on a qubit based on the
for designing our solution. value of another qubit. These are referred to as rotation gates
and controlled gates respectively.
A. QUANTUM COMPUTING BASICS
1) A Qubit and its superposition 1) Single-Qubit Gates
Classical computing uses bits that are binary in nature and A common type of single-qubit operations are the rotation
measure either 0 or 1. Quantum computing uses quantum gates. These gates perform qubit rotations by parameterized
bits or qubits. Qubits, unlike classical bits, are a probabilistic amounts. The generalized single-rotation gate R is shown in
mixture of 0 and 1. This mixture of 0 and 1 is known as a matrix form in Equation 5.
superposition. Upon measurement, the qubit in superposition
will collapse to either a value of 0 or 1. Quantum circuits are cos θ2 −ie−iϕ sin θ2
often run many times, using the results to get a probability R(θ, ϕ) = (5)
−ie−iϕ sin θ2 cos θ2
VOLUME 4, 2023 3
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. For more information, see https://creativecommons.org/licenses/by-nc-nd/4
This article has been accepted for publication in IEEE Transactions on Quantum Engineering. This is the author's version which has not been fully edited and
content may change prior to final publication. Citation information: DOI 10.1109/TQE.2024.3367234
L’Abbate et al.: A Quantum-Classical Collaborative Training Architecture Based on Quantum State Fidelity
Three commonly-used special cases of this gate are the 4) Controlled Gates
RX , RY , and RZ gates. These gates represent rotations in There are also two-qubit gates which utilize a control qubit
the x, y, and z plane and are expressed in Equations 6, 7, and and a target qubit. These gates, known as controlled gates,
8. RX and RY can be thought of as special cases of the R perform an operation on a target qubit depending on the value
gate in which ϕ = 0 and ϕ = π2 respectively. Therefore, of the control qubit.
RX (θ) is a rotation about the x-axis by angle θ and RY (θ) is CNOT Gate The CNOT gate is an example of a two-qubit
a rotation about the y-axis by angle θ. The derivation of RZ gate used in quantum computing. The CNOT gate flips the
from the general rotation gate is less straightforward and thus value of the target qubit if the control qubit is measured as
is not included here. 1 and does nothing otherwise. The CNOT gate can be seen
represented in matrix form below.
cos θ2 −i sin θ2
RX (θ) = = R(θ, 0) (6)
−i sin θ2 cos θ2
1 0 0 0
0 0 0 1
CN OT = (13)
cos θ2 − sin θ2
π 0 0 1 0
RY (θ) = = R(θ, ) (7)
sin θ2 cos θ2 2 0 1 0 0
" −iθ
# Fig.1 depicts the circuit notation for the CNOT gate. q0 is
e 2 0 the control qubit and q1 is the target qubit.
RZ (θ) = −iθ (8)
0 e 2
2) Hadamard Gate
A fundamental gate of quantum computation is the
Hadamard gate. It is a single-qubit gate puts a qubit into
superposition as described in Section III-A1. It can be ex- FIGURE 1: CNOT Gate Circuit Notation
pressed in matrix shown in equation 9. The √12 coefficient
is due to the fact that the sum of the squares of the state Controlled Rotation Gates Equations 14, 15 and 16 are con-
amplitudes must add to 1, so each state has a probability of 12 trolled rotation gates in matrix notation. Controlled rotation
and an amplitude of √12 . gates are similar to the CNOT gate but apply a rotation when
1
the control qubit measures 1 instead of flipping the state. This
1 1
H=√ (9) allows for variable levels of entanglement between qubits.
2 1 −1
1 0 0 0
3) Two-Qubit Gates 0 1 0 0
There are also operations that function as two-qubit rotations CRX (θ) = 0 0 cos θ θ
(14)
2 − sin 2
which perform an equal rotation on two qubits. These gates θ θ
0 0 − sin 2 cos 2
are described in Equations 10, 11, and 12. Note that these
gates are expressed as 4x4 matrices while the single-qubit
1 0 0 0
gates were 2x2 matrices. This is because for a two-qubit 0 1 0 0
gate, each individual qubit has two possible measurements, CRY (θ) = 0 0 cos θ − sin θ
(15)
2 2
yielding four possible results (|00⟩,|01⟩,|10⟩,|11⟩) rather than 0 0 sin θ2 cos θ2
two as seen previously for the single-qubit gates.
1 0 0 0
cos θ2 −i sin θ2
0 0 0 1 0 0
0 cos θ2 −i sin θ2 0 CRZ (θ) =
0 0 iθ (16)
RXX (θ) =
e2 0
0 −i sin θ2 cos θ2 0 iθ
0 0 0 e2
−i sin θ2 0 0 θ
cos 2
(10)
C. CONTROLLED SWAP GATE
Another type of controlled gate is the controlled SWAP
cos θ2 i sin θ2
0 0
0 cos θ2 −i sin θ2 0 gate. The SWAP gate measures the difference between two
RY Y (θ) = (11) quantum states and outputs the result to an ancilla qubit.
0 −i sin θ2 cos θ2 0
i sin θ2 0 0 θ
cos 2 Therefore, this gate is a three-qubit gate. The SWAP test
output values range from 0.5 to 1. Maximally different (or-
θ
e−i 2 0 0 0 thogonal) states will measure 1 with 50% probability while
θ
0 e−i 2 0 0
identical states will measure 1 with 100% probability. The
RZZ (θ) = (12)
−i θ SWAP test gate can be used to measure quantum state fidelity.
0 0 e 2 0
0 0 0 −i θ
e 2 The controlled swap gate is described in Equations 17 and 18.
4 VOLUME 4, 2023
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. For more information, see https://creativecommons.org/licenses/by-nc-nd/4
This article has been accepted for publication in IEEE Transactions on Quantum Engineering. This is the author's version which has not been fully edited and
content may change prior to final publication. Citation information: DOI 10.1109/TQE.2024.3367234
L’Abbate et al.: A Quantum-Classical Collaborative Training Architecture Based on Quantum State Fidelity
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. For more information, see https://creativecommons.org/licenses/by-nc-nd/4
This article has been accepted for publication in IEEE Transactions on Quantum Engineering. This is the author's version which has not been fully edited and
content may change prior to final publication. Citation information: DOI 10.1109/TQE.2024.3367234
L’Abbate et al.: A Quantum-Classical Collaborative Training Architecture Based on Quantum State Fidelity
data point, we apply the necessary rotations across d2 qubits, applied to the remaining n/2 qubits. Additionally, there is
with each rotation parameterized by the normalized value of one ancilla qubit used for swap test measurements.
that data point’s corresponding dimension. It is worth noting
that the encoding of 2-dimensional data onto a single qubit B. QUANTUM LAYERS
may pose challenges for extreme values of x. However, we Similar to classical artificial neural networks, quantum cir-
explore the dual dimensional encoding as a possible method cuits can also be thought of as having layers. For a quantum
of reducing high qubit counts and evaluate the performance circuit, these layers would be comprised of quantum gates.
when each dimension of data is encoded into one respective In co-TenQu, we define three quantum layer styles: single-
qubit solely through a RY Gate. This approach is validated qubit unitary, dual-qubit unitary, and controlled-qubit unitary.
by the fact that we never measure any of our qubits, but only Each of these layer styles comprises rotations that serve as
their quantum fidelity through the SWAP test. As a result, the trainable parameters in our quantum machine learning
we can bypass the superposition-collapsing issue inherent in model. Defining these three types of layers enables system
this approach.We encode the second dimension of data on the design at a higher level than individual gates.
same qubit through the following rotation: Single-Qubit Unitary A single-qubit unitary layer involves
√ single-qubit rotations around the y-axis and z-axis (RY and
RZ(θxi+1 ) = 2sin−1 ( xi ) (20) RZ ). This allows for total manipulation of a qubit’s quantum
When dealing with a limited number of qubits, methods state. A single-qubit unitary layer is depicted in Figure 4.
that can reduce the number required are highly valuable.
Unlike classical computers, which utilize formats such as
integers and floats, classical data encoding in quantum states
does not have a tried and tested method. Therefore, our
approach may be subject to criticism. Nevertheless, our ap- FIGURE 4: Single Qubit Unitary
proach has been tested and proven to be a viable solution
to the problem at hand. Additionally, having knowledge of Dual-Qubit Unitary A dual-qubit untary layer involves
both the qubit’s expectation across the Y and Z domains dual-qubit rotations around the y and z axis (RY Y and RZZ ).
enables the reconstruction of classical data. Various methods The same y rotation and z rotation are applied to both qubits
for classical-to-quantum data encoding exist, ranging from involved. A dual-qubit unitary layer is depicted in Figure 5.
encoding 2n classical data points across n qubits using state-
vector encoding to encoding classical data into a binary
|0⟩ |ψ⟩ |0⟩ |ψ⟩
representation on quantum states by translating a vector of bi- RYY(θ) RZZ(θ) CRY(θ) CRZ(θ)
nary values onto qubits. The former method is highly suscep- |0⟩ |φ⟩ |0⟩ |φ⟩
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. For more information, see https://creativecommons.org/licenses/by-nc-nd/4
This article has been accepted for publication in IEEE Transactions on Quantum Engineering. This is the author's version which has not been fully edited and
content may change prior to final publication. Citation information: DOI 10.1109/TQE.2024.3367234
L’Abbate et al.: A Quantum-Classical Collaborative Training Architecture Based on Quantum State Fidelity
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. For more information, see https://creativecommons.org/licenses/by-nc-nd/4
This article has been accepted for publication in IEEE Transactions on Quantum Engineering. This is the author's version which has not been fully edited and
content may change prior to final publication. Citation information: DOI 10.1109/TQE.2024.3367234
L’Abbate et al.: A Quantum-Classical Collaborative Training Architecture Based on Quantum State Fidelity
values are encoded for the rotations to keep inputs to the trained on the data set, X. Line 6 represents the input data, x,
quantum circuit in the range of [- π2 , π2 ]. After encoding, the being encoded into the tensor network. Line 7 represents the
circuit is run to get a quantum state fidelity measurement. output of the tensor network being obtained through tensor
This measurement is then mapped from [0.5,1] to [0,1] by contractions. Lines 8-23 represent the process by which each
subtracting 0.5 and multiplying by 2. The swap test may of the quantum parameters θ is updated. The output of the
sometimes measure below 0.5 due to statistical error, so a tensor network and the trainable quantum circuit parameters
ReLu layer is applied after the quantum circuit to prevent θd are all loaded into the quantum circuit with one of the pa-
negative outputs. For multi-class classification, the ReLu rameters (θ) either increased by π2 (∆f wd ) and the SWAP test
layer is not used due to the presence of the softmax layer. If is performed. Then the parameters are reset, θ is decreased
the output is below 0.5, the image is classified as 0, otherwise by π2 (∆bck ), and the SWAP test is performed again. The
the image is classified as label 1. The quantum circuit has overall cost function of the network, f (θd ), is then obtained
up to three types of layers: single-qubit unitary, dual-qubit for the two adjusted parameter values and used to update θ
unitary, and controlled-qubit unitary. as seen in Line 22. After all of the quantum parameters have
For binary classification, a single quantum circuit is run. been updated, the parameters of the Tensor Network layer
For n-class classification where n > 2, n quantum circuits are updated as seen in Line 24. The quantum neural network
with the same circuit design, but different parameters are run is induced across all trained classes and the quantum state
in parallel. The outputs of these circuits are then softmaxed to fidelity outputs are softmaxed. The class with the highest
get probabilities for each class. The image is classified as the probability is returned as the classification.
class with the highest probability. System diagrams for the Algorithm 1 presents a hybrid training process that in-
binary and multi-class versions of this system can be seen in volves both classical and quantum ends, e.g., data loading
Figures 8 and 9, respectively and tensor networks on the classical side; quantum layers
and measurements on the quantum side. The time and space
complexity analysis should consider both quantum and clas-
sical resources. Due to the page limit and scope, we omit the
theoretical algorithm analysis in this paper.
V. EVALUATION
We utilized Python 3.9 and the IBM Qiskit Quantum Com-
puting simulator package to implement co-TenQu . The cir-
cuits were trained on NSF Cloudlab M510 nodes at the Uni-
versity of Utah datacenter. In our experiments, co-TenQu is
compared with state-of-the-art solutions listed below.
FIGURE 8: co-TenQu Diagram (Binary)
• PCA-QuClassi [39]: It is the predecessor of co-TenQu.
Instead of a collaborative quantum-classical training
framework, it utilizes principal component analysis
(PCA) to reduce the dimensions of the dataset. In our
evaluations, we use PCA-5, PCA-7 and PCA-17 to
denote its 5-qubit, 7-qubit and 17-qubit settings. Ad-
ditionally, PCA-QuClassi has been compared with its
different versions, including the Single Qubit Unitary
Layer, Dual Qubit Unitary Layer and Entanglement
Layer.
• QuntumFlow [40] (QF-pNet): It employs a co-design
framework of quantum neural networks and utilizes
FIGURE 9: co-TenQu Diagram (3-class) downsampling to reduce the dimensions along with the
amplitude encoding method.
This system can be trained all together at once rather than • TensorFlow Quantum [36] (TFQ): The example codes
requiring a feature extractor to be pre-trained. The entire provided by Tensorflow Quantum library are based on
training algorithm is summarized in Algorithm 1. First, the Cirq circuits and standard layer designs.
data is loaded as shown in Equation 26 (Line 1). Lines 2- • DNN-Fair [41]: A classical deep neural network for
3 involve introducing training parameters set by the user MNIST data may contain 1.2M parameters. For a more
at run time. The learning rate α indicates how large the fair comparison, we construct a deep neural network
updates to the system parameters should be during training. with 3145 parameters.
The network weights are initialized randomly. The number Furthermore, when comparing our co-TenQu architecture
of epochs ϵ indicates how many times the network will be to above-mentioned solutions in the literature of quantum
8 VOLUME 4, 2023
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. For more information, see https://creativecommons.org/licenses/by-nc-nd/4
This article has been accepted for publication in IEEE Transactions on Quantum Engineering. This is the author's version which has not been fully edited and
content may change prior to final publication. Citation information: DOI 10.1109/TQE.2024.3367234
L’Abbate et al.: A Quantum-Classical Collaborative Training Architecture Based on Quantum State Fidelity
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. For more information, see https://creativecommons.org/licenses/by-nc-nd/4
This article has been accepted for publication in IEEE Transactions on Quantum Engineering. This is the author's version which has not been fully edited and
content may change prior to final publication. Citation information: DOI 10.1109/TQE.2024.3367234
L’Abbate et al.: A Quantum-Classical Collaborative Training Architecture Based on Quantum State Fidelity
co-TenQu PCA-5 QF-pNet TFQ DNN-Fair co-TenQu PCA-5 TFQ DNN-Fair co-TenQu PCA-5 TFQ DNN-Fair
1.0 1.0 1.0
Accuracy
Accuracy
0.6 0.6 0.6
Accuracy
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. For more information, see https://creativecommons.org/licenses/by-nc-nd/4
This article has been accepted for publication in IEEE Transactions on Quantum Engineering. This is the author's version which has not been fully edited and
content may change prior to final publication. Citation information: DOI 10.1109/TQE.2024.3367234
L’Abbate et al.: A Quantum-Classical Collaborative Training Architecture Based on Quantum State Fidelity
Accuracy
Accuracy
0.6 0.6 0.6
Accuracy
VOLUME 4, 2023 11
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. For more information, see https://creativecommons.org/licenses/by-nc-nd/4
This article has been accepted for publication in IEEE Transactions on Quantum Engineering. This is the author's version which has not been fully edited and
content may change prior to final publication. Citation information: DOI 10.1109/TQE.2024.3367234
L’Abbate et al.: A Quantum-Classical Collaborative Training Architecture Based on Quantum State Fidelity
0.8 Manila
Perth
0.7 Auckland
0.6 Mumbai
Quito
0.5 Guadalupe
Accuracy
Montreal
0.4 Belem
Lima
0.3 Lagos
Washington
0.2 Toronto
Hanoi
0.1 Jakarta
0.0
1/5 3/6
FIGURE 14: (1,5) and (3,6) MNIST Binary Classifications on IBM-Q Quantum FIGURE 15: IBM-Q Lima Topology
p. 505–510, 2019. [Online]. Available: https://www.nature.com/articles/ [23] A. D’Onofrio, A. Hossain, L. Santana, N. Machlovi, S. Stein, J. Liu,
s41586-019-1666-5 A. Li, and Y. Mao, “Distributed quantum learning with co-management in
[5] P. W. Shor, “Polynomial-time algorithms for prime factorization and a multi-tenant quantum system,” in 2023 IEEE International Conference
discrete logarithms on a quantum computer,” SIAM Journal on on Big Data (BigData). IEEE, 2023, pp. 221–228.
Computing, vol. 26, no. 5, pp. 1484–1509, oct 1997. [Online]. Available: [24] K. Beer, D. Bondarenko, T. Farrelly, T. J. Osborne, R. Salzmann,
https://doi.org/10.1137%2Fs0097539795293172 D. Scheiermann, and R. Wolf, “Training deep quantum neural networks,”
[6] L. K. Grover, “A fast quantum mechanical algorithm for database search,” Nature communications, vol. 11, no. 1, p. 808, 2020.
1996. [Online]. Available: https://arxiv.org/abs/quant-ph/9605043 [25] A. Abbas, D. Sutter, C. Zoufal, A. Lucchi, A. Figalli, and S. Woerner,
[7] S. Garg and G. Ramakrishnan, “Advances in quantum deep learning: An “The power of quantum neural networks,” Nature Computational Science,
overview,” 2020. [Online]. Available: https://arxiv.org/abs/2005.04316 vol. 1, no. 6, pp. 403–409, 2021.
[8] K. Beer, D. Bondarenko, T. Farrelly, R. S. T. J. Osborne, D. Scheiermann, [26] Z. Liang, H. Wang, J. Cheng, Y. Ding, H. Ren, Z. Gao, Z. Hu, D. S. Boning,
, and R. Wolf, “Training deep quantum neural networks,” Nature commu- X. Qian, S. Han et al., “Variational quantum pulse learning,” in 2022 IEEE
nications,vol. 11, no. 1, pp. 1–6, 2019. International Conference on Quantum Computing and Engineering (QCE).
[9] I. Kerenidis, A. Luongo, and A. Prakash, “Quantum expectation- IEEE, 2022, pp. 556–565.
maximization for gaussian mixture models,” 2019. [Online]. Available: [27] P. Easom-McCaldin, A. Bouridane, A. Belatreche, R. Jiang, and S. Al-
https://arxiv.org/abs/1908.06657 Maadeed, “Efficient quantum image classification using single qubit en-
[10] T. Li, S. Chakrabarti, and X. Wu, “Sublinear quantum algorithms for coding,” IEEE Transactions on Neural Networks and Learning Systems,
training linear and kernel-based classifiers,” 2019. [Online]. Available: 2022.
https://arxiv.org/abs/1904.02276 [28] “mnist,” http://yann.lecun.com/exdb/mnist/.
[11] C. Ding, T.-Y. Bao, and H.-L. Huang, “Quantum-inspired support vector [29] E. Farhi and H. Neven, “Classification with quantum neural networks
machine,” 2019. [Online]. Available: https://arxiv.org/abs/1906.08902 on near term processors,” 2018. [Online]. Available: https://arxiv.org/abs/
[12] A. Panahi, S. Saeedi, and T. Arodz, “word2ket: Space-efficient 1802.06002
word embeddings inspired by quantum entanglement,” 2019. [Online]. [30] K. Mitarai, M. Negoro, M. Kitagawa, and K. Fujii, “Quantum circuit
Available: https://arxiv.org/abs/1911.04975 learning,” Physical Review A, vol. 98, no. 3, p. 032309, 2018.
[13] P. Kaye, R. Laflamme, and M. M. al., “An introduction to quantum [31] M. Ostaszewski, L. M. Trenkwalder, W. Masarczyk, E. Scerri, and V. Dun-
computing,” Oxford university press, 2007. jko, “Reinforcement learning for optimization of variational quantum cir-
[14] S. A. Stein, B. Baheri, D. Chen, Y. Mao, Q. Guan, A. Li, S. Xu, and cuit architectures,” Advances in Neural Information Processing Systems,
C. Ding, “Quclassi: A hybrid deep neural network architecture based on vol. 34, pp. 18 182–18 194, 2021.
quantum state fidelity,” Proceedings of Machine Learning and Systems, [32] J. Stokes, J. Izaac, N. Killoran, and G. Carleo, “Quantum natural
vol. 4, 2022. gradient,” Quantum, vol. 4, p. 269, may 2020. [Online]. Available:
[15] S. A. Stein, R. L’Abbate, W. Mu, Y. Liu, B. Baheri, Y. Mao, G. Qiang, https://doi.org/10.22331%2Fq-2020-05-25-269
A. Li, and B. Fang, “A hybrid system for learning classical data in [33] I. Cong, S. Choi, and M. D. Lukin, “Quantum convolutional neural
quantum states,” in 2021 IEEE International Performance, Computing, and networks,” Nature Physics, vol. 15, no. 12, pp. 1273–1278, aug 2019.
Communications Conference (IPCCC). IEEE, 2021, pp. 1–7. [Online]. Available: https://doi.org/10.1038%2Fs41567-019-0648-8
[16] S. A. Stein, B. Baheri, D. Chen, Y. Mao, Q. Guan, A. Li, B. Fang, and [34] S. Stein, Y. Mao, J. Ang, and A. Li, “Qucnn: A quantum convolu-
S. Xu, “Qugan: A quantum state fidelity based generative adversarial tional neural network with entanglement based backpropagation,” in 2022
network,” in 2021 IEEE International Conference on Quantum Computing IEEE/ACM 7th Symposium on Edge Computing (SEC). IEEE, 2022, pp.
and Engineering (QCE). IEEE, 2021, pp. 71–81. 368–374.
[17] S. Y.-C. Chen, C.-M. Huang, C.-W. Hsing, and Y.-J. Kao, “An end-to-end [35] W. Jiang, J. Xiong, and Y. Shi, “A co-design framework of neural
trainable hybrid classical-quantum classifier,” Machine Learning: Science networks and quantum circuits towards quantum advantage,” Nature
and Technology, vol. 2, no. 4, p. 045021, 2021. Communications, vol. 12, no. 1, jan 2021. [Online]. Available:
[18] T. Hur, L. Kim, and D. K. Park, “Quantum convolutional neural network https://doi.org/10.1038%2Fs41467-020-20729-5
for classical data classification,” Quantum Machine Intelligence, vol. 4, [36] M. Broughton, G. Verdon, T. McCourt, A. J. Martinez, J. H. Yoo, S. V.
no. 1, p. 3, 2022. Isakov, P. Massey, R. Halavati, M. Y. Niu, A. Zlokapa et al., “Tensorflow
[19] E. H. Houssein, Z. Abohashima, M. Elhoseny, and W. M. Mohamed, quantum: A software framework for quantum machine learning,” arXiv
“Machine learning in the quantum realm: The state-of-the-art, challenges, preprint arXiv:2003.02989, 2020.
and future vision,” Expert Systems with Applications, p. 116512, 2022. [37] S. Y.-C. Chen, C.-M. Huang, C.-W. Hsing, and Y.-J. Kao, “Hybrid
[20] F. V. Massoli, L. Vadicamo, G. Amato, and F. Falchi, “A leap among quantum-classical classifier based on tensor network and variational
quantum computing and quantum neural networks: A survey,” ACM quantum circuit,” 2020. [Online]. Available: https://arxiv.org/abs/2011.
Computing Surveys, vol. 55, no. 5, pp. 1–37, 2022. 14651
[21] M. Cerezo, G. Verdon, H.-Y. Huang, L. Cincio, and P. J. Coles, “Chal- [38] V. Bergholm, J. Izaac, M. Schuld, C. Gogolin, M. S. Alam, S. Ahmed,
lenges and opportunities in quantum machine learning,” Nature Computa- J. M. Arrazola, C. Blank, A. Delgado, S. Jahangiri, K. McKiernan,
tional Science, vol. 2, no. 9, pp. 567–576, 2022. J. J. Meyer, Z. Niu, A. Száva, and N. Killoran, “Pennylane: Automatic
[22] W.-L. Chang and A. V. Vasilakos, Fundamentals of Quantum Program- differentiation of hybrid quantum-classical computations,” 2018. [Online].
ming in IBM’s Quantum Computers. Springer, 2021. Available: https://arxiv.org/abs/1811.04968
12 VOLUME 4, 2023
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. For more information, see https://creativecommons.org/licenses/by-nc-nd/4
This article has been accepted for publication in IEEE Transactions on Quantum Engineering. This is the author's version which has not been fully edited and
content may change prior to final publication. Citation information: DOI 10.1109/TQE.2024.3367234
L’Abbate et al.: A Quantum-Classical Collaborative Training Architecture Based on Quantum State Fidelity
[39] S. A. Stein, B. Baheri, D. Chen, Y. Mao, Q. Guan, A. Li, S. Xu, and ANG LI is a senior computer scientist in the
C. Ding, “Quclassi: A hybrid deep neural network architecture based on Physical and Computational Directorate of Pa-
quantum state fidelity,” Proceedings of Machine Learning and Systems, cific Northwest National Laboratory and affiliated
vol. 4, pp. 251–264, 2022. Associate Professor at University of Washington,
WA, USA. He received B.E. from Zhejiang Uni-
[40] W. Jiang, J. Xiong, and Y. Shi, “A co-design framework of neural networks versity, China, and two PhD from the National
and quantum circuits towards quantum advantage,” Nature communica- University of Singapore and Eindhoven University
tions, vol. 12, no. 1, p. 579, 2021. of Technology, Netherlands. His research focuses
on software-hardware codesign for scalable het-
[41] “Tensorflow Quantum Fair Comparison,” https://www.tensorflow.org/ erogeneous HPC and quantum computing.
quantum/tutorials/mnist, [Online; accessed 07-March-2023].
PIN-YU CHEN is a principal research staff mem-
ber at IBM Research, Yorktown Heights, NY,
[42] S. Stein, N. Wiebe, Y. Ding, P. Bo, K. Kowalski, N. Baker, J. Ang, USA. He is also the chief scientist of RPI-IBM AI
and A. Li, “Eqc: ensembled quantum computing for variational quantum Research Collaboration and PI of ongoing MIT-
algorithms,” in Proceedings of the 49th Annual International Symposium
IBM Watson AI Lab projects. Dr. Chen received
on Computer Architecture, 2022, pp. 59–71.
his Ph.D. degree in electrical engineering and
computer science from the University of Michi-
gan, Ann Arbor, USA, in 2016. Dr. Chen’s recent
research focuses on adversarial machine learning
RYAN L’ABBATE is a database developer and and robustness of neural networks. His long-term
research chemist at En-Tech Corp. He received research vision is to build trustworthy machine learning systems. At IBM
bachelor’s degrees in Chemical Engineering and Research, he received the honor of IBM Master Inventor and several re-
Mathematics from Manhattan College in Decem- search accomplishment awards, including an IBM Master Inventor and IBM
ber 2017 and a master’s degree in Data Science Corporate Technical Award in 2021. His research works contribute to IBM
from the Department of Computer and Informa- open-source libraries including Adversarial Robustness Toolbox (ART 360)
tion Science at Fordham University in May 2022. and AI Explainability 360 (AIX 360). He has published more than 50 papers
He was inducted into the Omega Chi Epsilon related to trustworthy machine learning at major AI and machine learning
honors society for chemical engineering and the conferences. He is a member of IEEE and an associate editor of Transactions
Pi Mu Epsilon honors society for mathematics. He on Machine Learning Research.
has also had data science research published in the Haseltonia journal. His
JUNTAO CHEN (S’15-M’21) received the Ph.D.
research interests include quantum computing, quantum data science, and
degree in Electrical Engineering from New York
data structures.
University (NYU), Brooklyn, NY, in 2020, and the
ANTHONY D’ONOFRIO JR. is a former grad- B.Eng. degree in Electrical Engineering and Au-
uate student from the Department of Computer tomation with honor from Central South Univer-
and Information Science at Fordham University in sity, Changsha, China, in 2014. He is currently an
New York City. He received his bachelor’s degree assistant professor at the Department of Computer
in Computer Science from Fordham University in and Information Sciences and an affiliated faculty
2022 and his master’s degree in May 2023. During member with the Fordham Center of Cybersecu-
his time at Fordham, he was a Fordham-IBM rity, Fordham University, New York, USA. His
research intern and was inducted into Fordham research interests include cyber-physical security and resilience, quantum
University’s Chapter of Sigma Xi, the Scientific AI and its security, game and decision theory, network optimization and
Research Honor Society, as an Associate Member learning. He was a recipient of the Ernst Weber Fellowship, the Dante Youla
for his work. His research interests focus on the fields of distributed systems, Award, and the Alexander Hessel Award for the Best Ph.D. Dissertation in
quantum systems, quantum deep learning, and software engineering. Electrical Engineering from NYU.
SAMUEL STEIN is a Staff Scientist in the high- YING MAO is an Associate Professor in the
performance-computing (HPC) group of Pacific Department of Computer and Information Sci-
Northwest National Laboratory (PNNL) since De- ence at Fordham University in the New York
cember, 2022. He received his bachelors degree City. He received his Ph.D. in Computer Sci-
in Chemical Engineering from the University of ence from the University of Massachusetts Boston
Cape Town, South Africa, in 2018 and his masters in 2016. He was a Fordham-IBM research fel-
in Data Science from Fordham University in 2020. low. His research interests mainly focus on the
His research has been focusing on Quantum Ma- fields of quantum systems, quantum deep learning,
chine Learning, Quantum Error Mitigation, and quantum-classical optimizations, quantum system
Distributed Quantum Computing. More recently, virtualization, cloud resource management, data-
his research has focused on heterogeneous quantum computing designs, and intensive platforms and containerized applications.
distributed quantum computing architectures.