Entanglement-Enabled Advantage For Learning A Bosonic Random Displacement Channel

This document discusses using quantum entanglement to provide an exponential advantage in learning properties of a bosonic random displacement channel. The task is to estimate the characteristic function of a probabilistic mixture of displacement operators acting on n bosonic modes. The authors prove that without entanglement, the number of samples needed grows exponentially with n. However, with entanglement to an ancillary system and sufficient squeezing, the number of samples needed is independent of n. They also show this entanglement-assisted scheme remains efficient even with photon loss. This establishes an exponential separation between entanglement-assisted and entanglement-free learning of continuous variable systems.

Entanglement-enabled advantage for learning a bosonic random displacement channel

Changhun Oh,1,∗ Senrui Chen,1,∗ Yat Wong,1 Sisi Zhou,2,3,4 Hsin-Yuan Huang,3,5,6 Jens A.H. Nielsen,7 Zheng-Hao Liu,7 Jonas S. Neergaard-Nielsen,7 Ulrik L. Andersen,7 Liang Jiang,1,† and John Preskill3,‡

1 Pritzker School of Molecular Engineering, The University of Chicago, Chicago, Illinois 60637, USA
2 Perimeter Institute for Theoretical Physics, Waterloo, Ontario N2L 2Y5, Canada
3 Institute for Quantum Information and Matter, California Institute of Technology, Pasadena, CA 91125, USA
4 Department of Physics and Astronomy and Institute for Quantum Computing, University of Waterloo, Ontario N2L 2Y5, Canada
5 Center for Theoretical Physics, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
6 Google Quantum AI, Venice, CA, USA
7 Center for Macroscopic Quantum States (bigQ), Department of Physics, Technical University of Denmark, Building 307, Fysikvej, 2800 Kgs. Lyngby, Denmark

arXiv:2402.18809v1 [quant-ph] 29 Feb 2024

(Dated: March 1, 2024)
We show that quantum entanglement can provide an exponential advantage in learning properties
of a bosonic continuous-variable (CV) system. The task we consider is estimating a probabilistic
mixture of displacement operators acting on n bosonic modes, called a random displacement channel.
We prove that if the n modes are not entangled with an ancillary quantum memory, then the channel
must be sampled a number of times exponential in n in order to estimate its characteristic function
to reasonable precision; this lower bound on sample complexity applies even if the channel inputs and
measurements performed on channel outputs are chosen adaptively. On the other hand, we present a
simple entanglement-assisted scheme that only requires a number of samples independent of n, given
a sufficient amount of squeezing. This establishes an exponential separation in sample complexity.
We then analyze the effect of photon loss and show that the entanglement-assisted scheme is still
significantly more efficient than any lossless entanglement-free scheme under mild experimental
conditions. Our work illuminates the role of entanglement in learning continuous-variable systems and
points toward experimentally feasible demonstrations of provable entanglement-enabled advantage
using CV quantum platforms.

Quantum science and technology holds promise to revolutionize how we understand and interact with nature, enabling computational speedups [1], classically impossible communication tasks [2, 3], and measurements with unprecedented sensitivity [4–6]. Rapid progress during the noisy intermediate-scale quantum (NISQ) era [7] has brought these promises closer to reality, but the challenge remains to demonstrate rigorous quantum advantage for practical problems.

Over the past few years, there has been ongoing theoretical and experimental progress in exploring quantum computational advantage [8–16]. Another recent line of research seeks quantum advantage in learning [17–24], revealing that access to quantum memory enables us to learn properties of nature more efficiently. Specifically, Refs. [18, 19] establish a framework for proving exponential separation in sample complexity between learning with and without a coherently controllable quantum memory. In contrast to its computational counterpart, this entanglement-enabled advantage in learning can be proven without invoking computational assumptions and can sometimes be more experimentally accessible. A proof-of-principle experiment has been conducted on Google’s superconducting quantum processor using 40 qubits [18].

Most learning tasks studied so far are restricted to discrete-variable (DV) systems. It is natural to ask whether entanglement-enabled advantage can also be realized for learning properties of bosonic continuous-variable (CV) systems. This is particularly interesting and important because CV systems are ubiquitous in nature and have many applications in quantum information science, such as quantum sensing [6, 8, 25–27]. However, generalizing the results in DV systems to CV systems can be difficult because bosonic systems have infinite-dimensional Hilbert spaces, making it challenging to formulate rigorous results concerning the complexity of learning properties of these systems. Recent progress has been achieved in studies of entanglement-enhanced learning of CV-state characteristic functions [28]; however, the lower bounds obtained so far apply to a restricted class of learning strategies rather than to general entanglement-free schemes.

In this work, we rigorously establish an entanglement-enabled advantage in learning a probabilistic mixture of n-mode displacement operations, called a bosonic random displacement channel. Specifically, we show that any schemes without ancillary quantum memory require a number of samples exponential in n to learn the characteristic functions of the channel with reasonably good precision and high success probability. On the contrary, we present a simple scheme utilizing entanglement with ancillary quantum memory (i.e., entanglement-assisted) that can complete the same learning task with a sample
complexity independent of n, given access to two-mode squeezed vacuum (TMSV) states with sufficiently large squeezing parameter and Bell measurements (BM). This establishes an exponential separation between learning with and without entanglement in the bosonic system. The two learning scenarios are illustrated in Fig. 1. Note that our hardness results hold for arbitrarily high-energy input states and arbitrary measurements, while the presented entanglement-assisted scheme only requires a finite-energy TMSV and BM.

FIG. 1. Schemes for learning an n-mode random displacement channel Λ. (a) TMSV+BM, a specific entanglement-assisted (EA) scheme. (b) General entanglement-free (EF) scheme. Here we assume no concatenation is allowed, i.e., each copy of the channel acts on some input state ρ0 and is measured destructively by some POVM {E}. The input state and measurement are allowed to be adaptively chosen depending on previous outcomes. An example of an EF scheme is Vacuum+Heterodyne (see the main text).

Furthermore, we analyze the robustness of this entanglement-enabled advantage under realistic experimental conditions. Specifically, we study the photon-loss effect, the most common noise source in optical platforms. Our results suggest that for squeezing parameters and loss rates achievable in a state-of-the-art bosonic experiment platform, the separation in sample complexity remains significant. Therefore, we anticipate that an experimental demonstration of entanglement-enabled advantage in CV quantum systems can be achieved in the near future.

Problem Setup.— We consider the task of learning an n-mode random displacement channel characterized by a probability distribution p(α) with $\alpha \in \mathbb{C}^n$, which transforms an input state $\hat\rho$ as

$$\Lambda(\hat\rho) = \int d^{2n}\alpha \, p(\alpha)\, \hat D(\alpha)\, \hat\rho\, \hat D^\dagger(\alpha), \tag{1}$$

where $\hat D(\alpha) := \otimes_{i=1}^{n} \hat D(\alpha_i)$ and $\hat D(\alpha_i) := \exp(\alpha_i \hat a_i^\dagger - \alpha_i^* \hat a_i)$ is the displacement operator for the ith mode. The random displacement channel can also be equivalently described by the characteristic function of p(α), i.e., its Fourier transform, as (see SM S1 [29] for the derivation)

$$\Lambda(\hat\rho) = \frac{1}{\pi^n} \int d^{2n}\beta \, \lambda(\beta)\, \mathrm{Tr}[\hat\rho \hat D(\beta)]\, \hat D^\dagger(\beta), \tag{2}$$

$$\lambda(\beta) := \int d^{2n}\alpha \, p(\alpha)\, e^{\alpha^\dagger\beta - \beta^\dagger\alpha}. \tag{3}$$

Here, because of the Fourier relation, λ(β) at large |β| corresponds to rapidly oscillating features of p(α). Since the domain of β is infinite in principle, we will focus on a restricted finite domain specified later. The goal is to learn the channel by estimating the characteristic function λ(β). We emphasize that the goal is to characterize the channel, as opposed to identifying a particular displacement that is drawn from the distribution p(α). The value of β for which λ(β) is to be estimated is revealed only after all measurements are completed.

We focus on the separation between two types of learning schemes for the random displacement channel distinguished by whether or not the scheme uses entanglement between the system and an ancilla, as illustrated in Fig. 1. Throughout this work, we define an entanglement-free scheme to be both ancilla-free and concatenation-free, i.e., the output of the channel is measured destructively after each channel use. However, the entanglement-free scheme is allowed to be adaptive; for each channel use, the input to the channel and the measurement performed on the output may depend on measurement outcomes obtained in earlier rounds. This scenario is similar to Refs. [17, 22]. Several recent works have obtained lower bounds on learning DV channels that hold even with concatenation [23, 30, 31], but we will not analyze the consequences of concatenating CV channels in this work for simplicity.

Schemes.— Now, we present an entanglement-assisted scheme (see Fig. 1) inspired by a similar scheme that has been proposed for DV Pauli channel estimation in Ref. [23]. Consider an n-mode random displacement channel $\Lambda_B$ acting on the n-mode system B. To learn this channel, we prepare n two-mode squeezed vacuum (TMSV) states, i.e., CV Bell states with a finite squeezing parameter r; one mode of each pair passes through the channel while the other stays in quantum memory. Finally, we measure the output state by a CV Bell measurement (BM), which can be implemented by combining the two modes on a 50:50 beam splitter and performing homodyne measurements on the output ports along different quadratures [27]. Formally, the BM POVM element labeled by $\zeta \in \mathbb{C}^n$ has the following form: $(I \otimes \hat D(\zeta))|\Psi\rangle\langle\Psi|(I \otimes \hat D^\dagger(\zeta))/\pi^n$; here |Ψ⟩ denotes the tensor product of n infinitely squeezed TMSV states, each proportional to $\sum_{k=0}^{\infty} |k\rangle|k\rangle$ when expressed in the Fock basis. To see how to learn a random displacement channel using this TMSV+BM scheme, we invoke the probability of obtaining outcome ζ from BM (see SM S2 A [29] for the derivation):

$$p_{\mathrm{EA}}(\zeta) = \frac{1}{\pi^{2n}} \int d^{2n}\alpha \, \lambda(\alpha)\, e^{-e^{-2r}|\alpha|^2}\, e^{\alpha^\dagger\zeta - \zeta^\dagger\alpha}. \tag{4}$$
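As a numerical illustration of Eq. (4): the factor $e^{-e^{-2r}|\alpha|^2}$ multiplying λ(α) means that $p_{\mathrm{EA}}$ is the displacement distribution p(α) convolved with a complex Gaussian of variance $e^{-2r}$, so a BM outcome can be simulated as ζ = α + w. The Python sketch below does this for a single mode (n = 1) and checks that the empirical average of $e^{\zeta^\dagger\beta - \beta^\dagger\zeta}$ approaches $\lambda(\beta)\, e^{-e^{-2r}|\beta|^2}$. The Gaussian choice of p(α), the parameter values, and all function names are assumptions made for illustration; they are not taken from the paper or its supplemental material.

```python
import cmath
import math
import random


def sample_bm_outcome(sigma, r, rng):
    """Draw one single-mode outcome zeta = alpha + w.

    alpha follows the assumed Gaussian
    p(alpha) = (2 sigma^2 / pi) exp(-2 sigma^2 |alpha|^2);
    w is complex Gaussian noise with E|w|^2 = exp(-2r), the convolution
    kernel implied by the damping factor in Eq. (4).
    """
    alpha_std = 1.0 / (2.0 * sigma)                  # per-quadrature std of alpha
    noise_std = math.sqrt(math.exp(-2.0 * r) / 2.0)  # per-quadrature std of w
    alpha = complex(rng.gauss(0.0, alpha_std), rng.gauss(0.0, alpha_std))
    w = complex(rng.gauss(0.0, noise_std), rng.gauss(0.0, noise_std))
    return alpha + w


def empirical_characteristic(samples, beta):
    """Sample mean of exp(conj(zeta) * beta - conj(beta) * zeta)."""
    total = 0j
    for zeta in samples:
        total += cmath.exp(zeta.conjugate() * beta - beta.conjugate() * zeta)
    return total / len(samples)


def gaussian_lambda(beta, sigma):
    """Characteristic function exp(-|beta|^2 / (2 sigma^2)) of the assumed p(alpha)."""
    return math.exp(-abs(beta) ** 2 / (2.0 * sigma ** 2))


def run_demo(sigma=1.0, r=1.0, beta=0.5 + 0.3j, num_samples=20000, seed=0):
    """Return (empirical lambda_EA, predicted lambda_EA, estimate of lambda)."""
    rng = random.Random(seed)
    samples = [sample_bm_outcome(sigma, r, rng) for _ in range(num_samples)]
    lam_ea = empirical_characteristic(samples, beta)
    damping = math.exp(-math.exp(-2.0 * r) * abs(beta) ** 2)
    predicted = gaussian_lambda(beta, sigma) * damping
    return lam_ea, predicted, lam_ea / damping


if __name__ == "__main__":
    lam_ea, predicted, lam_hat = run_demo()
    print("empirical lambda_EA(beta):", lam_ea)
    print("predicted lambda_EA(beta):", predicted)
    print("estimate of lambda(beta) :", lam_hat)
```

Dividing the empirical average by the damping factor gives an estimate of λ(β) itself. For fixed β the statistical error shrinks as $1/\sqrt{N}$, while the correction factor $e^{e^{-2r}|\beta|^2}$ that amplifies it grows with $|\beta|^2$ unless r is large.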
Fourier transforming to invert this relation, we obtain

$$\lambda(\beta) = e^{e^{-2r}|\beta|^2} \int d^{2n}\zeta \, p_{\mathrm{EA}}(\zeta)\, e^{\zeta^\dagger\beta - \beta^\dagger\zeta} := e^{e^{-2r}|\beta|^2} \lambda_{\mathrm{EA}}(\beta). \tag{5}$$

This expression indicates that, by sampling N measurement outcomes $\{\zeta^{(i)}\}_{i=1}^{N}$ from a TMSV+BM scheme, one can obtain an unbiased estimator $\tilde\lambda(\beta)$ of λ(β) by defining $\tilde\lambda(\beta) := \frac{1}{N} e^{e^{-2r}|\beta|^2} \sum_{i=1}^{N} e^{\zeta^{(i)\dagger}\beta - \beta^\dagger\zeta^{(i)}}$. Note that the same set of samples can be used to estimate λ(β) for different values of β just by modifying the estimator. Using Hoeffding’s bound, we prove the following theorem (see SM S2 A for the proof):

Theorem 1. For any n-mode random displacement channel Λ, after the TMSV+BM scheme with squeezing parameter r has learned from N copies of Λ, and then received a query $\beta \in \mathbb{C}^n$, it can provide an estimator $\tilde\lambda(\beta)$ of Λ’s characteristic function λ(β) such that $|\tilde\lambda(\beta) - \lambda(\beta)| \le \epsilon$ with probability at least 1 − δ, with the number of samples $N = 8 e^{2e^{-2r}|\beta|^2} \epsilon^{-2} \log(4\delta^{-1})$.

Let us compare the TMSV+BM scheme with a particular entanglement-free scheme that uses the vacuum state as input and heterodyne detection (Vacuum+Heterodyne). Here, heterodyne detection is defined as a projection onto the (overcomplete) basis of coherent states, i.e., $|\zeta\rangle\langle\zeta|/\pi^n$ with $\zeta \in \mathbb{C}^n$. Though it may not be the optimal entanglement-free scheme, this specific scheme helps us understand the limitations of entanglement-free schemes, which we capture more generally in Theorem 2 below. In this scheme, the probability of obtaining POVM outcome ζ is (see SM S2 B [29])

$$p_{\mathrm{VH}}(\zeta) = \frac{1}{\pi^{2n}} \int d^{2n}\alpha \, \lambda(\alpha)\, e^{\alpha^\dagger\zeta - \zeta^\dagger\alpha}\, e^{-|\alpha|^2}. \tag{6}$$

In fact, the Vacuum+Heterodyne scheme can be understood as the TMSV+BM scheme with r = 0. Inverting this relation by Fourier transforming, we may express the channel’s characteristic function in terms of the measurement probability distribution:

$$\lambda(\beta) = e^{|\beta|^2} \int d^{2n}\zeta \, p_{\mathrm{VH}}(\zeta)\, e^{\zeta^\dagger\beta - \beta^\dagger\zeta} := e^{|\beta|^2} \lambda_{\mathrm{VH}}(\beta), \tag{7}$$

which yields another unbiased estimator $\tilde\lambda(\beta) := \frac{1}{N} e^{|\beta|^2} \sum_{i=1}^{N} e^{\zeta^{(i)\dagger}\beta - \beta^\dagger\zeta^{(i)}}$ given N samples $\{\zeta^{(i)}\}_{i=1}^{N}$. Comparing with (5), we see that the prefactor in (7) lacks the r-dependent suppression $e^{-2r}$ in its exponent. Specifically, if we confine β to $|\beta|^2 \le \kappa n$ with a constant κ > 0, we obtain upper bounds on the sample complexity for achieving an error ϵ with success probability 1−δ using each scheme:

$$N_{\mathrm{EA}} = O(e^{2e^{-2r}\kappa n}\, \epsilon^{-2} \log \delta^{-1}), \tag{8}$$

$$N_{\mathrm{VH}} = O(e^{2\kappa n}\, \epsilon^{-2} \log \delta^{-1}). \tag{9}$$

In particular, if we choose the squeezing parameter as r = Ω(log n), the sample complexity $N_{\mathrm{EA}} = O(\epsilon^{-2} \log \delta^{-1})$ of the entanglement-assisted scheme becomes independent of the number of modes n, while our upper bound on sample complexity of the entanglement-free scheme increases exponentially with n. Since the accessible squeezing parameter is bounded in practice, though, we will compare the sample complexities of the two schemes when r is an n-independent constant below.

To illustrate the difference, we compare TMSV+BM and Vacuum+Heterodyne strategies with an example in Fig. 2. We consider a single-mode channel for ease of visualization, characterized by

$$p(\alpha) = \frac{2\sigma^2}{\pi} e^{-2\sigma^2|\alpha|^2} \left[\cos^2(\alpha_r\gamma_i - \alpha_i\gamma_r) + \sin^2(\alpha_r\gamma_r + \alpha_i\gamma_i)\right], \tag{10}$$

$$\lambda(\beta) = e^{-\frac{|\beta|^2}{2\sigma^2}} + \frac{1}{4} e^{-\frac{|\beta-\gamma|^2}{2\sigma^2}} + \frac{1}{4} e^{-\frac{|\beta+\gamma|^2}{2\sigma^2}} - \frac{1}{4} e^{-\frac{|\beta-i\gamma|^2}{2\sigma^2}} - \frac{1}{4} e^{-\frac{|\beta+i\gamma|^2}{2\sigma^2}}, \tag{11}$$

with σ = 0.3, $\gamma_r = 1.6$, $\gamma_i = 0$ ($\gamma := \gamma_r + i\gamma_i$), and r = 2 for the TMSV+BM scheme. The figure, where we present the underlying output probability distributions and their characteristic functions from Eqs. (4), (5), (6), and (7), clearly shows that in the TMSV+BM scheme with a sufficiently large squeezing parameter, the resultant probability distribution and characteristic function are almost identical to the ideal case. However, for the Vacuum+Heterodyne scheme, the vacuum noise distorts the initial probability distribution so significantly that we cannot see the signal clearly, which thus makes it harder to estimate the original characteristic function.

Lower bound.— Our upper bound on the sample complexity of the Vacuum+Heterodyne scheme scales exponentially with n. Can this scaling be improved using more advanced entanglement-free schemes, such as homodyne or general-dyne detection [27, 32], or by non-Gaussian resources like GKP states [33] or photon-number resolving measurements [34]? Here, using information-theoretic methods, we prove an exponential sample complexity lower bound for any entanglement-free scheme. This highlights the indispensable role of entanglement for efficiently learning bosonic random displacement channels. Our result is as follows:

Theorem 2. Let Λ be an arbitrary n-mode random displacement channel (n ≥ 8) and consider an entanglement-free scheme that uses N copies of Λ. After all measurements are completed, the scheme receives the query $\beta \in \mathbb{C}^n$ and returns an estimate $\tilde\lambda(\beta)$ of Λ’s characteristic function λ(β). Suppose that, with success probability at least 2/3, $|\tilde\lambda(\beta) - \lambda(\beta)| \le \epsilon \le 0.24$ for all β such that $|\beta|^2 \le n\kappa$. Then $N \ge 0.01\, \epsilon^{-2} (1 + 1.98\kappa)^n$.

Here, the choice of success probability 2/3 is arbitrary and can be easily amplified. Comparing with the

(a) True distribution probability: (1) Set Λ = Λ0 ; (2) Set Λ = Λγ , for γ sampled
from a zero-mean homogeneous Gaussian distribution
whose variance is determined by κ. Next, Alice allows Bob
to use the channel Λ N times, and Bob uses his favorite
entanglement-free scheme to learn from these channel
uses. After Bob has finished all quantum measurements
and keeps only classical data, Alice reveals some auxiliary
(b) Entanglement-assisted information to Bob, who is then asked to decide whether
Alice has chosen (1) or (2).
Given a learning scheme satisfying the assumptions of
Theorem 2, Bob can guess correctly with high probabil-
ity. This means that the outcome distributions of Bob’s
scheme under hypotheses (1) and (2) must have a suffi-
ciently large total variation distance (TVD). On the other
(c) Vacuum+Heterodyne hand, we can upper bound the contribution from each
use of Λ to the TVD to be exponentially small, where
we use a technique inspired by Ref. [35] which derived
the maximum fidelity of Gaussian random displacement
channels. Therefore, the number of channel uses N must
be exponentially large to ensure a large enough TVD,
which gives us the desired lower bound.

FIG. 2. Comparison between (a) the true distribution, (b) Effect of loss.— Now, for practical applications, we
TMSV+BM, and (c) Vacuum+Heterodyne strategies. The study how the entanglement-assisted scheme is affected by
left panel represents the probability distribution of the true photon loss, a dominant noise source in optical platforms
distribution and measurement probability distributions for (see SM S2 D for a discussion of more general noise models,
each scheme. The right panel represents the characteristic such as phase diffusion). Photon loss
function of probability distributions. √ √ transforms the
relevant bosonic operator â to T â + 1 − T ê, where T
is the transmission rate and ê is the environmental mode,
i.e., 1−T is the loss rate. We consider two different places
entanglement-assisted sample complexity given in Eq. (8),
where the loss occurs: one is before applying the channel
Theorem 2 establishes a separation exponential in n for
with loss rate 1−Tb to model the preparation imperfection,
cutoff coefficient κ = O(1) and squeezing parameter
and the other is after applying the channel and before the
r = Ω(log n). The intuition underlying this theorem is
perfect BM with loss rate 1 − Ta , which models the finite
that displacement operators D̂(β) do not generally com-
efficiency of detection, i.e., an imperfect BM [27]. As
mute with each other. Consequently, entanglement-free
before, we derive the relation between the measurement
measurements can resolve λ(β) for only a small portion
probability distribution and the characteristic function of
of β space. We sketch the proof below and leave the full
the channel (with appropriate rescaling of the phase):
details to SM S3 [29].
Z √
|β|2
Proof Sketch. Our proof extends the techniques of λ(β) = ee d2n ζ ploss (ζ)e(ζ β−β ζ)/ Ta , (14)
−2reff † †

Refs. [18, 23] to the CV case. We begin by defining the

following family of “3-peak” random displacement chan- where we define an effective squeezing parameter
nels Λϵ,σ
39peak = {Λγ }γ∈Cn whose characteristic functions
and distributions of displacement are, respectively, 1 1 − Ta
reff := − log Tb e −2r
+ (1 − Tb ) + , (15)
|β|2 |β−γ|2 |β+γ|2
2 Ta
λγ (β) = e − 2σ2
+ 2iϵ0 e − 2σ2
− 2iϵ0 e − 2σ2
, (12)
which incorporates the loss rates. Because loss degrades
−2σ 2 |α|2
pγ (α) ∝ e (1 + 4ϵ0 sin(2(γi αr − γr αi ))) . (13) the advantage from squeezing, the upper bound on sample
complexity in Theorem 1 is modified in the presence of
with positive parameters σ and ϵ := 0.98ϵ0 . We will show loss (see SM S2 C for the proof):
that, even with the prior knowledge that the channel
is from this family, it is still hard for entanglement-free Theorem 3. For the same task as in Theorem 1, a
schemes to complete the learning tasks. TMVS+BM scheme with squeezing parameter r and trans-
The key idea is to reduce learning to binary hypothesis mission rates before and after the channel to be Tb and
testing. Consider the following game between Alice and Ta , respectively, can estimate any λ(β) to error ϵ with
Bob: Alice chooses one of two hypotheses with equal success probability 1−δ using the number of samples
$N = 8 e^{2e^{-2r_{\mathrm{eff}}}|\beta|^2} \epsilon^{-2} \log(4\delta^{-1})$, where $r_{\mathrm{eff}}$ is defined according to Eq. (15).

Thus, when $|\beta|^2 \le \kappa n$ with a constant κ > 0, $T_b = 1 - O(1/n)$, $T_a = 1 - O(1/n)$ and r = Ω(log n), the sample complexity becomes $N = O(\epsilon^{-2} \log \delta^{-1})$ as in the lossless case. For practically relevant squeezing and including loss prior to Bell measurement, we compare the sample complexity for the lossy TMSV+BM protocol and the lossless entanglement-free lower bound in Fig. 3, finding a significant entanglement-enabled advantage in realistic experimental settings. Specifically, for reasonable parameter choices such as squeezing parameter r = 1, loss rate 10%, and κ = O(1), we can achieve a factor of $10^4$ ($10^8$) advantage for around n = 30 (60) modes. Although the $10^9$ samples required to achieve the advantage may seem large, state-of-the-art quantum optics experiments (e.g., Refs. [36, 37]) can collect this many samples in a reasonable time, with sampling rates up to 160 GHz.

FIG. 3. (a) Comparison of TMSV+BM (with different loss rates), Vacuum+Heterodyne, and the entanglement-free lower bound at κ = 1. The task is to estimate any λ(β) such that $|\beta|^2 \le \kappa n$ with precision ϵ = 0.2 and success probability 1 − δ = 2/3. The orange region represents a rigorous advantage over all entanglement-free schemes. The blue region represents an advantage over noiseless Vacuum+Heterodyne. (b) Comparison of the TMSV+BM scheme with squeezing parameter r = 1.0 and loss rate 1 − T = 0.1 with the entanglement-free lower bound of Theorem 2. (See SM S3 A for further practical considerations.) The task is the same as (a). The brown solid contour lines represent the sample complexity of TMSV+BM given by Theorem 3. The blue dashed contour lines represent the ratio of sample complexity between the entanglement-free lower bound and TMSV+BM, indicating the entanglement-enabled advantage.

Discussion.— We proved that schemes that exploit entanglement with an ancillary quantum memory can learn n-mode bosonic random displacement channels with exponentially fewer samples compared to entanglement-free schemes. Our results show that the information-theoretic framework for learning studied in DV quantum systems [17, 19] can be generalized to the CV setting and has powerful implications. We anticipate that these techniques can be applied to other CV learning tasks as well. In addition, our analysis suggests that the separation in sample complexity between entanglement-assisted and entanglement-free protocols may be realized in the near future.

Apart from their theoretical interest, random displacement channels can also be practically relevant in, e.g., modeling noise in bosonic systems. As in the qubit case [38], we expect that noise tailoring methods can transform more general noise models into random displacement channels; therefore efficiently learning random displacement channels can be useful for benchmarking CV quantum systems [39, 40] and for error mitigation.

Displacement estimation is also studied in quantum metrology (see e.g. [41–43]). A task often considered in metrology is learning an unknown unitary displacement or phase transformation acting independently on each mode [36, 42, 44–47], whereas the task analyzed in this work is learning an unknown mixture of multimode displacements. Furthermore, while the goal in metrology is typically to learn one or a few parameters, in our case, the parameter space is very large. Therefore, the methodology in the two settings is quite different. Connections between metrology and bosonic channel learning are worthy of further exploration.

We thank Mankei Tsang, Yuxin Wang, Ronald de Wolf, Mingxing Yao, and Ming Yuan for insightful discussions. C.O., S.C., Y.W., L.J. acknowledge support from the ARO (W911NF-23-1-0077), ARO MURI (W911NF-21-1-0325), AFOSR MURI (FA9550-19-1-0399, FA9550-21-1-0209), NSF (OMA-1936118, ERC-1941583, OMA-2137642), NTT Research, Packard Foundation (2020-71479). J.P. acknowledges support from the U.S. Department of Energy Office of Science, Office of Advanced Scientific Computing Research (DE-NA0003525, DE-SC0020290), the U.S. Department of Energy, Office of Science, National Quantum Information Science Research Centers, Quantum Systems Accelerator, and the National Science Foundation (PHY-1733907). The Institute for Quantum Information and Matter is an NSF Physics Frontiers Center. S.Z. acknowledges funding provided by the Institute for Quantum Information and Matter and Perimeter Institute for Theoretical Physics, a research institute supported in part by the Government of Canada through the Department of Innovation, Science and Economic Development Canada and by the Province of Ontario through the Ministry of Colleges and Universities. J.A.H.N., Z.L., J.S.N., and U.L.A. acknowledge support from DNRF (bigQ, DNRF142), IFD (photoQ) and EU (CLUSTEC, ClusterQ ERC-101055224, GTGBS MC-101106833).

∗ These authors contributed equally to this work: C.O. ([email protected]); S.C. (csen-[email protected]).
† [email protected]
‡ [email protected]

[1] M. A. Nielsen and I. Chuang, Quantum computation and quantum information (2002).
[2] N. Gisin and R. Thew, Quantum communication, Nature Photonics 1, 165 (2007).
[3] H. J. Kimble, The quantum internet, Nature 453, 1023 (2008).
[4] V. Giovannetti, S. Lloyd, and L. Maccone, Quantum metrology, Physical Review Letters 96, 010401 (2006).
[5] V. Giovannetti, S. Lloyd, and L. Maccone, Advances in quantum metrology, Nature Photonics 5, 222 (2011).
[6] E. Polino, M. Valeri, N. Spagnolo, and F. Sciarrino, Photonic quantum metrology, AVS Quantum Science 2, 024703 (2020).
[7] J. Preskill, Quantum computing in the NISQ era and beyond, Quantum 2, 79 (2018).
[8] S. Aaronson and A. Arkhipov, The computational complexity of linear optics, in Proceedings of the forty-third annual ACM symposium on Theory of computing (2011) pp. 333–342.
[9] S. Boixo, S. V. Isakov, V. N. Smelyanskiy, R. Babbush, N. Ding, Z. Jiang, M. J. Bremner, J. M. Martinis, and H. Neven, Characterizing quantum supremacy in near-term devices, Nature Physics 14, 595 (2018).
[10] F. Arute, K. Arya, R. Babbush, D. Bacon, J. C. Bardin, R. Barends, R. Biswas, S. Boixo, F. G. Brandao, D. A. Buell, et al., Quantum supremacy using a programmable superconducting processor, Nature 574, 505 (2019).
[11] Y. Wu, W.-S. Bao, S. Cao, F. Chen, M.-C. Chen, X. Chen, T.-H. Chung, H. Deng, Y. Du, D. Fan, et al., Strong quantum computational advantage using a superconducting quantum processor, Physical Review Letters 127, 180501 (2021).
[12] H.-S. Zhong, H. Wang, Y.-H. Deng, M.-C. Chen, L.-C. Peng, Y.-H. Luo, J. Qin, D. Wu, X. Ding, Y. Hu, et al., Quantum computational advantage using photons, Science 370, 1460 (2020).
[13] H.-S. Zhong, Y.-H. Deng, J. Qin, H. Wang, M.-C. Chen, L.-C. Peng, Y.-H. Luo, D. Wu, S.-Q. Gong, H. Su, et al., Phase-programmable Gaussian boson sampling using stimulated squeezed light, Physical Review Letters 127, 180502 (2021).
[14] L. S. Madsen, F. Laudenbach, M. F. Askarani, F. Rortais, T. Vincent, J. F. Bulmer, F. M. Miatto, L. Neuhaus, L. G. Helt, M. J. Collins, et al., Quantum computational advantage with a programmable photonic processor, Nature 606, 75 (2022).
[15] A. Morvan, B. Villalonga, X. Mi, S. Mandra, A. Bengtsson, P. Klimov, Z. Chen, S. Hong, C. Erickson, I. Drozdov, et al., Phase transition in random circuit sampling, arXiv preprint arXiv:2304.11119 (2023).
[16] Y.-H. Deng, Y.-C. Gu, H.-L. Liu, S.-Q. Gong, H. Su, Z.-J. Zhang, H.-Y. Tang, M.-H. Jia, J.-M. Xu, M.-C. Chen, J. Qin, L.-C. Peng, J. Yan, Y. Hu, J. Huang, H. Li, Y. Li, Y. Chen, X. Jiang, L. Gan, G. Yang, L. You, L. Li, H.-S. Zhong, H. Wang, N.-L. Liu, J. J. Renema, C.-Y. Lu, and J.-W. Pan, Gaussian boson sampling with pseudo-photon-number-resolving detectors and quantum computational advantage, Physical Review Letters 131, 150601 (2023).
[17] H.-Y. Huang, R. Kueng, and J. Preskill, Information-theoretic bounds on quantum advantage in machine learning, Physical Review Letters 126, 190505 (2021).
[18] H.-Y. Huang, M. Broughton, J. Cotler, S. Chen, J. Li, M. Mohseni, H. Neven, R. Babbush, R. Kueng, J. Preskill, and J. R. McClean, Quantum advantage in learning from experiments, Science 376, 1182 (2022).
[19] S. Chen, J. Cotler, H.-Y. Huang, and J. Li, Exponential separations between learning with and without quantum memory, in 2021 IEEE 62nd Annual Symposium on Foundations of Computer Science (FOCS) (IEEE, 2022) pp. 574–585.
[20] M. C. Caro, Learning quantum processes and Hamiltonians via the Pauli transfer matrix, arXiv preprint arXiv:2212.04471 (2022).
[21] S. Bubeck, S. Chen, and J. Li, Entanglement is necessary for optimal quantum property testing, in 2020 IEEE 61st Annual Symposium on Foundations of Computer Science (FOCS) (IEEE, 2020) pp. 692–703.
[22] D. Aharonov, J. Cotler, and X.-L. Qi, Quantum algorithmic measurement, Nature Communications 13, 1 (2022).
[23] S. Chen, S. Zhou, A. Seif, and L. Jiang, Quantum advantages for Pauli channel estimation, Physical Review A 105, 032435 (2022).
[24] Z. M. Rossi, J. Yu, I. L. Chuang, and S. Sugiura, Quantum advantage for noisy channel discrimination, Physical Review A 105, 032401 (2022).
[25] S. L. Braunstein and P. Van Loock, Quantum information with continuous variables, Reviews of Modern Physics 77, 513 (2005).
[26] C. Weedbrook, S. Pirandola, R. García-Patrón, N. J. Cerf, T. C. Ralph, J. H. Shapiro, and S. Lloyd, Gaussian quantum information, Reviews of Modern Physics 84, 621 (2012).
[27] A. Serafini, Quantum continuous variables: a primer of theoretical methods (CRC Press, 2017).
[28] Y.-D. Wu, G. Chiribella, and N. Liu, Quantum-enhanced learning of continuous-variable quantum states, arXiv preprint arXiv:2303.05097 (2023).
[29] Supplemental material.
[30] S. Chen and W. Gong, Futility and utility of a few ancillas for Pauli channel learning, arXiv preprint arXiv:2309.14326 (2023).
[31] S. Chen, C. Oh, S. Zhou, H.-Y. Huang, and L. Jiang, Tight bounds on Pauli channel learning without entanglement, arXiv preprint arXiv:2309.13461 (2023).
[32] W. P. Schleich, Quantum optics in phase space (John Wiley & Sons, 2011).
[33] D. Gottesman, A. Kitaev, and J. Preskill, Encoding a qubit in an oscillator, Physical Review A 64, 012310 (2001).
[34] D. Schuster, A. A. Houck, J. Schreier, A. Wallraff, J. Gambetta, A. Blais, L. Frunzio, J. Majer, B. Johnson, M. Devoret, et al., Resolving photon number states in a superconducting circuit, Nature 445, 515 (2007).
[35] C. M. Caves and K. Wódkiewicz, Fidelity of Gaussian channels, Open Systems & Information Dynamics 11, 309 (2004).
[36] X. Guo, C. R. Breum, J. Borregaard, S. Izumi, M. V. Larsen, T. Gehring, M. Christandl, J. S. Neergaard-Nielsen, and U. L. Andersen, Distributed quantum sensing in a continuous-variable entangled network, Nature Physics 16, 281 (2020).
[37] A. Inoue, T. Kashiwazaki, T. Yamashima, N. Takanashi, T. Kazama, K. Enbutsu, K. Watanabe, T. Umeki, M. Endo, and A. Furusawa, Toward a multi-core ultra-fast optical quantum processor: 43-GHz bandwidth real-time amplitude measurement of 5-dB squeezed light using modularized optical parametric amplifier with 5G technology, Applied Physics Letters 122, 104001 (2023).
[38] J. J. Wallman and J. Emerson, Noise tailoring for scalable quantum computation via randomized compiling, Physical Review A 94, 052325 (2016).
[39] Y.-D. Wu and B. C. Sanders, Efficient verification of bosonic quantum channels via benchmarking, New Journal of Physics 21, 073026 (2019).
[40] G. Bai and G. Chiribella, Test one to test many: a unified approach to quantum benchmarks, Physical Review Letters 120, 150502 (2018).
[41] H. Shi and Q. Zhuang, Ultimate precision limit of noise sensing and dark matter search, npj Quantum Information 9, 27 (2023).
[42] Q. Zhuang, Z. Zhang, and J. H. Shapiro, Distributed quantum sensing using continuous-variable multipartite entanglement, Physical Review A 97, 032329 (2018).
[43] Y. Xia, W. Li, W. Clark, D. Hart, Q. Zhuang, and Z. Zhang, Demonstration of a reconfigurable entangled radio-frequency photonic sensor network, Physical Review Letters 124, 150502 (2020).
[44] K. Duivenvoorden, B. M. Terhal, and D. Weigand, Single-mode displacement sensor, Physical Review A 95, 012305 (2017).
[45] C. Oh, C. Lee, S. H. Lie, and H. Jeong, Optimal distributed quantum sensing using Gaussian states, Physical Review Research 2, 023030 (2020).
[46] C. Oh, L. Jiang, and C. Lee, Distributed quantum phase sensing for arbitrary positive and negative weights, Physical Review Research 4, 023164 (2022).
[47] H. Kwon, Y. Lim, L. Jiang, H. Jeong, and C. Oh, Quantum metrological power of continuous-variable quantum networks, Physical Review Letters 128, 180503 (2022).
Entanglement-enabled advantage for learning a bosonic random displacement
channel: Supplemental Material

Changhun Oh,1,∗ Senrui Chen,1,∗ Yat Wong,1 Sisi Zhou,2,3,4 Hsin-Yuan Huang,3,5,6 Jens A.H. Nielsen,7 Zheng-Hao Liu,7 Jonas S. Neergaard-Nielsen,7 Ulrik L. Andersen,7 Liang Jiang,1,† and John Preskill3,‡
1 Pritzker School of Molecular Engineering, The University of Chicago, Chicago, Illinois 60637, USA
2 Perimeter Institute for Theoretical Physics, Waterloo, Ontario N2L 2Y5, Canada
3 Institute for Quantum Information and Matter, California Institute of Technology, Pasadena, CA 91125, USA
4 Department of Physics and Astronomy and Institute for Quantum Computing, University of Waterloo, Ontario N2L 2Y5, Canada
5 Center for Theoretical Physics, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
6 Google Quantum AI, Venice, CA, USA
7 Center for Macroscopic Quantum States (bigQ), Department of Physics, Technical University of Denmark, Building 307, Fysikvej, 2800 Kgs. Lyngby, Denmark
(Dated: March 1, 2024)
arXiv:2402.18809v1 [quant-ph] 29 Feb 2024

CONTENTS

S1. Preliminary
    A. Fourier relation

S2. Derivation of output probability distributions of each scheme
    A. Entanglement-assisted (TMSV+BM) schemes
    B. Entanglement-free (Vacuum+Heterodyne) schemes
    C. Entanglement-assisted scheme with imperfection
    D. Discussion on more general input states

S3. Fundamental limits for entanglement-free schemes
    A. Lower bound for entanglement-free schemes
    B. Proof of Lemma S1
    C. Lower bound for entanglement-free Gaussian schemes

S4. Gaussian tail effect

References

S1. PRELIMINARY

∗ These authors contributed equally to this work: C.O. and S.C.

In this section, we provide some identities that are frequently used in the Supplemental Material (more details can be found in Refs. [1–3]). First, an elementary operator in an n-mode bosonic system is the n-mode displacement operator $\hat{D}(\beta) := e^{\beta\hat{a}^\dagger - \beta^\dagger\hat{a}}$, where β := (β1, . . . , βn)^T ∈ C^n, and â := (â1, . . . , ân)^T and â† := (â†1, . . . , â†n)^T are the annihilation and creation operators of bosons, which follow the commutation relation [âi, â†j] = δij. The displacement operators D̂(β) form an orthogonal basis in the operator space; thus, any operator Ô can be expanded in displacement operators as
\begin{equation}
\hat{O} = \frac{1}{\pi^n}\int d^{2n}\beta\, \mathrm{Tr}\left[\hat{O}\hat{D}(\beta)\right]\hat{D}^\dagger(\beta), \tag{S1}
\end{equation}

where Tr[ÔD̂(β)] is called the characteristic function of an operator Ô. The n-mode displacement
operator has the following properties
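\begin{align}
&\hat{D}^\dagger(\beta) = \hat{D}(-\beta), \quad \hat{D}^*(\beta) = \hat{D}(\beta^*), \quad \hat{D}^T(\beta) = \hat{D}(-\beta^*), \quad \mathrm{Tr}\left[\hat{D}(\beta)\right] = \pi^n\delta^{(2n)}(\beta), \tag{S2}\\
&\hat{D}(\beta_1)\hat{D}(\beta_2) = \hat{D}(\beta_1+\beta_2)e^{(\beta_2^\dagger\beta_1-\beta_1^\dagger\beta_2)/2}, \quad \hat{D}(\alpha)\hat{D}^\dagger(\beta)\hat{D}^\dagger(\alpha) = \hat{D}^\dagger(\beta)e^{\alpha^\dagger\beta-\beta^\dagger\alpha}, \tag{S3}\\
&\int\frac{d^{2n}\beta}{\pi^n}\hat{D}(\beta)\hat{O}\hat{D}^\dagger(\beta) = \mathrm{Tr}[\hat{O}]\,\mathbb{1}, \tag{S4}
\end{align}
where the last identity is the twirling identity. We also frequently use the following identity:
\begin{equation}
\delta^{(2n)}(\alpha) = \frac{1}{\pi^{2n}}\int d^{2n}\beta\, e^{\beta^\dagger\alpha-\alpha^\dagger\beta}. \tag{S5}
\end{equation}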

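Also, we employ the Wigner function of an operator Ô, defined as
\begin{equation}
W_O(\alpha) = \frac{1}{\pi^{2n}}\int d^{2n}\beta\, \mathrm{Tr}\left[\hat{O}\hat{D}(\beta)\right]e^{\beta^\dagger\alpha-\alpha^\dagger\beta}. \tag{S6}
\end{equation}
In Sec. S1 A, we show that a random displacement channel defined by the probability distribution p(α) can be rewritten in terms of the Fourier transform λ(β) of p(α), i.e., its characteristic function, as
\begin{equation}
\Lambda(\hat\rho) := \int d^{2n}\alpha\, p(\alpha)\hat{D}(\alpha)\hat\rho\hat{D}^\dagger(\alpha) = \frac{1}{\pi^n}\int d^{2n}\beta\, \lambda(\beta)\,\mathrm{Tr}\left[\hat\rho\hat{D}(\beta)\right]\hat{D}^\dagger(\beta), \tag{S7}
\end{equation}
where the probability distribution p(α) and the characteristic function λ(β) follow the relation
\begin{equation}
\lambda(\beta) = \int d^{2n}\alpha\, p(\alpha)e^{\alpha^\dagger\beta-\beta^\dagger\alpha}, \qquad p(\alpha) = \frac{1}{\pi^{2n}}\int d^{2n}\beta\, \lambda(\beta)e^{\beta^\dagger\alpha-\alpha^\dagger\beta}. \tag{S8}
\end{equation}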
A. Fourier relation

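Here, we derive the expression of a random displacement channel characterized by a probability distribution p(α) in terms of its characteristic function λ(β). To see the relation, we use the identity (S1). Applying it to the density operator ρ̂, we can show that
\begin{align}
\Lambda(\hat\rho) &= \int d^{2n}\alpha\, p(\alpha)\hat{D}(\alpha)\hat\rho\hat{D}^\dagger(\alpha) = \int d^{2n}\alpha\, p(\alpha)\hat{D}(\alpha)\left[\frac{1}{\pi^n}\int d^{2n}\beta\, \mathrm{Tr}\left[\hat\rho\hat{D}(\beta)\right]\hat{D}^\dagger(\beta)\right]\hat{D}^\dagger(\alpha) \tag{S9}\\
&= \frac{1}{\pi^n}\int d^{2n}\alpha\int d^{2n}\beta\, p(\alpha)\,\mathrm{Tr}\left[\hat\rho\hat{D}(\beta)\right]\hat{D}(\alpha)\hat{D}^\dagger(\beta)\hat{D}^\dagger(\alpha) \tag{S10}\\
&= \frac{1}{\pi^n}\int d^{2n}\alpha\int d^{2n}\beta\, p(\alpha)\,\mathrm{Tr}\left[\hat\rho\hat{D}(\beta)\right]\hat{D}^\dagger(\beta)e^{\alpha^\dagger\beta-\beta^\dagger\alpha} \tag{S11}\\
&= \frac{1}{\pi^n}\int d^{2n}\beta\, \lambda(\beta)\,\mathrm{Tr}\left[\hat\rho\hat{D}(\beta)\right]\hat{D}^\dagger(\beta), \tag{S12}
\end{align}
where we used the identities from Eqs. (S3),(S5). Here, the last line renders the identity
\begin{equation}
\lambda(\beta) = \int d^{2n}\alpha\, p(\alpha)e^{\alpha^\dagger\beta-\beta^\dagger\alpha}. \tag{S13}
\end{equation}
Thus, λ(β) is the Fourier transform of p(α). Its inverse Fourier transformation gives
\begin{equation}
p(\alpha) = \frac{1}{\pi^{2n}}\int d^{2n}\beta\, \lambda(\beta)e^{\beta^\dagger\alpha-\alpha^\dagger\beta}. \tag{S14}
\end{equation}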
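As a quick numerical sanity check of the Fourier pair in Eqs. (S13) and (S14), the following sketch evaluates both integrals on a grid for a single mode (n = 1). It assumes the Gaussian pair p(α) = (2σ²/π) e^{−2σ²|α|²} and λ(β) = e^{−|β|²/(2σ²)}, which is the γ = 0 member of the family in Eqs. (S109) and (S110); the grid size, the value σ = 0.7, and all function names are illustrative choices, not part of the original derivation.

```python
import numpy as np

sigma = 0.7                                  # illustrative width parameter
x = np.linspace(-10.0, 10.0, 801)            # real and imaginary axes of the grid
dA = (x[1] - x[0]) ** 2                      # area element d^2 alpha = d Re d Im
grid = x[None, :] + 1j * x[:, None]          # complex phase-space grid

def p_gauss(a):
    """Gaussian displacement distribution (gamma = 0 case of Eq. (S110), n = 1)."""
    return (2 * sigma**2 / np.pi) * np.exp(-2 * sigma**2 * np.abs(a) ** 2)

def lam_gauss(b):
    """Its characteristic function (gamma = 0 case of Eq. (S109))."""
    return np.exp(-np.abs(b) ** 2 / (2 * sigma**2))

def forward(beta):
    """Riemann-sum version of Eq. (S13): lambda(beta) from p(alpha)."""
    kernel = np.exp(np.conj(grid) * beta - np.conj(beta) * grid)
    return np.sum(p_gauss(grid) * kernel) * dA

def inverse(alpha):
    """Riemann-sum version of Eq. (S14): p(alpha) from lambda(beta)."""
    kernel = np.exp(np.conj(grid) * alpha - np.conj(alpha) * grid)
    return np.sum(lam_gauss(grid) * kernel) * dA / np.pi**2

beta0, alpha0 = 0.3 + 0.4j, 0.2 - 0.5j
err_forward = abs(forward(beta0) - lam_gauss(beta0))
err_inverse = abs(inverse(alpha0) - p_gauss(alpha0))
```

Both errors should be at the level of numerical round-off for this grid, since the integrands are smooth and decay well inside the integration window.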

S2. DERIVATION OF OUTPUT PROBABILITY DISTRIBUTIONS OF EACH SCHEME

A. Entanglement-assisted (TMSV+BM) schemes

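In this section, we consider the two-mode squeezed vacuum and Bell measurement (TMSV+BM) strategy (see Fig. 1(a) in the main text) and prove Theorem 1 in the main text by deriving the sample complexity of the strategy. We assume a lossless system and a perfect Bell measurement, whereas the input squeezed state has a finite squeezing parameter r, since r is typically upper-bounded by a constant in practice. We then analyze the effect of imperfect measurement and loss in Sec. S2 C.
We now derive the probability of outcomes of the strategy, i.e., the outcomes obtained by applying an n-mode channel Λ to one subsystem of a product of n TMSV states |Ψ̃⟩ and measuring in the Bell basis |Ψ(ζ)⟩⟨Ψ(ζ)|/π^n with ζ ∈ C^n:
\begin{equation}
p_{\mathrm{EA}}(\zeta) = \frac{1}{\pi^n}\mathrm{Tr}\left[(|\Psi(\zeta)\rangle\langle\Psi(\zeta)|_{AB})(\mathcal{I}_A\otimes\Lambda_B)(|\tilde\Psi\rangle\langle\tilde\Psi|_{AB})\right]. \tag{S15}
\end{equation}
To simplify the expression, we rewrite the TMSV state. To do that, let us first consider a single-mode squeezed state |r⟩ := Ŝ(r)|0⟩:
\begin{equation}
|r\rangle\langle r| = \frac{1}{\pi}\int d^2\alpha\, \hat{D}^\dagger(\alpha)\,\mathrm{Tr}\left[\hat{D}(\alpha)|r\rangle\langle r|\right] := \frac{1}{\pi}\int d^2\alpha\, \hat{D}^\dagger(\alpha)f(\alpha,r), \tag{S16}
\end{equation}
where $\hat{S}(r) := \exp\left[r(\hat{a}^{\dagger 2}-\hat{a}^2)/2\right]$ is the squeezing operation and we have defined
\begin{align}
f(\alpha,r) &:= \mathrm{Tr}\left[\hat{D}(\alpha)|r\rangle\langle r|\right] = \langle 0|\hat{S}^\dagger(r)\hat{D}(\alpha)\hat{S}(r)|0\rangle = \langle 0|\hat{D}(\alpha\cosh r-\alpha^*\sinh r)|0\rangle \tag{S17}\\
&= \exp\left(-\frac{1}{2}|\alpha\cosh r-\alpha^*\sinh r|^2\right). \tag{S18}
\end{align}
Here, we have used the relation Ŝ†(r) â Ŝ(r) = â cosh r + â† sinh r. Using the fact that a TMSV state can be generated by injecting two single-mode squeezed states into a 50:50 beam splitter, i.e., $\hat{U}_{\mathrm{BS}}(|r\rangle\langle r|\otimes|-r\rangle\langle -r|)\hat{U}_{\mathrm{BS}}^\dagger = |\tilde\Psi(r)\rangle\langle\tilde\Psi(r)|$, where Û_BS is the 50:50 beam splitter, we can rewrite the TMSV state |Ψ̃(r)⟩ as
\begin{align}
|\tilde\Psi(r)\rangle\langle\tilde\Psi(r)| &= \frac{1}{\pi^2}\int d^2\alpha_1 d^2\alpha_2\, \hat{U}_{\mathrm{BS}}[\hat{D}^\dagger(\alpha_1)\otimes\hat{D}^\dagger(\alpha_2)]\hat{U}_{\mathrm{BS}}^\dagger f(\alpha_1,r)f(\alpha_2,-r) \tag{S19}\\
&= \frac{1}{\pi^2}\int d^2\alpha_1 d^2\alpha_2\, \hat{D}^\dagger\left(\frac{\alpha_1-\alpha_2}{\sqrt{2}}\right)\otimes\hat{D}^\dagger\left(\frac{\alpha_1+\alpha_2}{\sqrt{2}}\right)f(\alpha_1,r)f(\alpha_2,-r) \tag{S20}\\
&= \frac{1}{\pi^2}\int d^2\omega_1 d^2\omega_2\, \hat{D}^\dagger(\omega_1)\otimes\hat{D}^\dagger(\omega_2)f\left(\frac{\omega_1+\omega_2}{\sqrt{2}},r\right)f\left(\frac{\omega_2-\omega_1}{\sqrt{2}},-r\right) \tag{S21}\\
&:= \frac{1}{\pi^2}\int d^2\omega_1 d^2\omega_2\, \hat{D}^\dagger(\omega_1)\otimes\hat{D}^\dagger(\omega_2)g(\omega_1,\omega_2,r), \tag{S22}
\end{align}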
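where we defined
\begin{align}
g(\omega_1,\omega_2,r) &:= f\left(\frac{\omega_1+\omega_2}{\sqrt{2}},r\right)f\left(\frac{\omega_2-\omega_1}{\sqrt{2}},-r\right) \tag{S23}\\
&= \exp\left\{-\frac{1}{2}\left[(|\omega_1|^2+|\omega_2|^2)\cosh 2r-(\omega_1\omega_2+\omega_1^*\omega_2^*)\sinh 2r\right]\right\}, \tag{S24}
\end{align}
which is the characteristic function of the TMSV state |Ψ̃(r)⟩ by Eq. (S1), i.e.,
\begin{equation}
g(\omega_1,\omega_2,r) = \mathrm{Tr}\left[|\tilde\Psi(r)\rangle\langle\tilde\Psi(r)|(\hat{D}(\omega_1)\otimes\hat{D}(\omega_2))\right]. \tag{S25}
\end{equation}
The generalization to multiple TMSV states is straightforward:
\begin{equation}
g(\omega_1,\omega_2,r) := \exp\left\{-\frac{1}{2}\left[(|\omega_1|^2+|\omega_2|^2)\cosh 2r-(\omega_1\cdot\omega_2+\omega_1^*\cdot\omega_2^*)\sinh 2r\right]\right\}, \tag{S26}
\end{equation}
where ω1,2 are now n-dimensional complex vectors. In particular, when ω2 = ω1*, it reduces to
\begin{equation}
g(\omega_1,\omega_1^*,r) = \exp\left(-e^{-2r}|\omega_1|^2\right). \tag{S27}
\end{equation}
Therefore, the input TMSV states can be written as
\begin{equation}
|\tilde\Psi(r)\rangle\langle\tilde\Psi(r)| = \frac{1}{\pi^{2n}}\int d^{2n}\omega_1 d^{2n}\omega_2\, \hat{D}^\dagger(\omega_1)\otimes\hat{D}^\dagger(\omega_2)g(\omega_1,\omega_2,r). \tag{S28}
\end{equation}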
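Similarly, by generalizing an entangled state with a single-mode ancilla to an entangled state with n modes and infinite squeezing, the POVM of the CV Bell measurement can be written as [2, 3]
\begin{equation}
\frac{1}{\pi^n}|\Psi(\zeta)\rangle\langle\Psi(\zeta)| = \frac{1}{\pi^n}(I\otimes\hat{D}(\zeta))|\Psi\rangle\langle\Psi|(I\otimes\hat{D}^\dagger(\zeta)) = \int\frac{d^{2n}\omega}{\pi^{2n}}e^{\zeta^*\cdot\omega^*-\zeta\cdot\omega}\hat{D}^\dagger(\omega)\otimes\hat{D}^T(\omega), \tag{S29}
\end{equation}
where
\begin{equation}
|\Psi\rangle\langle\Psi| := \int\frac{d^{2n}\omega}{\pi^n}\hat{D}^\dagger(\omega)\otimes\hat{D}^T(\omega) \tag{S30}
\end{equation}
corresponds to a multimode generalization of a TMSV state with infinite squeezing parameter, i.e., $|\Psi\rangle\propto\sum_{k=0}^\infty|k\rangle|k\rangle$. Here, the normalization factor 1/π^n is introduced to ensure the completeness
\begin{equation}
\frac{1}{\pi^n}\int d^{2n}\zeta\,|\Psi(\zeta)\rangle\langle\Psi(\zeta)| = \mathbb{1}\otimes\mathbb{1}. \tag{S31}
\end{equation}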
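We now simplify the expression of the output probability distribution of the scheme:
\begin{align}
p_{\mathrm{EA}}(\zeta) &= \frac{1}{\pi^n}\mathrm{Tr}\left[(|\Psi(\zeta)\rangle\langle\Psi(\zeta)|_{AB})(\mathcal{I}_A\otimes\Lambda_B)(|\tilde\Psi\rangle\langle\tilde\Psi|_{AB})\right] \tag{S32}\\
&= \frac{1}{\pi^n}\mathrm{Tr}\left[\int\frac{d^{2n}\omega}{\pi^n}\frac{d^{2n}\beta_1 d^{2n}\beta_2}{\pi^{2n}}g(\beta_1,\beta_2,r)e^{\zeta^*\cdot\omega^*-\zeta\cdot\omega}\left(\hat{D}^\dagger(\omega)_A\otimes\hat{D}^T(\omega)_B\right)(\mathcal{I}_A\otimes\Lambda_B)\left(\hat{D}^\dagger(\beta_1)_A\otimes\hat{D}^\dagger(\beta_2)_B\right)\right]. \tag{S33}
\end{align}
Here, we have
\begin{align}
(\mathcal{I}_A\otimes\Lambda_B)(\hat{D}^\dagger(\beta_1)_A\otimes\hat{D}^\dagger(\beta_2)_B) &= \hat{D}^\dagger(\beta_1)_A\otimes\frac{1}{\pi^n}\int d^{2n}\omega\,\lambda(\omega)\,\mathrm{Tr}\left[\hat{D}^\dagger(\beta_2)_B\hat{D}(\omega)_B\right]\hat{D}^\dagger(\omega)_B \tag{S34}\\
&= \hat{D}^\dagger(\beta_1)_A\otimes\int d^{2n}\omega\,\lambda(\omega)\delta(\omega-\beta_2)\hat{D}^\dagger(\omega)_B \tag{S35}\\
&= \lambda(\beta_2)\hat{D}^\dagger(\beta_1)_A\otimes\hat{D}^\dagger(\beta_2)_B. \tag{S36}
\end{align}
Thus, we can simplify Eq. (S32) as
\begin{align}
&\frac{1}{\pi^n}\mathrm{Tr}\left[\int\frac{d^{2n}\omega\, d^{2n}\beta_1 d^{2n}\beta_2}{\pi^n\pi^{2n}}g(\beta_1,\beta_2,r)e^{\zeta^*\cdot\omega^*-\zeta\cdot\omega}\left(\hat{D}^\dagger(\omega)_A\otimes\hat{D}^T(\omega)_B\right)(\mathcal{I}_A\otimes\Lambda_B)\left(\hat{D}^\dagger(\beta_1)_A\otimes\hat{D}^\dagger(\beta_2)_B\right)\right] \tag{S37}\\
&= \frac{1}{\pi^{2n}}\int\frac{d^{2n}\omega\, d^{2n}\beta_1 d^{2n}\beta_2}{\pi^n\pi^n}\lambda(\beta_2)g(\beta_1,\beta_2,r)e^{\zeta^*\cdot\omega^*-\zeta\cdot\omega}\,\mathrm{Tr}\left[\hat{D}^\dagger(\omega)_A\hat{D}^\dagger(\beta_1)_A\right]\mathrm{Tr}\left[\hat{D}^T(\omega)_B\hat{D}^\dagger(\beta_2)_B\right] \tag{S38}\\
&= \frac{1}{\pi^{2n}}\int d^{2n}\omega\, d^{2n}\beta_1 d^{2n}\beta_2\,\lambda(\beta_2)g(\beta_1,\beta_2,r)e^{\zeta^*\cdot\omega^*-\zeta\cdot\omega}\delta(\omega+\beta_1)\delta(\omega^*+\beta_2) \tag{S39}\\
&= \frac{1}{\pi^{2n}}\int d^{2n}\omega\,\lambda(-\omega^*)g(-\omega,-\omega^*,r)e^{\zeta^*\cdot\omega^*-\zeta\cdot\omega}. \tag{S40}
\end{align}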
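Hence, we finally obtain the probability of obtaining ζ from the Bell measurement with an initial Bell state with finite squeezing:
\begin{equation}
p_{\mathrm{EA}}(\zeta) = \frac{1}{\pi^{2n}}\int d^{2n}\omega\,\lambda(-\omega^*)g(-\omega,-\omega^*,r)e^{\zeta^*\cdot\omega^*-\zeta\cdot\omega} = \frac{1}{\pi^{2n}}\int d^{2n}\omega\,\lambda(-\omega^*)e^{-e^{-2r}|\omega|^2}e^{\zeta^*\cdot\omega^*-\zeta\cdot\omega}. \tag{S41}
\end{equation}
By inverting the relation using Fourier transformation,
\begin{align}
\int d^{2n}\zeta\, p_{\mathrm{EA}}(\zeta)e^{\zeta^\dagger\beta-\beta^\dagger\zeta} &= \frac{1}{\pi^{2n}}\int d^{2n}\zeta\, d^{2n}\omega\,\lambda(-\omega^*)g(-\omega,-\omega^*,r)e^{(\omega^*+\beta)\cdot\zeta^*-(\omega^*+\beta)^*\cdot\zeta} \tag{S42}\\
&= \lambda(\beta)g(\beta^*,\beta,r), \tag{S43}
\end{align}
we obtain the relation between the characteristic function of the channel λ(β) and the probability distribution of outcomes p_EA(ζ):
\begin{equation}
\lambda(\beta) = \frac{1}{g(\beta^*,\beta,r)}\int d^{2n}\zeta\, p_{\mathrm{EA}}(\zeta)e^{\zeta^\dagger\beta-\beta^\dagger\zeta} = \exp\left(e^{-2r}|\beta|^2\right)\int d^{2n}\zeta\, p_{\mathrm{EA}}(\zeta)e^{\zeta^\dagger\beta-\beta^\dagger\zeta}. \tag{S44}
\end{equation}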
The expression shows that by obtaining samples ζ following the probability distribution p_EA(ζ) in experiment and taking the Fourier transform, one can obtain an estimate of λ(β).
Now, we show that $N \ge \frac{8}{\epsilon^2}\log\frac{4}{\delta}\,e^{2e^{-2r}|\beta|^2} = O(e^{2e^{-2r}|\beta|^2}\epsilon^{-2}\log\delta^{-1})$ samples suffice to achieve precision ϵ with probability at least 1 − δ. As observed above, for N samples $\{\zeta^{(i)}\}_{i=1}^N$, we set the estimator of λ(β) to be $\tilde\lambda(\beta) = \frac{1}{N}\sum_{i=1}^N\exp\left(e^{-2r}|\beta|^2\right)e^{\zeta^{(i)\dagger}\beta-\beta^\dagger\zeta^{(i)}}$ for measurement outcomes ζ^(i) and apply the Hoeffding bound to the estimator. To do that, we bound the real part and the imaginary part separately and combine them. The Hoeffding bound first gives
\begin{equation}
\Pr\left[|\tilde\lambda_r(\beta)-\lambda_r(\beta)|\le\epsilon/2\right]\ge 1-2\exp\left(-\frac{N\epsilon^2}{8}e^{-2e^{-2r}|\beta|^2}\right), \qquad \Pr\left[|\tilde\lambda_i(\beta)-\lambda_i(\beta)|\le\epsilon/2\right]\ge 1-2\exp\left(-\frac{N\epsilon^2}{8}e^{-2e^{-2r}|\beta|^2}\right), \tag{S45}
\end{equation}
where λ_r = Re(λ) and λ_i = Im(λ), and similarly for λ̃. Applying the union bound and the triangle inequality, we obtain
\begin{equation}
\Pr\left[|\tilde\lambda(\beta)-\lambda(\beta)|\le\epsilon\right]\ge 1-4\exp\left(-\frac{N\epsilon^2}{8}e^{-2e^{-2r}|\beta|^2}\right). \tag{S46}
\end{equation}
Thus, if we choose the number of samples to be
\begin{equation}
N\ge\frac{8}{\epsilon^2}\log\frac{4}{\delta}\,e^{2e^{-2r}|\beta|^2} = O(e^{2e^{-2r}|\beta|^2}\epsilon^{-2}\log\delta^{-1}), \tag{S47}
\end{equation}
the estimation error is upper-bounded by ϵ with probability at least 1 − δ. This completes the proof of Theorem 1 in the main text.
Note that in an ideal case where the input squeezing parameter r can be chosen to be arbitrarily large, the sample complexity reduces to N = O(1/ϵ²) for any β.
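The estimator built from Eq. (S44) can be illustrated with a small Monte Carlo sketch. The outcome model used here is our reading of Eq. (S43): since the Fourier transform of p_EA is λ(β) e^{−e^{−2r}|β|²}, each outcome is simulated as ζ = α + z, where α is the hidden displacement and z is complex Gaussian noise with E|z|² = e^{−2r} per mode. The Gaussian displacement distribution, the parameter values, and all function names are illustrative assumptions.

```python
import numpy as np

def complex_gaussian(rng, mean_sq, shape):
    """Complex normal samples with E|z|^2 = mean_sq in every component."""
    s = np.sqrt(mean_sq / 2.0)
    return rng.normal(0.0, s, shape) + 1j * rng.normal(0.0, s, shape)

def estimate_lambda(zeta, beta, r):
    """Sample mean of exp(e^{-2r}|beta|^2) exp(zeta^† beta - beta^† zeta), cf. Eq. (S44)."""
    beta_sq = np.sum(np.abs(beta) ** 2)
    # zeta has shape (N, n); each row is one measurement outcome.
    phases = np.exp(zeta.conj() @ beta - zeta @ beta.conj())
    return np.exp(np.exp(-2.0 * r) * beta_sq) * phases.mean()

rng = np.random.default_rng(0)
n, r, sigma, N = 4, 1.0, 1.0, 200_000
beta = np.full(n, 0.5 + 0.0j)                                    # |beta|^2 = 1

# Hidden displacements: Gaussian channel with lambda(beta) = exp(-|beta|^2 / (2 sigma^2)).
alpha = complex_gaussian(rng, 1.0 / (2.0 * sigma**2), (N, n))
# Assumed outcome model: displacement plus finite-squeezing noise of variance e^{-2r}.
zeta = alpha + complex_gaussian(rng, np.exp(-2.0 * r), (N, n))

lam_est = estimate_lambda(zeta, beta, r)
lam_true = np.exp(-np.sum(np.abs(beta) ** 2) / (2.0 * sigma**2))
```

With these parameters each term of the average has modulus e^{e^{−2r}|β|²} ≈ 1.14, so the statistical error of the mean is at the 10⁻³ level and `lam_est` should agree with `lam_true` to about two decimal places.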

B. Entanglement-free (Vacuum+Heterodyne) schemes

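Now, let us consider the entanglement-free scheme with vacuum input and heterodyne detection (Vacuum+Heterodyne). In general, denoting Π_ϕ as a POVM element with outcome ϕ and |ϕ0⟩⟨ϕ0| as an input state, the outcome probability of an entanglement-free scheme is written as
\begin{align}
p_{\mathrm{EF}}(\phi) &= \mathrm{Tr}[\Pi_\phi\Lambda(|\phi_0\rangle\langle\phi_0|)] \tag{S48}\\
&= \frac{1}{\pi^{2n}}\int d^{2n}\alpha\, d^{2n}\beta\,\mathrm{Tr}\left[\hat{D}(\alpha)\Lambda(\hat{D}^\dagger(\beta))\right]\mathrm{Tr}\left[\Pi_\phi\hat{D}^\dagger(\alpha)\right]\mathrm{Tr}\left[|\phi_0\rangle\langle\phi_0|\hat{D}(\beta)\right] \tag{S49}\\
&= \frac{1}{\pi^n}\int d^{2n}\alpha\, d^{2n}\beta\,\lambda(\beta)\delta(\alpha-\beta)\,\mathrm{Tr}\left[\Pi_\phi\hat{D}^\dagger(\alpha)\right]\mathrm{Tr}\left[|\phi_0\rangle\langle\phi_0|\hat{D}(\beta)\right] \tag{S50}\\
&= \frac{1}{\pi^n}\int d^{2n}\beta\,\lambda(\beta)\,\mathrm{Tr}\left[\Pi_\phi\hat{D}^\dagger(\beta)\right]\mathrm{Tr}\left[|\phi_0\rangle\langle\phi_0|\hat{D}(\beta)\right]. \tag{S51}
\end{align}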

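For the Vacuum+Heterodyne scheme, we employ the vacuum input state |ϕ0⟩ = |0⟩ and heterodyne detection, whose POVM elements are projectors onto the (overcomplete) basis of coherent states, |ζ⟩⟨ζ|/π^n, where |ζ⟩ is a coherent state with complex amplitude ζ ∈ C^n. Note that such a scheme is informationally complete in the sense that it provides distinct probability distributions for different channels. For this scheme, we obtain the probability distribution
\begin{equation}
p_{\mathrm{VH}}(\zeta) = \frac{1}{\pi^{2n}}\int d^{2n}\alpha\,\lambda(\alpha)\,\mathrm{Tr}\left[|\zeta\rangle\langle\zeta|\hat{D}^\dagger(\alpha)\right]\mathrm{Tr}\left[|0\rangle\langle 0|\hat{D}(\alpha)\right] = \frac{1}{\pi^{2n}}\int d^{2n}\alpha\,\lambda(\alpha)e^{\alpha^\dagger\zeta-\zeta^\dagger\alpha}e^{-|\alpha|^2}. \tag{S52}
\end{equation}
Again, by inverting the probability distribution as
\begin{equation}
\int d^{2n}\zeta\, p_{\mathrm{VH}}(\zeta)e^{\zeta^\dagger\beta-\beta^\dagger\zeta} = \frac{1}{\pi^{2n}}\int d^{2n}\zeta\, d^{2n}\alpha\,\lambda(\alpha)e^{\zeta^\dagger(\beta-\alpha)-(\beta-\alpha)^\dagger\zeta}e^{-|\alpha|^2} = \lambda(\beta)e^{-|\beta|^2}, \tag{S53}
\end{equation}
we obtain the final relation between the measurement probability distribution and the characteristic function of the channel:
\begin{equation}
\lambda(\beta) = e^{|\beta|^2}\int d^{2n}\zeta\, p_{\mathrm{VH}}(\zeta)e^{\zeta^\dagger\beta-\beta^\dagger\zeta}. \tag{S54}
\end{equation}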

This clearly shows the difference from the entanglement-assisted strategy, namely the prefactor $e^{|\beta|^2}$. Thus, similarly, after sampling ζ from experiments following p_VH(ζ) and averaging $e^{|\beta|^2}e^{\zeta^\dagger\beta-\beta^\dagger\zeta}$ over the samples, we obtain an estimate of λ(β). As in the entanglement-assisted case, for N samples $\{\zeta^{(i)}\}_{i=1}^N$, by setting the estimator $\tilde\lambda(\beta) = \frac{1}{N}\sum_{i=1}^N e^{|\beta|^2}e^{\zeta^{(i)\dagger}\beta-\beta^\dagger\zeta^{(i)}}$ and using the Hoeffding bound, we obtain
\begin{equation}
\Pr\left[|\tilde\lambda(\beta)-\lambda(\beta)|\le\epsilon\right]\ge 1-4\exp\left(-\frac{N\epsilon^2}{8}e^{-2|\beta|^2}\right). \tag{S55}
\end{equation}
Thus, in this case, a sufficient number of samples is
\begin{equation}
N\ge\frac{8}{\epsilon^2}\log\frac{4}{\delta}\,e^{2|\beta|^2} = O(e^{2|\beta|^2}\epsilon^{-2}\log\delta^{-1}) \tag{S56}
\end{equation}
for the estimation error to be upper-bounded by ϵ with probability at least 1 − δ. This shows the significant difference in sample complexity from the entanglement-assisted case.
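To make the gap between Eqs. (S47) and (S56) concrete, the following sketch evaluates both sufficient sample counts on a log10 scale (the raw numbers overflow floating point quickly as |β|² grows). The function name and the example parameter values are illustrative assumptions; the formulas are the two bounds exactly as stated above.

```python
import math

def log10_sufficient_samples(eps, delta, beta_sq, noise):
    """log10 of (8/eps^2) log(4/delta) exp(2 * noise * |beta|^2).

    noise = exp(-2r) reproduces Eq. (S47) (TMSV+BM);
    noise = 1 reproduces Eq. (S56) (Vacuum+Heterodyne).
    """
    prefactor = 8.0 / eps**2 * math.log(4.0 / delta)
    return math.log10(prefactor) + 2.0 * noise * beta_sq / math.log(10.0)

eps, delta, r, beta_sq = 0.1, 0.1, 1.5, 50.0
log10_ea = log10_sufficient_samples(eps, delta, beta_sq, math.exp(-2.0 * r))
log10_vh = log10_sufficient_samples(eps, delta, beta_sq, 1.0)
gap = log10_vh - log10_ea   # equals 2 (1 - e^{-2r}) |beta|^2 / ln 10
```

The gap depends only on r and |β|², not on ϵ or δ, because both bounds share the same Hoeffding prefactor.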

C. Entanglement-assisted scheme with imperfection

We now consider the effect of imperfections and prove Theorem 3 in the main text. More specifically, we consider the case where photon loss occurs before and after the random displacement channel we want to learn, together with a regularized Bell measurement. Here, the photon loss before and after the random displacement channel models imperfect input-state preparation and imperfect Bell measurement, respectively. In addition, we introduce a parameter s to regularize the Bell measurement POVM by a general-dyne measurement POVM, where we recover the perfect Bell measurement by taking s → ∞. By considering the regularized Bell measurement POVM, we satisfy the same condition as the lower bound in Sec. S3, namely that the measurement POVM is normalizable, i.e., its norm is finite.
Let us first consider the effect of the loss channel on a single-mode displacement operator. Using the equivalent description of a loss channel L by a beam splitter interaction with a vacuum environment, we can show that
\begin{align}
\mathcal{L}[\hat{D}^\dagger(\alpha)_S] &= \mathrm{Tr}_E\left[\hat{U}_T(\hat{D}^\dagger(\alpha)_S\otimes|0\rangle\langle 0|_E)\hat{U}_T^\dagger\right] \tag{S57}\\
&= \int\frac{d^2z}{\pi}e^{-\frac{1}{2}|z|^2}\,\mathrm{Tr}_E\left[\hat{U}_T\hat{D}^\dagger(\alpha)_S\otimes\hat{D}^\dagger(z)_E\hat{U}_T^\dagger\right] \tag{S58}\\
&= \int\frac{d^2z}{\pi}e^{-\frac{1}{2}|z|^2}\,\mathrm{Tr}_E\left[\hat{D}^\dagger(\sqrt{T}\alpha-\sqrt{1-T}z)_S\otimes\hat{D}^\dagger(\sqrt{T}z+\sqrt{1-T}\alpha)_E\right] \tag{S59}\\
&= T^{-1}e^{-\frac{1-T}{2T}|\alpha|^2}\hat{D}^\dagger(\alpha/\sqrt{T})_S, \tag{S60}
\end{align}
where Û_T is the beam splitter interaction with the environment with transmission rate T, and thus 1 − T is the loss rate. Here, we used the identity [1]
\begin{equation}
|0\rangle\langle 0| = \int\frac{d^{2n}z}{\pi^n}e^{-\frac{1}{2}|z|^2}\hat{D}^\dagger(z). \tag{S61}
\end{equation}
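Recall that the input state consists of two-mode squeezed states with a finite squeezing parameter, which is written as
\begin{equation}
|\tilde\Psi(r)\rangle\langle\tilde\Psi(r)| = \frac{1}{\pi^{2n}}\int d^{2n}\omega_1 d^{2n}\omega_2\, g(\omega_1,\omega_2,r)\hat{D}^\dagger(\omega_1)\otimes\hat{D}^\dagger(\omega_2). \tag{S62}
\end{equation}
Let us first study how the displacement operators transform under the channels. After a loss channel with loss rate 1 − T_b, the random displacement channel Λ, and another loss channel with loss rate 1 − T_a, the n-mode displacement operators transform as
\begin{align}
&\mathcal{L}^{T_a}_{AB}(\mathcal{I}_A\otimes\Lambda_B)\mathcal{L}^{T_b}_{AB}(\hat{D}^\dagger(\omega_1)_A\otimes\hat{D}^\dagger(\omega_2)_B) \tag{S63}\\
&= T_b^{-2n}e^{-\frac{1-T_b}{2T_b}(|\omega_1|^2+|\omega_2|^2)}\mathcal{L}^{T_a}_{AB}(\mathcal{I}_A\otimes\Lambda_B)(\hat{D}^\dagger(\omega_1/\sqrt{T_b})_A\otimes\hat{D}^\dagger(\omega_2/\sqrt{T_b})_B) \tag{S64}\\
&= T_b^{-2n}e^{-\frac{1-T_b}{2T_b}(|\omega_1|^2+|\omega_2|^2)}\lambda(\omega_2/\sqrt{T_b})\mathcal{L}^{T_a}_{AB}(\hat{D}^\dagger(\omega_1/\sqrt{T_b})_A\otimes\hat{D}^\dagger(\omega_2/\sqrt{T_b})_B) \tag{S65}\\
&= (T_bT_a)^{-2n}e^{-\frac{1-T_b}{2T_b}(|\omega_1|^2+|\omega_2|^2)}e^{-\frac{1-T_a}{2T_aT_b}(|\omega_1|^2+|\omega_2|^2)}\lambda(\omega_2/\sqrt{T_b})(\hat{D}^\dagger(\omega_1/\sqrt{T_bT_a})_A\otimes\hat{D}^\dagger(\omega_2/\sqrt{T_bT_a})_B). \tag{S66}
\end{align}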
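Now we implement the regularized Bell measurement. Recall that the perfect Bell measurement can be conducted by applying a 50:50 beam splitter and then performing homodyne detection. Here, we regularize the homodyne detection by general-dyne detection and trace out some quadratures. After the 50:50 beam splitters Û_BS, the displacement operators transform as
\begin{equation}
\hat{D}^\dagger(\omega_1/\sqrt{T_bT_a})_A\otimes\hat{D}^\dagger(\omega_2/\sqrt{T_bT_a})_B \to \hat{D}^\dagger\left(\frac{\omega_1-\omega_2}{\sqrt{2T_bT_a}}\right)_A\otimes\hat{D}^\dagger\left(\frac{\omega_1+\omega_2}{\sqrt{2T_bT_a}}\right)_B. \tag{S67}
\end{equation}
We then perform measurements described by the following POVM:
\begin{equation}
\left\{\hat\Pi^A_\alpha(-s)\otimes\hat\Pi^B_\beta(s)\right\}_{\alpha,\beta} \quad\text{where}\quad \hat\Pi_\gamma(s) := \frac{1}{\pi^n}\hat{D}(\gamma)|s\rangle\langle s|\hat{D}^\dagger(\gamma), \tag{S68}
\end{equation}
where s ≥ 0 is the squeezing parameter for the Bell measurement. We note that this measurement corresponds to a special type of general-dyne measurement [3] and that when s → ∞, we recover the Bell measurement studied in Sec. S2 A. Then, the output probability is written as
\begin{equation}
q(\alpha,\beta) := \frac{1}{\pi^{2n}}\int d^{2n}\omega_1 d^{2n}\omega_2\, g(\omega_1,\omega_2,r)\,\mathrm{Tr}\left[\hat\Pi^A_\alpha(-s)\otimes\hat\Pi^B_\beta(s)\,\hat{U}_{\mathrm{BS}}\mathcal{L}^{T_a}_{AB}(\mathcal{I}_A\otimes\Lambda_B)\mathcal{L}^{T_b}_{AB}(\hat{D}^\dagger(\omega_1)_A\otimes\hat{D}^\dagger(\omega_2)_B)\hat{U}_{\mathrm{BS}}^\dagger\right]. \tag{S69}
\end{equation}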

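Here, we have
\begin{align}
\mathrm{Tr}\left[\hat\Pi_\alpha(-s)\hat{D}^\dagger(\omega)\right] &= \frac{1}{\pi^n}\langle -s|\hat{D}^\dagger(\alpha)\hat{D}^\dagger(\omega)\hat{D}(\alpha)|-s\rangle = \frac{1}{\pi^n}e^{-\frac{1}{2}\left(|\omega|^2\cosh 2s+(\omega^2+\omega^{*2})\sinh 2s/2\right)}e^{\omega^\dagger\alpha-\alpha^\dagger\omega}, \tag{S70}\\
\mathrm{Tr}\left[\hat\Pi_\beta(s)\hat{D}^\dagger(\omega)\right] &= \frac{1}{\pi^n}e^{-\frac{1}{2}\left(|\omega|^2\cosh 2s-(\omega^2+\omega^{*2})\sinh 2s/2\right)}e^{\omega^\dagger\beta-\beta^\dagger\omega}, \tag{S71}
\end{align}
and
\begin{align}
&\mathrm{Tr}\left[\hat\Pi_\alpha(-s)\hat{D}^\dagger\left(\frac{\omega_1-\omega_2}{\sqrt{2T_aT_b}}\right)\right]\mathrm{Tr}\left[\hat\Pi_\beta(s)\hat{D}^\dagger\left(\frac{\omega_1+\omega_2}{\sqrt{2T_aT_b}}\right)\right] \tag{S72}\\
&= \frac{1}{\pi^{2n}}e^{-\frac{1}{2}\left[(|\omega_1|^2+|\omega_2|^2)\frac{\cosh 2s}{T_aT_b}-(\omega_1\cdot\omega_2+\omega_1^*\cdot\omega_2^*)\frac{\sinh 2s}{T_aT_b}\right]}\exp\left[\omega_1^*\cdot\frac{\alpha+\beta}{\sqrt{2T_aT_b}}+\omega_2^*\cdot\frac{\beta-\alpha}{\sqrt{2T_aT_b}}-\mathrm{c.c.}\right] \tag{S73}\\
&= \frac{1}{\pi^{2n}}g(\omega_1/\sqrt{T_aT_b},\omega_2/\sqrt{T_aT_b},s)\exp\left[\omega_1^\dagger\frac{\alpha+\beta}{\sqrt{2T_aT_b}}+\omega_2^\dagger\frac{\beta-\alpha}{\sqrt{2T_aT_b}}-\mathrm{c.c.}\right]. \tag{S74}
\end{align}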

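We now trace out one of the two quadratures for each mode. To do that, we integrate over the imaginary part of α and the real part of β. Here, we define α := α_r + iα_i and β := β_r + iβ_i. After the integration, the relevant parts reduce to
\begin{align}
\frac{1}{\pi^n}\int d^n\alpha_i\exp\left[\alpha\cdot\frac{\omega_1^*-\omega_2^*}{\sqrt{2T_aT_b}}-\alpha^*\cdot\frac{\omega_1-\omega_2}{\sqrt{2T_aT_b}}\right] &= \delta\left(\mathrm{Re}(\omega_1-\omega_2)/\sqrt{2T_aT_b}\right)e^{-i\sqrt{2}\,\mathrm{Im}(\omega_1-\omega_2)\cdot\alpha_r/\sqrt{T_aT_b}}, \tag{S75}\\
\frac{1}{\pi^n}\int d^n\beta_r\exp\left[\beta\cdot\frac{\omega_1^*+\omega_2^*}{\sqrt{2T_aT_b}}-\beta^*\cdot\frac{\omega_1+\omega_2}{\sqrt{2T_aT_b}}\right] &= \delta\left(\mathrm{Im}(\omega_1+\omega_2)/\sqrt{2T_aT_b}\right)e^{i\sqrt{2}\,\mathrm{Re}(\omega_1+\omega_2)\cdot\beta_i/\sqrt{T_aT_b}}, \tag{S76}
\end{align}
where the delta functions give us ω2 = ω1*. Thus,
\begin{align}
&\int d^n\alpha_i\, d^n\beta_r\,\mathrm{Tr}\left[\hat\Pi_\alpha(-s)\hat{D}^\dagger\left(\frac{\omega_1-\omega_2}{\sqrt{2T_aT_b}}\right)\right]\mathrm{Tr}\left[\hat\Pi_\beta(s)\hat{D}^\dagger\left(\frac{\omega_1+\omega_2}{\sqrt{2T_aT_b}}\right)\right] \tag{S77}\\
&= g(\omega_1/\sqrt{T_aT_b},\omega_2/\sqrt{T_aT_b},s)\,\delta\left(\frac{\omega_1-\omega_2^*}{\sqrt{2T_aT_b}}\right)e^{-i\sqrt{2}\,\mathrm{Im}(\omega_1-\omega_2)\cdot\alpha_r/\sqrt{T_aT_b}}e^{i\sqrt{2}\,\mathrm{Re}(\omega_1+\omega_2)\cdot\beta_i/\sqrt{T_aT_b}}. \tag{S78}
\end{align}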

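Thus, by defining the measurement output variable as ξ = −α_r + iβ_i, the output probability is given by
\begin{align}
q(\xi) &= \int d^n\alpha_i\, d^n\beta_r\, q(\alpha,\beta) \tag{S79, S80}\\
&= \frac{1}{\pi^{2n}}\int d^n\alpha_i\, d^n\beta_r\, d^{2n}\omega_1 d^{2n}\omega_2\,(T_bT_a)^{-2n}e^{-\left(\frac{1-T_b}{2T_b}+\frac{1-T_a}{2T_aT_b}\right)(|\omega_1|^2+|\omega_2|^2)}g(\omega_1,\omega_2,r)\lambda(\omega_2/\sqrt{T_b}) \nonumber\\
&\qquad\times\mathrm{Tr}\left[\hat\Pi_\alpha(-s)\hat{D}^\dagger\left(\frac{\omega_1-\omega_2}{\sqrt{2T_aT_b}}\right)\right]\mathrm{Tr}\left[\hat\Pi_\beta(s)\hat{D}^\dagger\left(\frac{\omega_1+\omega_2}{\sqrt{2T_aT_b}}\right)\right] \tag{S81}\\
&= \frac{1}{\pi^{2n}}\int d^{2n}\omega_1 d^{2n}\omega_2\,(T_bT_a)^{-2n}e^{-\left(\frac{1-T_b}{2T_b}+\frac{1-T_a}{2T_aT_b}\right)(|\omega_1|^2+|\omega_2|^2)}\lambda(\omega_2/\sqrt{T_b})g(\omega_1,\omega_2,r)g(\omega_1/\sqrt{T_aT_b},\omega_2/\sqrt{T_aT_b},s) \nonumber\\
&\qquad\times\delta\left(\frac{\omega_1-\omega_2^*}{\sqrt{2T_aT_b}}\right)e^{-i\sqrt{2}\,\mathrm{Im}(\omega_1-\omega_2)\cdot\alpha_r/\sqrt{T_aT_b}}e^{i\sqrt{2}\,\mathrm{Re}(\omega_1+\omega_2)\cdot\beta_i/\sqrt{T_aT_b}} \tag{S82}\\
&= \left(\frac{2}{T_aT_b}\right)^n\frac{1}{\pi^{2n}}\int d^{2n}\omega\, e^{-\left(\frac{1-T_b}{T_b}+\frac{1-T_a}{T_aT_b}\right)|\omega|^2}\lambda(\omega^*/\sqrt{T_b})g(\omega,\omega^*,r)g(\omega/\sqrt{T_aT_b},\omega^*/\sqrt{T_aT_b},s)e^{\sqrt{2}(\xi\cdot\omega-\omega^*\cdot\xi^*)/\sqrt{T_aT_b}} \tag{S83}\\
&= \left(\frac{2}{T_aT_b}\right)^n\frac{1}{\pi^{2n}}\int d^{2n}\omega\, e^{-\left(\frac{1-T_b}{T_b}+\frac{1-T_a}{T_aT_b}\right)|\omega|^2}\lambda(\omega^*/\sqrt{T_b})e^{-(e^{-2r}+e^{-2s}/T_aT_b)|\omega|^2}e^{\sqrt{2}(\xi\cdot\omega-\omega^*\cdot\xi^*)/\sqrt{T_aT_b}} \tag{S84}\\
&= \left(\frac{2}{T_a}\right)^n\frac{1}{\pi^{2n}}\int d^{2n}\omega\, e^{-\left((1-T_b)+\frac{1-T_a}{T_a}\right)|\omega|^2}\lambda(\omega^*)e^{-(T_be^{-2r}+e^{-2s}/T_a)|\omega|^2}e^{\sqrt{2}(\xi\cdot\omega-\omega^*\cdot\xi^*)/\sqrt{T_a}}, \tag{S85}
\end{align}
where the last line follows from the change of variables ω → √T_b ω.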
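Here, for consistency with Sec. S2 A, we rescale √2 ξ = ζ and define p_loss(ζ) such that
\begin{equation}
p_{\mathrm{loss}}(\zeta) = \left(\frac{1}{T_a}\right)^n\frac{1}{\pi^{2n}}\int d^{2n}\omega\, e^{-\left((1-T_b)+\frac{1-T_a}{T_a}\right)|\omega|^2}\lambda(\omega^*)e^{-(T_be^{-2r}+e^{-2s}/T_a)|\omega|^2}e^{(\omega\cdot\zeta-\omega^*\cdot\zeta^*)/\sqrt{T_a}}, \tag{S86}
\end{equation}
where the factor 2^n is canceled because of the relation 2^n d^{2n}ξ = d^{2n}ζ (this rescaling is needed because the convention for the Bell measurement outcome ζ in Sec. S2 A differs from ξ in this section by a factor of √2). Therefore, after Fourier transformation, we obtain
\begin{equation}
\int d^{2n}\zeta\, p_{\mathrm{loss}}(\zeta)e^{(\zeta^\dagger\beta-\beta^\dagger\zeta)/\sqrt{T_a}} = \lambda(\beta)e^{-(T_be^{-2r}+e^{-2s}/T_a)|\beta|^2}e^{-(1-T_b)|\beta|^2}e^{-\frac{1-T_a}{T_a}|\beta|^2}. \tag{S87}
\end{equation}
Consequently, the characteristic function is written in terms of the probability distribution as
\begin{align}
\lambda(\beta) &= e^{(T_be^{-2r}+e^{-2s}/T_a)|\beta|^2}e^{(1-T_b)|\beta|^2}e^{\frac{1-T_a}{T_a}|\beta|^2}\int d^{2n}\zeta\, p_{\mathrm{loss}}(\zeta)e^{(\zeta^\dagger\beta-\beta^\dagger\zeta)/\sqrt{T_a}} \tag{S88}\\
&= e^{e^{-2r_{\mathrm{eff}}}|\beta|^2}\int d^{2n}\zeta\, p_{\mathrm{loss}}(\zeta)e^{(\zeta^\dagger\beta-\beta^\dagger\zeta)/\sqrt{T_a}}, \tag{S89}
\end{align}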
where we defined an effective squeezing parameter r_eff accounting for all imperfections via
\begin{equation}
e^{-2r_{\mathrm{eff}}} := (T_be^{-2r}+T_a^{-1}e^{-2s})+(1-T_b)+\frac{1-T_a}{T_a}. \tag{S90}
\end{equation}
In order to estimate any λ(β), one simply obtains N samples $\{\zeta^{(i)}\}_{i=1}^N$ from p_loss(ζ) and sets the estimator to be $\tilde\lambda(\beta) := \frac{1}{N}\sum_{i=1}^N e^{e^{-2r_{\mathrm{eff}}}|\beta|^2}e^{(\zeta^{(i)\dagger}\beta-\beta^\dagger\zeta^{(i)})/\sqrt{T_a}}$. By Hoeffding's inequality, as in the ideal case, averaging over $N\ge\frac{8}{\epsilon^2}\log\frac{4}{\delta}\,e^{2e^{-2r_{\mathrm{eff}}}|\beta|^2}$ copies is sufficient to estimate λ(β) to additive error ϵ with probability at least 1 − δ.
The effect of the imperfections thus appears as an envelope multiplying the Fourier transform. In particular, when s → ∞, the effective squeezing parameter under loss is given by
\begin{equation}
r_{\mathrm{eff}} = -\frac{1}{2}\log\left[T_be^{-2r}+(1-T_b)+\frac{1-T_a}{T_a}\right]. \tag{S91}
\end{equation}
This completes the proof of Theorem 3 in the main text.

To be more explicit, for photon loss before the channel without any other imperfections, i.e., s → ∞ and T_a = 1, the envelope is given by
\begin{equation}
e^{[T_be^{-2r}+(1-T_b)]|\beta|^2}, \tag{S92}
\end{equation}
and for photon loss after the channel without other imperfections, the envelope is given by
\begin{equation}
e^{[e^{-2r}+(1-T_a)/T_a]|\beta|^2}. \tag{S93}
\end{equation}
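The effective squeezing parameter of Eq. (S90) is easy to tabulate. The sketch below is a direct transcription of that definition; the function name and default arguments are illustrative assumptions. It shows, for example, that with loss before the channel the effective squeezing saturates at −½ log(1 − T_b) no matter how large r is.

```python
import math

def effective_squeezing(r, T_b=1.0, T_a=1.0, s=math.inf):
    """r_eff from Eq. (S90); s = inf corresponds to the ideal Bell measurement, Eq. (S91)."""
    envelope = (T_b * math.exp(-2.0 * r) + math.exp(-2.0 * s) / T_a
                + (1.0 - T_b) + (1.0 - T_a) / T_a)
    return -0.5 * math.log(envelope)

r_ideal = effective_squeezing(1.5)                        # no loss: r_eff = r
r_lossy = effective_squeezing(1.5, T_b=0.9)               # 10% loss before the channel
r_saturated = effective_squeezing(math.inf, T_b=0.9)      # infinite input squeezing
```

The sample-complexity exponent in the Hoeffding bound is then 2 e^{−2 r_eff} |β|², so any loss puts a floor on the exponent that additional squeezing cannot remove.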

D. Discussion on more general input states

In this section, we study more general sources of noise other than finite squeezing and photon loss. To begin with, consider the case where we use an arbitrary input state while the CV Bell measurement is still employed. To this end, note that the derivation of Eq. (S44) in Sec. S2 A does not use any special properties of the TMSV state, and actually holds for any 2n-mode input state, i.e.,
\begin{equation}
\lambda(\beta) = \frac{1}{g_{\hat\rho}(\beta^*,\beta)}\int d^{2n}\zeta\, p_{\mathrm{EA}}(\zeta)e^{\zeta^\dagger\beta-\beta^\dagger\zeta}, \tag{S94}
\end{equation}
where g_ρ̂ is the characteristic function of the input state ρ̂:
\begin{equation}
g_{\hat\rho}(\omega_1,\omega_2) = \mathrm{Tr}\left[\hat\rho\,\hat{D}(\omega_1)\otimes\hat{D}(\omega_2)\right]. \tag{S95}
\end{equation}
The fact that the same relation holds upon replacing the g function appropriately indicates that for different types of input states, we still have a very similar form of an unbiased estimator for N samples:
\begin{equation}
\tilde\lambda(\beta) = \frac{1}{N}\sum_{i=1}^N\frac{1}{g_{\hat\rho}(\beta^*,\beta)}e^{\zeta^{(i)\dagger}\beta-\beta^\dagger\zeta^{(i)}}. \tag{S96}
\end{equation}
This implies that the sample complexity is determined by the function g_ρ̂(β*, β). More specifically, by the Hoeffding inequality, the number of samples to achieve an error ϵ with probability at least 1 − δ is given by
\begin{equation}
N = O(|g_{\hat\rho}(\beta^*,\beta)|^{-2}\epsilon^{-2}\log\delta^{-1}). \tag{S97}
\end{equation}
For example, for TMSV states, this function reduces to
\begin{equation}
g_{\tilde\Psi}(\beta^*,\beta) = \exp\left(-e^{-2r}|\beta|^2\right). \tag{S98}
\end{equation}
Therefore, as long as the function g of the input state is sufficiently large for the β of interest, we can still expect the scheme to be sample efficient.
Such a general form enables us to analyze the effect of general noise on input states. Let us again focus on TMSV states but assume a noise channel N. Then, the characteristic function g of the noisy TMSV states can be written as
\begin{equation}
g_{\mathcal{N}(\tilde\Psi)}(\omega_1,\omega_2) = \mathrm{Tr}\left[\mathcal{N}(|\tilde\Psi(r)\rangle\langle\tilde\Psi(r)|)\hat{D}(\omega_1)\otimes\hat{D}(\omega_2)\right]. \tag{S99}
\end{equation}

FIG. S1. Effect of phase diffusion on the squared characteristic function $|g_{\mathcal{N}_\Delta(\hat\rho)}(\beta^*,\beta)|^2$, where we choose a specific form of β ∈ C^n as (a) β := (|β|/√n, . . . , |β|/√n) and (b) β := (|β|, 0, . . . , 0) for two extreme cases with a given |β|². We fix the squeezing parameter r = 1.5 of the input TMSV states and set the number of modes n = 50. For a standard deviation ∆ = 1° of the phase noise, which follows a Gaussian distribution, the characteristic function is almost identical to the noiseless case (∆ = 0°). For ∆ = 2° as well, the effect is not very significant.

As discussed, it suffices to analyze how the characteristic function changes under noise to ensure that a significant advantage is still maintained for noisy states. In typical experiments, while the CV Bell measurement noise can be modeled by photon loss, as we considered already in the previous section, other types of noise may exist in the TMSV state preparation procedure. An example is phase diffusion, which can be modeled by a photon-number-dependent random phase following a Gaussian distribution. For a 2n-mode input state, we can write the noise channel as
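\begin{align}
\mathcal{N}_\Delta(\hat\rho) &= \int d^n\phi_A\, d^n\phi_B\,\frac{e^{-\frac{|\phi_A|^2+|\phi_B|^2}{2\Delta^2}}}{(2\pi\Delta^2)^n}e^{-i\phi_A\cdot\hat{n}_A-i\phi_B\cdot\hat{n}_B}\hat\rho\, e^{i\phi_A\cdot\hat{n}_A+i\phi_B\cdot\hat{n}_B} \tag{S100, S101}\\
&= \frac{1}{\pi^{2n}}\int d^{2n}\omega_1 d^{2n}\omega_2\, g_{\hat\rho}(\omega_1,\omega_2)\int d^n\phi_A\, d^n\phi_B\,\frac{e^{-\frac{|\phi_A|^2+|\phi_B|^2}{2\Delta^2}}}{(2\pi\Delta^2)^n}e^{-i\phi_A\cdot\hat{n}_A-i\phi_B\cdot\hat{n}_B}[\hat{D}^\dagger(\omega_1)\otimes\hat{D}^\dagger(\omega_2)]e^{i\phi_A\cdot\hat{n}_A+i\phi_B\cdot\hat{n}_B}, \tag{S102}
\end{align}
where ϕ_A and ϕ_B are n-dimensional real vectors and n̂_A and n̂_B are photon number operator vectors for the A and B parts, respectively. Then, noting that
\begin{equation}
e^{-i\phi_A\cdot\hat{n}_A-i\phi_B\cdot\hat{n}_B}[\hat{D}^\dagger(\omega_1)\otimes\hat{D}^\dagger(\omega_2)]e^{i\phi_A\cdot\hat{n}_A+i\phi_B\cdot\hat{n}_B} = \hat{D}^\dagger(\omega_1e^{-i\phi_A})\otimes\hat{D}^\dagger(\omega_2e^{-i\phi_B}), \tag{S103}
\end{equation}
where ω1 e^{−iϕ_A} and ω2 e^{−iϕ_B} are interpreted as vectors obtained by an elementwise product, the corresponding g function for TMSV states is written as
\begin{align}
g_{\mathcal{N}(\tilde\Psi)}(\omega_1,\omega_2) &= \mathrm{Tr}\left[\mathcal{N}_\Delta(|\tilde\Psi(r)\rangle\langle\tilde\Psi(r)|)\hat{D}(\omega_1)\otimes\hat{D}(\omega_2)\right] \tag{S104, S105}\\
&= \frac{1}{\pi^{2n}}\int d^{2n}\beta_1 d^{2n}\beta_2\, g_{\tilde\Psi}(\beta_1,\beta_2)\int d^n\phi_A\, d^n\phi_B\,\frac{e^{-\frac{|\phi_A|^2+|\phi_B|^2}{2\Delta^2}}}{(2\pi\Delta^2)^n}\mathrm{Tr}\left[\hat{D}^\dagger(\beta_1e^{-i\phi_A})\otimes\hat{D}^\dagger(\beta_2e^{-i\phi_B})\hat{D}(\omega_1)\otimes\hat{D}(\omega_2)\right] \tag{S106}\\
&= \int d^{2n}\beta_1 d^{2n}\beta_2\, g_{\tilde\Psi}(\beta_1,\beta_2)\int d^n\phi_A\, d^n\phi_B\,\frac{e^{-\frac{|\phi_A|^2+|\phi_B|^2}{2\Delta^2}}}{(2\pi\Delta^2)^n}\delta(\omega_1-\beta_1e^{-i\phi_A})\delta(\omega_2-\beta_2e^{-i\phi_B}) \tag{S107}\\
&= \int d^n\phi_A\, d^n\phi_B\,\frac{e^{-\frac{|\phi_A|^2+|\phi_B|^2}{2\Delta^2}}}{(2\pi\Delta^2)^n}g_{\tilde\Psi}(\omega_1e^{i\phi_A},\omega_2e^{i\phi_B}). \tag{S108}
\end{align}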
Thus, the effect of phase noise is to transform the g function into a mixture over random phases. We present examples to illustrate the effect of the phase noise on the sample complexity in Fig. S1. We have chosen the parameters β ∈ C^n of two extreme cases as β := (|β|/√n, . . . , |β|/√n) and β := (|β|, 0, . . . , 0). Recall that the typical choice of the regime of interest in the main text is |β|² ≤ κn; here, the range |β|² ∈ [0, 130] in the figure covers up to κ = 2.5 for n = 50. We see that the advantage of the entanglement-assisted scheme appears robust against small phase-diffusion noise.
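The phase average in Eq. (S108) can be approximated by Monte Carlo sampling of the phases. The sketch below evaluates the TMSV characteristic function of Eq. (S26) at (ω1, ω2) = (β* e^{iϕ_A}, β e^{iϕ_B}) and averages over Gaussian phases with standard deviation ∆ (in radians). The number of draws, the random seed, and the function names are illustrative assumptions, and the code is not the procedure used to produce Fig. S1.

```python
import numpy as np

def g_tmsv(w1, w2, r):
    """TMSV characteristic function, Eq. (S26); the last axis indexes the n modes."""
    quad = np.sum(np.abs(w1) ** 2 + np.abs(w2) ** 2, axis=-1)
    cross = 2.0 * np.real(np.sum(w1 * w2, axis=-1))   # w1.w2 + w1*.w2*
    return np.exp(-0.5 * (quad * np.cosh(2 * r) - cross * np.sinh(2 * r)))

def g_phase_noise(beta, r, delta, n_draws, rng):
    """Monte Carlo estimate of Eq. (S108) at (w1, w2) = (beta^*, beta)."""
    n = beta.shape[0]
    phi_a = rng.normal(0.0, delta, (n_draws, n))
    phi_b = rng.normal(0.0, delta, (n_draws, n))
    w1 = beta.conj() * np.exp(1j * phi_a)             # elementwise phase rotation
    w2 = beta * np.exp(1j * phi_b)
    return g_tmsv(w1, w2, r).mean()

rng = np.random.default_rng(1)
n, r, beta_sq = 50, 1.5, 50.0
beta = np.full(n, np.sqrt(beta_sq / n) + 0.0j)        # case (a): evenly spread beta
g_clean = g_phase_noise(beta, r, 0.0, 1, rng)         # no phase noise
g_noisy = g_phase_noise(beta, r, np.deg2rad(2.0), 2000, rng)
```

Because each phase rotation can only reduce the cross term in Eq. (S26), every Monte Carlo draw is bounded by the noiseless value, so `g_noisy` cannot exceed `g_clean`; the ratio of the two quantifies the extra sampling overhead through Eq. (S97).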

S3. FUNDAMENTAL LIMITS FOR ENTANGLEMENT-FREE SCHEMES

In this section, we prove the fundamental limit on general entanglement-free schemes for learning n-mode random displacement channels. In this work, we focus on ancilla-free schemes without concatenation. This means that, for each copy of the channel, we apply it to some input state and perform a destructive POVM measurement right after. The input states and measurements can be adaptively chosen depending on previous measurement outcomes; see Fig. S2. Bounds for such schemes have been investigated in different tasks [4–6]. One can also study ancilla-free protocols with concatenation, lower bounds for which have been obtained in several recent works [6–8], but we leave that for future study, as continuous-variable systems add an additional level of complexity. Throughout this work, we assume entanglement-free schemes to have no concatenation.

FIG. S2. Schematics for entanglement-free schemes. In this work we assume no concatenation is allowed. Such a scheme can be completely specified by a collection of input states and POVM measurements that adaptively depend on the measurement outcomes from the previous rounds.

Theorem S1. Let Λ be an arbitrary n-mode random displacement channel (n ≥ 8) and consider
an entanglement-free scheme that uses N copies of Λ. After all measurements are completed, the
scheme receives the query β ∈ Cn and returns an estimate λ̃(β) of Λ’s characteristic function λ(β).
Suppose that, with success probability at least 2/3, |λ̃(β) − λ(β)| ≤ ϵ ≤ 0.24 for all β such that
|β|2 ≤ nκ. Then N ≥ 0.01ϵ−2 (1 + 1.98κ)n .

Recall that an entanglement-assisted scheme can achieve the same task using O(ϵ−2 ) copies of
channels given sufficient squeezing and κ = O(1). Therefore, we establish an exponential separation
between learning bosonic random displacement channels with and without entanglement. In the
following, we start proving this result in Sec. S3 A and present a core lemma in Sec. S3 B. We also
prove a bound for learning with Gaussian schemes in Sec. S3 C, which might be of independent
interest.
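For a rough feel of the separation, the following sketch compares the entanglement-free lower bound of Theorem S1 with the sufficient sample count of Eq. (S47) evaluated at the edge of the query region, |β|² = κn. Both are computed on a log10 scale to avoid overflow. The function names and the parameter values (n = 50, κ = 1, ϵ = 0.2, δ = 1/3, r = 1.5) are illustrative assumptions, and Eq. (S47) is applied per queried β.

```python
import math

def log10_lower_bound_ef(n, kappa, eps):
    """log10 of the Theorem S1 bound N >= 0.01 eps^-2 (1 + 1.98 kappa)^n."""
    return math.log10(0.01 / eps**2) + n * math.log10(1.0 + 1.98 * kappa)

def log10_sufficient_ea(n, kappa, eps, delta, r):
    """log10 of Eq. (S47) at |beta|^2 = kappa * n."""
    prefactor = 8.0 / eps**2 * math.log(4.0 / delta)
    return math.log10(prefactor) + 2.0 * math.exp(-2.0 * r) * kappa * n / math.log(10.0)

n, kappa, eps, delta, r = 50, 1.0, 0.2, 1.0 / 3.0, 1.5
ef_lower = log10_lower_bound_ef(n, kappa, eps)
ea_upper = log10_sufficient_ea(n, kappa, eps, delta, r)
orders_of_magnitude = ef_lower - ea_upper
```

For finite r the entanglement-assisted count still grows with n through e^{2e^{−2r}κn}, so the comparison is favorable only when e^{−2r} is small enough relative to log(1 + 1.98κ)/(2κ).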
Before proceeding, let us specify some regularization conditions. We will only work with proper (i.e., normalizable) vectors in the Hilbert space and bounded operators acting on the Hilbert space. That is to say, every quantum state we consider can be expressed as a density operator ρ̂ with trace 1, and every POVM element Ê is a bounded positive semi-definite operator satisfying Ê ≤ Î. Perhaps the most representative example that does not admit the above form is the perfect homodyne detection projector |x⟩⟨x|, which represents projection onto the quadrature x. While |x⟩ is not a proper vector in the relevant Hilbert space, it can be treated as a limit of proper vectors in any physical setting. Concretely, homodyne detection is implemented by applying a 50:50 beam splitter between the input state and a strong local oscillator [3], and the above improper projector |x⟩⟨x| is obtained by taking the limit where the power of the oscillator goes to infinity. Therefore, in a reasonable physical setup, the actual projector is constructed with a proper vector in the Hilbert space, thus satisfying our assumptions. We emphasize that our entanglement-assisted strategy also satisfies the same assumption, as we regularize the Bell measurements with general-dyne detection with a parameter s < ∞; see Sec. S2 C.

A. Lower bound for entanglement-free schemes

Given a positive integer n ≥ 8 and ϵ ≤ 0.24, we introduce a family of “3-peak” random displacement channels, defined by their characteristic functions,
\begin{equation}
\Lambda_\gamma:\quad \lambda_\gamma(\beta) := e^{-\frac{|\beta|^2}{2\sigma^2}}+2i\epsilon_0e^{-\frac{|\beta-\gamma|^2}{2\sigma^2}}-2i\epsilon_0e^{-\frac{|\beta+\gamma|^2}{2\sigma^2}}, \qquad \gamma\in\mathbb{C}^n, \tag{S109}
\end{equation}
where ϵ0 := ϵ/0.98 ≤ 0.25. The corresponding distributions of displacements, computed via Fourier transformation, are
\begin{equation}
\Lambda_\gamma:\quad p_\gamma(\alpha) = \left(\frac{2\sigma^2}{\pi}\right)^ne^{-2\sigma^2|\alpha|^2}\left(1+4\epsilon_0\sin\left(2(\mathrm{Im}[\gamma]\cdot\mathrm{Re}[\alpha]-\mathrm{Re}[\gamma]\cdot\mathrm{Im}[\alpha])\right)\right), \tag{S110}
\end{equation}
from which we see that the typical strength of displacement is of order 1/σ. Roughly, the smaller σ is, the larger the energy the channel carries. We define Λ_dep := Λ_0 as the CV analog of the depolarizing channel, and the other Λ_γ can be viewed as perturbed depolarizing channels. The set of 3-peak channels with parameters (ϵ, σ) is denoted as $\mathbf{\Lambda}^{\epsilon,\sigma}_{\text{3-peak}}$. With this, we are going to prove a strictly stronger result than Theorem S1. That is, even if one knows that the channel to be estimated is from the restricted family $\mathbf{\Lambda}^{\epsilon,\sigma}_{\text{3-peak}}$, an exponential lower bound still applies.
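The 3-peak family can be checked numerically for a single mode (n = 1): p_γ in Eq. (S110) should be a valid probability density whenever 4ϵ0 ≤ 1, and its Fourier transform in the sense of Eq. (S13) should reproduce λ_γ in Eq. (S109). The grid, the values σ = 0.5, γ = 0.6 + 0.3i, ϵ0 = 0.2, and the function names in this sketch are illustrative assumptions.

```python
import numpy as np

sigma, eps0, gamma = 0.5, 0.2, 0.6 + 0.3j    # illustrative single-mode parameters

def p_gamma(alpha):
    """Displacement distribution of the 3-peak channel, Eq. (S110) with n = 1."""
    theta = 2.0 * (gamma.imag * alpha.real - gamma.real * alpha.imag)
    envelope = (2.0 * sigma**2 / np.pi) * np.exp(-2.0 * sigma**2 * np.abs(alpha) ** 2)
    return envelope * (1.0 + 4.0 * eps0 * np.sin(theta))

def lam_gamma(beta):
    """Characteristic function of the 3-peak channel, Eq. (S109) with n = 1."""
    peak = lambda z: np.exp(-np.abs(z) ** 2 / (2.0 * sigma**2))
    return peak(beta) + 2j * eps0 * peak(beta - gamma) - 2j * eps0 * peak(beta + gamma)

x = np.linspace(-10.0, 10.0, 801)
dA = (x[1] - x[0]) ** 2
grid = x[None, :] + 1j * x[:, None]
p_vals = p_gamma(grid)

def lam_numeric(beta):
    """Riemann-sum version of Eq. (S13) applied to p_gamma."""
    kernel = np.exp(np.conj(grid) * beta - np.conj(beta) * grid)
    return np.sum(p_vals * kernel) * dA

total_probability = np.sum(p_vals) * dA
peak_error = abs(lam_numeric(gamma) - lam_gamma(gamma))    # query at the side peak
center_error = abs(lam_numeric(0.0) - lam_gamma(0.0))      # query at the origin
```

At the side peak β = γ the characteristic function picks up an imaginary part of size close to 2ϵ0, which is the signal an estimator must resolve to tell Λ_γ apart from Λ_dep.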

Theorem S2. Let n, σ, κ, ϵ be positive numbers such that
\begin{equation}
2\sigma^2\le\max\left\{1-1.98\kappa,\; 0.99\kappa\sqrt{1+(0.99\kappa)^{-2}}-1\right\}, \qquad n\ge 8, \qquad \epsilon\le 0.24. \tag{S111}
\end{equation}
If there exists an entanglement-free scheme that, after learning from N copies of an n-mode random displacement channel $\Lambda\in\mathbf{\Lambda}^{\epsilon,\sigma}_{\text{3-peak}}$ and then receiving a query β ∈ C^n, can return an estimate λ̃(β) of λ(β) such that |λ̃(β) − λ(β)| ≤ ϵ with probability at least 2/3 for all β such that |β|² ≤ nκ, then
\begin{equation}
N\ge 0.01\epsilon^{-2}\left(1+\frac{1.98\kappa}{1+2\sigma^2}\right)^n. \tag{S112}
\end{equation}
It is not hard to see that a σ > 0 satisfying the assumptions can always be found for any κ > 0.
Indeed, Theorem S1 follows from Theorem S2 by choosing σ → 0. Note that Theorem S2 does not
place any constraint on the input state and measurement. This means that learning a finite-energy
random displacement channel is hard without ancilla even given energy-unbounded input state and
measurement. Also, Theorem S2 enables an experimental test, as it only requires generating finite
displacement with high probability. The practical performance of this bound with σ = 0.3 is shown
in Fig. S3.
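To get a feel for the numbers in Theorem S2, the following minimal sketch checks the assumption in Eq. (S111) and evaluates the right-hand side of Eq. (S112). The function names `satisfies_s111` and `entanglement_free_lower_bound` are hypothetical helpers introduced here for illustration; the parameter values σ = 0.3, κ = 1 and ϵ = 0.2 are taken from the caption of Fig. S3.

```python
import math

def satisfies_s111(sigma, kappa):
    """Check the assumption 2*sigma^2 <= max{...} of Eq. (S111)."""
    c = 0.99 * kappa
    branch1 = 1.0 - 1.98 * kappa
    branch2 = c * (math.sqrt(1.0 + c ** -2) - 1.0)
    return 2.0 * sigma ** 2 <= max(branch1, branch2)

def entanglement_free_lower_bound(n, kappa, sigma, eps):
    """Right-hand side of Eq. (S112): a lower bound on the number of copies N."""
    base = 1.0 + 1.98 * kappa / (1.0 + 2.0 * sigma ** 2)
    return 0.01 * eps ** -2 * base ** n

sigma, kappa, eps = 0.3, 1.0, 0.2  # values used in Fig. S3
print("assumption (S111) holds:", satisfies_s111(sigma, kappa))
for n in (8, 16, 32, 64):
    print(n, entanglement_free_lower_bound(n, kappa, sigma, eps))
```

The bound grows geometrically in n with ratio 1 + 1.98κ/(1 + 2σ²), so doubling the number of modes roughly squares the n-dependent factor.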
Proof of Theorem S2. Now we introduce the following game between Alice and Bob that helps
reduce the learning task to a partially-revealed hypothesis testing task [9]. First, Alice samples
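$s \in \{\pm 1\}$ with equal probability and $\gamma \in \mathbb{C}^n$ according to the multivariate normal distribution $q(\gamma)$ defined as
\[
q(\gamma) := \left(\frac{1}{2\pi\sigma_\gamma^2}\right)^{n} e^{-\frac{|\gamma|^2}{2\sigma_\gamma^2}}, \tag{S113}
\]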

FIG. S3. Learning random displacement channels from the family with σ = 0.3 as in Theorem S2. (In the
main text, we set σ = 0.) All κ shown in the figure satisfy Eq. (S111). (a) Comparison of TMSV+BM (with
different loss rates), Vacuum+Heterodyne, and the entanglement-free lower bound at κ = 1. The task is to
estimate any λ(β) such that |β|2 ≤ κn with precision ε = 0.2 and success probability 1 − δ = 2/3. The orange
region represents a rigorous advantage over any entanglement-free schemes. The blue region represents an
advantage over Vacuum+heterodyne. (b) Comparison of the TMSV+BM scheme with squeezing parameter
r = 1.0 and loss rate 1 − T = 0.1 with the entanglement-free lower bound of Theorem 2. The task is the same
as (a). The brown solid contour lines represent the sample complexity of TMSV+BM given by Theorem 3.
The blue dashed contour lines represent the ratio of sample complexity between the entanglement-free lower
bound and TMSV+BM, which clearly indicates the entanglement-enabled advantages.

where we will set $2\sigma_\gamma^2 := 0.99\kappa$ to ensure that the tail probability, i.e., $\Pr[|\gamma|^2 > \kappa n]$, is sufficiently small. Next, Alice does one of the following with equal probability:

1. Prepare N copies of Λdep for Bob;

2. Prepare N copies of Λsγ for Bob.

Bob then measures the N copies of the channels Alice prepared. After Bob has finished the
measurements and retains only classical information, Alice reveals the value of γ to Bob. Now Bob
is asked to distinguish between the two hypotheses: whether Alice has prepared copies of Λdep or
Λsγ . Crucially, Bob must have completed all quantum measurements before Alice reveals γ, and
can only perform classical post-processing after that.
We first argue that if there is a scheme satisfying the assumptions of Theorem S2, then Bob can
use it to win the game with an average probability much better than random guess. Bob’s strategy
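is as follows: If the $\gamma$ he received satisfies $2\sigma^2 < |\gamma|^2 \le \kappa n$, use the scheme to query $\lambda(\gamma)$. Note that for any $\gamma \in \mathbb{C}^n$:
\[
\left|\lambda_{\rm dep}(\gamma) - \lambda_{\pm\gamma}(\gamma)\right| = \frac{1}{2}\left|\lambda_{\gamma}(\gamma) - \lambda_{-\gamma}(\gamma)\right| = 2\epsilon_0\left(1 - e^{-\frac{4|\gamma|^2}{2\sigma^2}}\right). \tag{S114}
\]
For $|\gamma|^2 > 2\sigma^2$, the exponential is smaller than $e^{-4} < 0.02$, so the R.H.S. is lower bounded by $2\epsilon_0 \times 0.98 = 2\epsilon$. By assumption, this allows Bob to distinguish among $\{\Lambda_{\rm dep}, \Lambda_\gamma, \Lambda_{-\gamma}\}$ and thus guess correctly with probability at least 2/3. For other $\gamma$, Bob just makes a uniformly random guess. Note that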

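\begin{align}
\Pr\left[2\sigma^2 < |\gamma|^2 \le \kappa n\right] &= 1 - \Pr\left[|\gamma|^2 > \kappa n\right] - \Pr\left[|\gamma|^2 \le 2\sigma^2\right] \tag{S115}\\
&= 1 - \frac{\Gamma(n, n/0.99)}{\Gamma(n)} - \left(1 - \frac{\Gamma(n, \sigma^2/\sigma_\gamma^2)}{\Gamma(n)}\right) \tag{S116}\\
&\ge \frac{1}{2} - \left(1 - \frac{\Gamma(n, \sigma^2/\sigma_\gamma^2)}{\Gamma(n)}\right) \tag{S117}\\
&= \frac{1}{2} - \frac{\int_0^{\sigma^2/\sigma_\gamma^2} t^{n-1} e^{-t}\, dt}{(n-1)!} \tag{S118}\\
&\ge 0.49987. \tag{S119}
\end{align}

The first inequality is shown in Sec. S4. The second inequality requires $n \ge 8$ and $2\sigma^2 \le 0.99\kappa = 2\sigma_\gamma^2$.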
Bob’s average success probability is lower bounded by

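\[
\Pr[\text{Success}] \ge \Pr\left[2\sigma^2 < |\gamma|^2 \le \kappa n\right] \times \frac{2}{3} + \left(1 - \Pr\left[2\sigma^2 < |\gamma|^2 \le \kappa n\right]\right) \times \frac{1}{2}. \tag{S120}
\]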

Now we investigate the probability distribution of Bob’s measurement outcomes for any γ. For
any adaptive entanglement-free strategy, one specifies an input state and a POVM for the ith
copy of $\Lambda$ that can depend on previous measurement outcomes. We denote the $i$th measurement outcome as $o_i$ and the outcomes before the $i$th round as $o_{<i} = [o_1, \dots, o_{i-1}]$. The latter is added as a superscript to the $i$th input state $\hat\rho^{o_{<i}}$ and POVM element $\hat E^{o_{<i}}_{o_i}$ to emphasize their adaptive nature. With these notations, the probability of obtaining outcomes $o_{1:N}$ on $N$ copies of $\Lambda$ is
\[
p(o_{1:N}|\Lambda) = \prod_{i=1}^{N} \operatorname{Tr}\left[\hat E^{o_{<i}}_{o_i}\, \Lambda\big(\hat\rho^{o_{<i}}\big)\right]. \tag{S121}
\]

For a fixed $\gamma$, let $p_1(o_{1:N}) := p(o_{1:N}|\Lambda_{\rm dep})$ and $p_{2,\gamma}(o_{1:N}) := \mathbb{E}_{s=\pm1}\, p(o_{1:N}|\Lambda_{s\gamma})$, which are the distributions of Bob’s outcomes under the two hypotheses, respectively, conditioned on the $\gamma$ he received. According to the property of total variation distance, the maximal probability that Bob can distinguish $p_1$ and $p_{2,\gamma}$ is bounded by
\[
\Pr[\text{Success}|\gamma] \le \frac{1}{2}\big(1 + \operatorname{TVD}(p_1, p_{2,\gamma})\big), \tag{S122}
\]
where TVD is the total variation distance defined as
\[
\operatorname{TVD}(p_1, p_{2,\gamma}) := \sum_{o_{1:N}} \max\left\{0,\ p_1(o_{1:N}) - p_{2,\gamma}(o_{1:N})\right\}. \tag{S123}
\]

We note that the sum over $o_{1:N}$ should be understood as an integral for continuous-variable outcomes. Thus, the average probability that Bob can win the game is upper bounded by
\[
\Pr[\text{Success}] = \mathbb{E}_{\gamma\sim q} \Pr[\text{Success}|\gamma] \le \frac{1}{2}\big(1 + \mathbb{E}_\gamma \operatorname{TVD}(p_1, p_{2,\gamma})\big). \tag{S124}
\]
Combining Eq. (S120) and Eq. (S124), we get

Eγ TVD(p1 , p2,γ ) ≥ 0.1666. (S125)

In the following, we show by direct calculation that this is impossible unless N is exponentially
large in n, which yields a desired lower bound for the sample complexity.
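The constants in Eqs. (S119) and (S125) can be reproduced numerically. The sketch below is an illustration under stated assumptions, not part of the original proof: for integer n it uses the standard identity Γ(n, x)/Γ(n) = e^{-x} Σ_{k<n} x^k/k!, and it uses the fact that combining Eq. (S120) with Eq. (S124) gives E_γ TVD ≥ p/3, where p is the window probability Pr[2σ² < |γ|² ≤ κn]. The helper names `upper_gamma_ratio` and `window_probability` are hypothetical.

```python
import math

def upper_gamma_ratio(n, x):
    """Gamma(n, x) / Gamma(n) for integer n >= 1.

    Uses Gamma(n, x)/Gamma(n) = exp(-x) * sum_{k<n} x^k / k!,
    summed in log space to avoid overflow for large n.
    """
    if x == 0.0:
        return 1.0
    logs = [-x + k * math.log(x) - math.lgamma(k + 1) for k in range(n)]
    m = max(logs)
    return math.exp(m) * sum(math.exp(v - m) for v in logs)

def window_probability(n, sigma, kappa):
    """Pr[2 sigma^2 < |gamma|^2 <= kappa n], with 2 sigma_gamma^2 = 0.99 kappa (Eq. S116)."""
    sigma_gamma_sq = 0.99 * kappa / 2.0
    tail = upper_gamma_ratio(n, n / 0.99)                            # Pr[|gamma|^2 > kappa n]
    core = 1.0 - upper_gamma_ratio(n, sigma ** 2 / sigma_gamma_sq)   # Pr[|gamma|^2 <= 2 sigma^2]
    return 1.0 - tail - core

p = window_probability(n=8, sigma=0.3, kappa=1.0)
tvd_threshold = p / 3.0  # from 1/2 + p/6 <= (1 + E TVD)/2
print(p, tvd_threshold)
```

With the worst-case value p = 0.49987 from Eq. (S119), the same arithmetic gives p/3 ≈ 0.1666, which is the threshold in Eq. (S125).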
Thanks to convexity, we can assume pure input states and rank-1 measurements without decreasing the TVD, i.e., the $k$th round’s input state and POVM are written as $|A^{o_{<k}}\rangle$ and $\{|B^{o_{<k}}_{o_k}\rangle\langle B^{o_{<k}}_{o_k}|\}_{o_k}$, which are conditioned on the previous measurement outcomes $o_{<k}$. Here, the input state has unit length and $\sum_{o_k} |B^{o_{<k}}_{o_k}\rangle\langle B^{o_{<k}}_{o_k}| = \mathbb{1}$. We note that since any density matrix is trace-class, a spectral decomposition always exists. On the other hand, a POVM element can be a non-compact operator and might not have a spectral decomposition, but it is known that it can always be decomposed into rank-1 projectors with positive coefficients (see [10, Theorem 6]). Thus, taking both the input state and the measurement projector to be rank-1 is indeed justified.
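Now, let us rewrite the probabilities as
\begin{align}
p_1(o_{1:N}) &= \prod_{k=1}^{N} \langle B^{o_{<k}}_{o_k}|\Lambda_{\rm dep}\big(|A^{o_{<k}}\rangle\langle A^{o_{<k}}|\big)|B^{o_{<k}}_{o_k}\rangle \tag{S126}\\
&= \prod_{k=1}^{N} \left[\frac{1}{\pi^n}\int d^{2n}\beta_k\, \lambda_{\rm dep}(\beta_k)\,\langle B^{o_{<k}}_{o_k}|\hat D^\dagger(\beta_k)|B^{o_{<k}}_{o_k}\rangle\langle A^{o_{<k}}|\hat D(\beta_k)|A^{o_{<k}}\rangle\right], \tag{S127}\\
p_{2,\gamma}(o_{1:N}) &= \mathbb{E}_{s=\pm1}\prod_{k=1}^{N} \langle B^{o_{<k}}_{o_k}|\Lambda_{s\gamma}\big(|A^{o_{<k}}\rangle\langle A^{o_{<k}}|\big)|B^{o_{<k}}_{o_k}\rangle \tag{S128}\\
&= \mathbb{E}_{s=\pm1}\prod_{k=1}^{N} \left[\frac{1}{\pi^n}\int d^{2n}\beta_k\, \lambda_{s\gamma}(\beta_k)\,\langle B^{o_{<k}}_{o_k}|\hat D^\dagger(\beta_k)|B^{o_{<k}}_{o_k}\rangle\langle A^{o_{<k}}|\hat D(\beta_k)|A^{o_{<k}}\rangle\right]. \tag{S129}
\end{align}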

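Let $\lambda^{\rm add}_{\gamma}(\beta_k) := \lambda_\gamma(\beta_k) - \lambda_{\rm dep}(\beta_k)$. The difference of the probabilities can then be written as
\begin{align}
&p_1(o_{1:N}) - p_{2,\gamma}(o_{1:N}) \tag{S130}\\
&= p_1(o_{1:N})\left(1 - \frac{p_{2,\gamma}(o_{1:N})}{p_1(o_{1:N})}\right) \tag{S131}\\
&= p_1(o_{1:N})\left(1 - \mathbb{E}_{s=\pm1}\prod_{k=1}^{N}\left(1 + \frac{\frac{1}{\pi^n}\int d^{2n}\beta_k\, \lambda^{\rm add}_{s\gamma}(\beta_k)\,\langle B^{o_{<k}}_{o_k}|\hat D^\dagger(\beta_k)|B^{o_{<k}}_{o_k}\rangle\langle A^{o_{<k}}|\hat D(\beta_k)|A^{o_{<k}}\rangle}{\frac{1}{\pi^n}\int d^{2n}\beta_k\, \lambda_{\rm dep}(\beta_k)\,\langle B^{o_{<k}}_{o_k}|\hat D^\dagger(\beta_k)|B^{o_{<k}}_{o_k}\rangle\langle A^{o_{<k}}|\hat D(\beta_k)|A^{o_{<k}}\rangle}\right)\right) \tag{S132}\\
&= p_1(o_{1:N})\left(1 - \mathbb{E}_{s=\pm1}\prod_{k=1}^{N}\left(1 - 4\epsilon_0 \operatorname{Im} G^{o_{\le k}}_{\sigma}(s\gamma)\right)\right), \tag{S133}
\end{align}
where we have defined
\[
G^{o_{\le k}}_{\sigma}(\gamma) := \frac{\int d^{2n}\beta\, e^{-\frac{|\beta-\gamma|^2}{2\sigma^2}}\, G^{o_{\le k}}(\beta)}{\int d^{2n}\beta'\, e^{-\frac{|\beta'|^2}{2\sigma^2}}\, G^{o_{\le k}}(\beta')}, \tag{S134}
\]
where
\[
G^{o_{\le k}}(\beta) := \frac{\langle B^{o_{<k}}_{o_k}|\hat D^\dagger(\beta)|B^{o_{<k}}_{o_k}\rangle}{\langle B^{o_{<k}}_{o_k}|B^{o_{<k}}_{o_k}\rangle}\cdot\langle A^{o_{<k}}|\hat D(\beta)|A^{o_{<k}}\rangle, \tag{S135}
\]
which satisfies $G^{o_{\le k}}(\gamma) = G^{o_{\le k}}(-\gamma)^*$.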

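We thus have
\[
\mathbb{E}_\gamma \operatorname{TVD}(p_1, p_{2,\gamma}) = \mathbb{E}_\gamma \sum_{o_{1:N}} p_1(o_{1:N}) \max\left\{0,\ 1 - \mathbb{E}_{s=\pm1}\prod_{k=1}^{N}\left(1 - 4\epsilon_0 \operatorname{Im} G^{o_{\le k}}_{\sigma}(s\gamma)\right)\right\}. \tag{S136}
\]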

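Now we can lower bound the following term,
\begin{align}
&\mathbb{E}_{s=\pm1}\prod_{k=1}^{N}\left(1 - 4\epsilon_0 \operatorname{Im} G^{o_{\le k}}_{\sigma}(s\gamma)\right) \tag{S137}\\
&\ge \prod_{k=1}^{N}\sqrt{\left(1 - 4\epsilon_0 \operatorname{Im} G^{o_{\le k}}_{\sigma}(+\gamma)\right)\left(1 - 4\epsilon_0 \operatorname{Im} G^{o_{\le k}}_{\sigma}(-\gamma)\right)} \tag{S138}\\
&= \prod_{k=1}^{N}\sqrt{1 - 16\epsilon_0^2\left(\operatorname{Im} G^{o_{\le k}}_{\sigma}(\gamma)\right)^2} \tag{S139}\\
&\ge \prod_{k=1}^{N}\left(1 - 16\epsilon_0^2\left(\operatorname{Im} G^{o_{\le k}}_{\sigma}(\gamma)\right)^2\right) \tag{S140}\\
&\ge 1 - \sum_{k=1}^{N} 16\epsilon_0^2\, \big|G^{o_{\le k}}_{\sigma}(\gamma)\big|^2, \tag{S141}
\end{align}
where the second line uses the AM-GM inequality and the fact that the expression inside the bracket is the ratio of two conditional probabilities and is thus non-negative; the third line uses the fact that $\operatorname{Im} G^{o_{\le k}}_{\sigma}(\gamma) = -\operatorname{Im} G^{o_{\le k}}_{\sigma}(-\gamma)$; the fourth line uses $\sqrt{1-x} \ge 1 - x$ for all $0 \le x \le 1$; and the final line uses the inequality $\prod_i (1 - x_i) \ge 1 - \sum_i x_i$ for all $0 \le x_i \le 1$. Thus, we can get rid of the maximum in the expression of the average TVD as
\[
\mathbb{E}_\gamma \operatorname{TVD}(p_1, p_{2,\gamma}) \le \sum_{o_{1:N}} p_1(o_{1:N}) \sum_{k=1}^{N} 16\epsilon_0^2\, \mathbb{E}_\gamma \big|G^{o_{\le k}}_{\sigma}(\gamma)\big|^2. \tag{S142}
\]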
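The chain of inequalities in Eqs. (S137) to (S141) can be sanity-checked numerically. The sketch below is only an illustration: it draws random real numbers a_k in [-1, 1] to stand in for Im G_σ(γ) (an assumption made here so that every factor 1 ± 4ϵ0 a_k is non-negative when ϵ0 ≤ 0.25), uses Im G_σ(−γ) = −a_k, and replaces |G_σ|² in Eq. (S141) by the smaller quantity a_k². The function name `chain_terms` is hypothetical.

```python
import math
import random

def chain_terms(a, eps0):
    """Return the quantities in (S137), (S139), (S140) and (S141) for stand-in values a_k."""
    plus = math.prod(1 - 4 * eps0 * x for x in a)    # s = +1
    minus = math.prod(1 + 4 * eps0 * x for x in a)   # s = -1, using Im G(-gamma) = -a_k
    avg = 0.5 * (plus + minus)                                        # (S137)
    geo = math.prod(math.sqrt(1 - 16 * eps0 ** 2 * x ** 2) for x in a)  # (S138) = (S139)
    prod = math.prod(1 - 16 * eps0 ** 2 * x ** 2 for x in a)          # (S140)
    lin = 1 - sum(16 * eps0 ** 2 * x ** 2 for x in a)                 # (S141) with |G|^2 -> a_k^2
    return avg, geo, prod, lin

random.seed(0)
eps0 = 0.24 / 0.98
a = [random.uniform(-1.0, 1.0) for _ in range(5)]
print(chain_terms(a, eps0))  # the four values are non-increasing from left to right
```

Since |G_σ|² ≥ (Im G_σ)², the true right-hand side of Eq. (S141) is no larger than the last value returned here, so the check is conservative.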

To further upper bound the R.H.S., we need the following Lemma S1. The lemma is analogous
to Pauli twirling in discrete-variable systems but also takes finite energy into consideration. The
proof of Lemma S1 is given in Sec. S3 B; Alternatively, when the input states and measurements are
restricted to be Gaussian, a more straightforward calculation is possible, yielding different bounds,
which we will present in Sec. S3 C.
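Lemma S1. For any $|A^{o_{<k}}\rangle$, $|B^{o_{<k}}_{o_k}\rangle$ we have
\[
\mathbb{E}_\gamma \big|G^{o_{\le k}}_{\sigma}(\gamma)\big|^2 \le \left(\frac{1 + 2\sigma^2}{1 + 2\sigma^2 + 4\sigma_\gamma^2}\right)^{n}, \tag{S143}
\]
given that $\sigma^2 \le \max\left\{\frac{1}{2} - 2\sigma_\gamma^2,\ \sigma_\gamma^2\left(\sqrt{1 + \frac{1}{4\sigma_\gamma^4}} - 1\right)\right\}$.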

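Thanks to Lemma S1, we get the following upper bound
\[
\mathbb{E}_\gamma \operatorname{TVD}(p_1, p_{2,\gamma}) \le \sum_{o_{1:N}} p_1(o_{1:N}) \sum_{k=1}^{N} 16\epsilon_0^2 \left(\frac{1 + 2\sigma^2}{1 + 2\sigma^2 + 4\sigma_\gamma^2}\right)^{n} = 16 N \epsilon_0^2 \left(\frac{1 + 2\sigma^2}{1 + 2\sigma^2 + 4\sigma_\gamma^2}\right)^{n}. \tag{S144}
\]

Combining this with the lower bound in Eq. (S125) and substituting $\epsilon = 0.98\epsilon_0$, so that the prefactor is $0.1666 \times 0.98^2/16 \ge 0.01$, we obtain
\[
N \ge 0.01\,\epsilon^{-2}\left(1 + \frac{4\sigma_\gamma^2}{1 + 2\sigma^2}\right)^{n}. \tag{S145}
\]

By substituting $2\sigma_\gamma^2 = 0.99\kappa$, we obtain the lower bound as claimed in Theorem S2.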
In Fig. S3, we compare the upper bound of the TMSV+BM scheme to the derived lower bound for entanglement-free schemes. In contrast to the main text, we set $\sigma = 0.3$ to consider a more practical case for experimental realization in the near future. To see how much energy is required to realize the 3-peak channel, one can easily check that for a given $\sigma$, the corresponding single-mode depolarizing channel $\Lambda_0$ transforms a vacuum input state to a thermal state of mean photon number $1/(2\sigma^2)$. Since this channel is a product channel, it implies that we need $1/(2\sigma^2)$ average photons per mode. For our choice $\sigma = 0.3$, $1/(2\sigma^2) \approx 5.56$. Since the envelope determined by $\sigma$ has a larger contribution than $\gamma$, which determines the oscillation, we are required to produce approximately $1/(2\sigma^2)$ photons per mode on average. It is worth emphasizing that for $\kappa \le 2.5$, this choice satisfies the condition of Theorem S2.

B. Proof of Lemma S1

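In this section we prove Lemma S1. Let $|A\rangle$, $|B\rangle$ be arbitrary normalized pure states in the $n$-mode bosonic Hilbert space. Define
\begin{align}
G(\beta) &:= \langle B|\hat D^\dagger(\beta)|B\rangle\langle A|\hat D(\beta)|A\rangle, \tag{S146}\\
G_\sigma(\gamma) &:= \frac{[N_\sigma * G](\gamma)}{[N_\sigma * G](0)} := \frac{\int d^{2n}\beta\, \exp\left(-|\beta - \gamma|^2/2\sigma^2\right) G(\beta)}{\int d^{2n}\beta\, \exp\left(-|\beta|^2/2\sigma^2\right) G(\beta)}. \tag{S147}
\end{align}
Here $*$ stands for convolution. We are going to prove the following inequality
\[
\mathbb{E}_\gamma |G_\sigma(\gamma)|^2 = \frac{\mathbb{E}_\gamma \big|[N_\sigma * G](\gamma)\big|^2}{\big|[N_\sigma * G](0)\big|^2} \le \left(\frac{1 + 2\sigma^2}{1 + 2\sigma^2 + 4\sigma_\gamma^2}\right)^{n}, \tag{S148}
\]
where $\gamma \sim q(\gamma) := \left(\frac{1}{2\pi\sigma_\gamma^2}\right)^{n} \exp\left(-\frac{|\gamma|^2}{2\sigma_\gamma^2}\right)$ and $\sigma^2 \le \max\left\{\frac{1}{2} - 2\sigma_\gamma^2,\ \sigma_\gamma^2\left(\sqrt{1 + \frac{1}{4\sigma_\gamma^4}} - 1\right)\right\}$.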

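First of all, write the following expression in the Fourier basis
\[
[N_\sigma * G](\gamma) = \frac{1}{\pi^{2n}}\int d^{2n}\omega\, e^{\omega^\dagger\gamma - \gamma^\dagger\omega}\, F_{[N_\sigma * G]}(\omega) = \frac{1}{\pi^{2n}}\int d^{2n}\omega\, e^{\omega^\dagger\gamma - \gamma^\dagger\omega}\, F_{N_\sigma}(\omega) F_G(\omega), \tag{S149}
\]
where the last equality uses the convolution theorem [11]. The Fourier component of $N_\sigma$ is
\[
F_{N_\sigma}(\omega) = \int d^{2n}\beta\, e^{-\frac{|\beta|^2}{2\sigma^2}}\, e^{\beta^\dagger\omega - \omega^\dagger\beta} = (2\pi\sigma^2)^n\, e^{-2\sigma^2|\omega|^2}. \tag{S150}
\]

The Fourier component of $G$ can be computed as
\begin{align}
F_G(\omega) &= \int d^{2n}\beta\, \langle B|\hat D^\dagger(\beta)|B\rangle\langle A|\hat D(\beta)|A\rangle\, e^{\beta^\dagger\omega - \omega^\dagger\beta} \tag{S151}\\
&= \int d^{2n}\beta\, \langle B|\hat D^\dagger(\beta)|B\rangle\langle A|\hat D^\dagger(\omega)\hat D(\beta)\hat D(\omega)|A\rangle \tag{S152}\\
&= \pi^n\, |\langle B|\hat D(\omega)|A\rangle|^2, \tag{S153}
\end{align}
where the second line uses $\hat D^\dagger(\omega)\hat D(\beta)\hat D(\omega) = e^{\beta^\dagger\omega - \omega^\dagger\beta}\hat D(\beta)$, and the last line is by Eq. (S1). Thus,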

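\begin{align}
&\big|[N_\sigma * G](\gamma)\big|^2 \tag{S154}\\
&= \left|\frac{1}{\pi^{2n}}\int d^{2n}\omega\, e^{\omega^\dagger\gamma - \gamma^\dagger\omega}\, F_{N_\sigma}(\omega) F_G(\omega)\right|^2 \tag{S155}\\
&= \frac{1}{\pi^{4n}}\int d^{2n}\omega\, d^{2n}\omega'\, e^{(\omega-\omega')^\dagger\gamma - \gamma^\dagger(\omega-\omega')}\, F_{N_\sigma}(\omega) F^*_{N_\sigma}(\omega') F_G(\omega) F^*_G(\omega') \tag{S156}\\
&= (2\sigma^2)^{2n}\int d^{2n}\omega\, d^{2n}\omega'\, e^{(\omega-\omega')^\dagger\gamma - \gamma^\dagger(\omega-\omega')}\, e^{-2\sigma^2(|\omega|^2 + |\omega'|^2)}\, \big|\langle B, B|\hat D(\omega)\otimes\hat D(\omega')|A, A\rangle\big|^2. \tag{S157}
\end{align}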
FIG. S4. Schematics for Eq. (S164) to (S169). Here, each line represents n-mode state, and we omit the
phase factors for simplicity.

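After averaging over the Gaussian distribution of $\gamma$, we obtain the numerator
\begin{align}
\mathbb{E}_\gamma \big|[N_\sigma * G](\gamma)\big|^2 &= (2\sigma^2)^{2n}\int d^{2n}\omega\, d^{2n}\omega'\, e^{-2\sigma_\gamma^2|\omega-\omega'|^2}\, e^{-2\sigma^2(|\omega|^2+|\omega'|^2)}\, \big|\langle B, B|\hat D(\omega)\otimes\hat D(\omega')|A, A\rangle\big|^2 \tag{S158}\\
&= (2\sigma^2)^{2n}\int d^{2n}\alpha\, d^{2n}\beta\, e^{-4\sigma_\gamma^2|\beta|^2}\, e^{-2\sigma^2(|\alpha|^2+|\beta|^2)}\, \big|\langle B, B|\hat U_{\rm BS}^\dagger\, \hat D(\alpha)\otimes\hat D(\beta)\, \hat U_{\rm BS}|A, A\rangle\big|^2. \tag{S159}
\end{align}

Here, we changed the variables as $\omega = (\alpha+\beta)/\sqrt{2}$ and $\omega' = (\alpha-\beta)/\sqrt{2}$, i.e., $(\omega+\omega')/\sqrt{2} = \alpha$ and $(\omega-\omega')/\sqrt{2} = \beta$, and chose the 50:50 beam splitter such that
\[
\hat U_{\rm BS}^\dagger\, \hat D(\alpha)\otimes\hat D(\beta)\, \hat U_{\rm BS} = \hat D\big((\alpha+\beta)/\sqrt{2}\big)\otimes\hat D\big((\alpha-\beta)/\sqrt{2}\big). \tag{S160}
\]

The denominator follows similarly from Eq. (S157) as
\begin{align}
\big|[N_\sigma * G](0)\big|^2 &= (2\sigma^2)^{2n}\int d^{2n}\omega\, d^{2n}\omega'\, e^{-2\sigma^2(|\omega|^2+|\omega'|^2)}\, \big|\langle B, B|\hat D(\omega)\otimes\hat D(\omega')|A, A\rangle\big|^2 \tag{S161}\\
&= (2\sigma^2)^{2n}\int d^{2n}\alpha\, d^{2n}\beta\, e^{-2\sigma^2(|\alpha|^2+|\beta|^2)}\, \big|\langle B, B|\hat U_{\rm BS}^\dagger\, \hat D(\alpha)\otimes\hat D(\beta)\, \hat U_{\rm BS}|A, A\rangle\big|^2. \tag{S162}
\end{align}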

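To further simplify the expressions, note that by applying the convolution theorem to Eq. (S151), we have
\[
|\langle B|\hat D(\alpha)|A\rangle|^2 = \frac{1}{\pi^n}\int d^{2n}\beta\, W_A(\beta)\, W_B(\beta - \alpha), \tag{S163}
\]
where $W_A$ and $W_B$ are the Wigner functions of the states $|A\rangle$ and $|B\rangle$, respectively. Here, note the sign in the arguments due to the complex conjugate of the characteristic function, $\langle B|\hat D^\dagger(\beta)|B\rangle$, in Eq. (S151). Thus, by defining the $2n$-mode states $|a\rangle := \hat U_{\rm BS}|A, A\rangle$ and $|b\rangle := \hat U_{\rm BS}|B, B\rangle$, we have
\begin{align}
&\int d^{2n}\alpha\, d^{2n}\beta\, e^{-4\sigma_\gamma^2|\beta|^2}\, e^{-2\sigma^2(|\alpha|^2+|\beta|^2)}\, \big|\langle B, B|\hat U_{\rm BS}^\dagger\, \hat D(\alpha)\otimes\hat D(\beta)\, \hat U_{\rm BS}|A, A\rangle\big|^2 \tag{S164}\\
&= \pi^{2n}\int d^{2n}\omega_1\, d^{2n}\omega_2\, d^{2n}\alpha\, d^{2n}\beta\, e^{-2\sigma^2|\alpha|^2}\, e^{-(4\sigma_\gamma^2+2\sigma^2)|\beta|^2}\, W_a(\omega_1, \omega_2)\, W_b(\omega_1 - \alpha, \omega_2 - \beta) \tag{S165}\\
&= \pi^{2n}\int d^{2n}\omega_1\, d^{2n}\omega_2\, d^{2n}\gamma_1\, d^{2n}\gamma_2\, e^{-2\sigma^2|\omega_1-\gamma_1|^2}\, e^{-(4\sigma_\gamma^2+2\sigma^2)|\omega_2-\gamma_2|^2}\, W_a(\omega_1, \omega_2)\, W_b(\gamma_1, \gamma_2) \tag{S166}\\
&= \pi^{2n}\int d^{2n}\alpha_1\, d^{2n}\alpha_2\, d^{2n}\beta_1\, d^{2n}\beta_2\, e^{-4\sigma^2|\alpha_1|^2}\, e^{-(8\sigma_\gamma^2+4\sigma^2)|\alpha_2|^2}\, W_a\!\left(\frac{\alpha_1+\beta_1}{\sqrt{2}}, \frac{\alpha_2+\beta_2}{\sqrt{2}}\right) W_b\!\left(\frac{\beta_1-\alpha_1}{\sqrt{2}}, \frac{\beta_2-\alpha_2}{\sqrt{2}}\right) \tag{S167}\\
&= \pi^{2n}\int d^{2n}\alpha_1\, d^{2n}\alpha_2\, e^{-4\sigma^2|\alpha_1|^2}\, e^{-(8\sigma_\gamma^2+4\sigma^2)|\alpha_2|^2}\, W_d(\alpha_1, \alpha_2) \tag{S168}\\
&= \pi^{2n}(1+2\sigma^2)^{-n}(1+2\sigma^2+4\sigma_\gamma^2)^{-n}\, \operatorname{Tr}\left[\hat\rho_d \left(\frac{1-2\sigma^2}{1+2\sigma^2}\right)^{\hat n_1}\otimes\left(\frac{1-2\sigma^2-4\sigma_\gamma^2}{1+2\sigma^2+4\sigma_\gamma^2}\right)^{\hat n_2}\right], \tag{S169}
\end{align}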
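where we used the convolution theorem for the first equality, and we changed the variables for the second and third equalities, and
\[
W_d(\alpha_1, \alpha_2) = \int d^{2n}\beta_1\, d^{2n}\beta_2\, W_a\!\left(\frac{\alpha_1+\beta_1}{\sqrt{2}}, \frac{\alpha_2+\beta_2}{\sqrt{2}}\right) W_b\!\left(\frac{\beta_1-\alpha_1}{\sqrt{2}}, \frac{\beta_2-\alpha_2}{\sqrt{2}}\right) \tag{S170}
\]
is the Wigner function of the state $\hat\rho_d$ obtained by applying a 50:50 beam splitter to the states $|a\rangle$ and $|b\rangle$ and tracing out half of the output. For the last equality, $\hat n_1$ and $\hat n_2$ are the sums of the photon number operators for the first and second $n$ modes, respectively, and we use the following correspondence between the Wigner function and the operator
\[
\frac{e^{-4x|\alpha|^2}}{\pi^n} \iff \frac{[(1-2x)/(1+2x)]^{\hat n}}{(1+2x)^n}, \tag{S171}
\]
for any $x > 0$ (see e.g. [12, Eq. (3.6.39)]). Note that, when $x < 1/2$, the base $(1-2x)/(1+2x)$ lies in $(0, 1)$ and the R.H.S. is proportional to a thermal state. Similar methods have been used to prove the maximum fidelity of Gaussian channels [13]. We illustrate the procedure in Fig. S4. With the same logic, we have
\begin{align}
&\int d^{2n}\alpha\, d^{2n}\beta\, e^{-2\sigma^2(|\alpha|^2+|\beta|^2)}\, \big|\langle B, B|\hat U_{\rm BS}^\dagger\, \hat D(\alpha)\otimes\hat D(\beta)\, \hat U_{\rm BS}|A, A\rangle\big|^2 \tag{S172}\\
&= \pi^{2n}(1+2\sigma^2)^{-2n}\, \operatorname{Tr}\left[\hat\rho_d \left(\frac{1-2\sigma^2}{1+2\sigma^2}\right)^{\hat n_1}\otimes\left(\frac{1-2\sigma^2}{1+2\sigma^2}\right)^{\hat n_2}\right]. \tag{S173}
\end{align}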
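Hence, we have
\[
\mathbb{E}_\gamma |G_\sigma(\gamma)|^2 = \left(\frac{1+2\sigma^2}{1+2\sigma^2+4\sigma_\gamma^2}\right)^{n} \frac{\operatorname{Tr}\left[\hat\rho_d \left(\frac{1-2\sigma^2}{1+2\sigma^2}\right)^{\hat n_1}\otimes\left(\frac{1-2\sigma^2-4\sigma_\gamma^2}{1+2\sigma^2+4\sigma_\gamma^2}\right)^{\hat n_2}\right]}{\operatorname{Tr}\left[\hat\rho_d \left(\frac{1-2\sigma^2}{1+2\sigma^2}\right)^{\hat n_1}\otimes\left(\frac{1-2\sigma^2}{1+2\sigma^2}\right)^{\hat n_2}\right]}. \tag{S174}
\]

We now consider two parameter regimes. First, if $2\sigma^2 + 4\sigma_\gamma^2 \le 1$, the operators on the R.H.S. are positive-semidefinite, and it is not hard to see, by monotonicity, that
\[
\frac{\operatorname{Tr}\left[\hat\rho_d \left(\frac{1-2\sigma^2}{1+2\sigma^2}\right)^{\hat n_1}\otimes\left(\frac{1-2\sigma^2-4\sigma_\gamma^2}{1+2\sigma^2+4\sigma_\gamma^2}\right)^{\hat n_2}\right]}{\operatorname{Tr}\left[\hat\rho_d \left(\frac{1-2\sigma^2}{1+2\sigma^2}\right)^{\hat n_1}\otimes\left(\frac{1-2\sigma^2}{1+2\sigma^2}\right)^{\hat n_2}\right]} \le 1. \tag{S175}
\]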

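Second, if $2\sigma^2 + 4\sigma_\gamma^2 > 1$ but $2\sigma^2 \le 2\sigma_\gamma^2\left(\sqrt{1 + \frac{1}{4\sigma_\gamma^4}} - 1\right) \le 1$ (the last inequality holds for all $\sigma_\gamma > 0$), the above can be bounded as
\begin{align}
\frac{\operatorname{Tr}\left[\hat\rho_d \left(\frac{1-2\sigma^2}{1+2\sigma^2}\right)^{\hat n_1}\otimes\left(\frac{1-2\sigma^2-4\sigma_\gamma^2}{1+2\sigma^2+4\sigma_\gamma^2}\right)^{\hat n_2}\right]}{\operatorname{Tr}\left[\hat\rho_d \left(\frac{1-2\sigma^2}{1+2\sigma^2}\right)^{\hat n_1}\otimes\left(\frac{1-2\sigma^2}{1+2\sigma^2}\right)^{\hat n_2}\right]}
&\le \frac{\operatorname{Tr}\left[\hat\rho_d \left(\frac{1-2\sigma^2}{1+2\sigma^2}\right)^{\hat n_1}\otimes\left|\frac{1-2\sigma^2-4\sigma_\gamma^2}{1+2\sigma^2+4\sigma_\gamma^2}\right|^{\hat n_2}\right]}{\operatorname{Tr}\left[\hat\rho_d \left(\frac{1-2\sigma^2}{1+2\sigma^2}\right)^{\hat n_1}\otimes\left(\frac{1-2\sigma^2}{1+2\sigma^2}\right)^{\hat n_2}\right]} \tag{S176}\\
&= \frac{\operatorname{Tr}\left[\hat\rho_d \left(\frac{1-2\sigma^2}{1+2\sigma^2}\right)^{\hat n_1}\otimes\left(\frac{1-2\Sigma^2}{1+2\Sigma^2}\right)^{\hat n_2}\right]}{\operatorname{Tr}\left[\hat\rho_d \left(\frac{1-2\sigma^2}{1+2\sigma^2}\right)^{\hat n_1}\otimes\left(\frac{1-2\sigma^2}{1+2\sigma^2}\right)^{\hat n_2}\right]} \tag{S177}\\
&\le 1, \tag{S178}
\end{align}
where, in the second line, we define $-\frac{1-2\sigma^2-4\sigma_\gamma^2}{1+2\sigma^2+4\sigma_\gamma^2} := \frac{1-2\Sigma^2}{1+2\Sigma^2}$, i.e., $\Sigma^2 = \frac{1}{4(\sigma^2+2\sigma_\gamma^2)}$. In the third line, we use $\Sigma^2 \ge \sigma^2$, which can be easily verified under our assumptions for $\sigma$. Therefore, as long as $\sigma^2 \le \max\left\{\frac{1}{2} - 2\sigma_\gamma^2,\ \sigma_\gamma^2\left(\sqrt{1 + \frac{1}{4\sigma_\gamma^4}} - 1\right)\right\}$, we have the following bound
\[
\mathbb{E}_\gamma |G_\sigma(\gamma)|^2 \le \left(\frac{1+2\sigma^2}{1+2\sigma^2+4\sigma_\gamma^2}\right)^{n}. \tag{S179}
\]

This completes the proof of Lemma S1. Note that the equality can be achieved if $\hat\rho_d$ is chosen to be the vacuum state. One can verify this holds when $|A\rangle = |B\rangle = |\alpha\rangle$ for some coherent state $|\alpha\rangle$.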
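As a sanity check of the equality case, the simplest instance $|A\rangle = |B\rangle = |0\rangle$ can be worked out directly from the definitions in Eqs. (S146) and (S147). The short derivation below is a sketch added for illustration; it only assumes the vacuum characteristic function $\langle 0|\hat D(\beta)|0\rangle = e^{-|\beta|^2/2}$ and standard Gaussian integrals over $\mathbb{C}^n$.

```latex
\begin{align}
G(\beta) &= \langle 0|\hat D^\dagger(\beta)|0\rangle\,\langle 0|\hat D(\beta)|0\rangle = e^{-|\beta|^2}, \\
G_\sigma(\gamma) &= \frac{\int d^{2n}\beta\; e^{-|\beta-\gamma|^2/2\sigma^2}\, e^{-|\beta|^2}}
                        {\int d^{2n}\beta\; e^{-|\beta|^2/2\sigma^2}\, e^{-|\beta|^2}}
                  = \exp\!\left(-\frac{|\gamma|^2}{1+2\sigma^2}\right), \\
\mathbb{E}_\gamma |G_\sigma(\gamma)|^2
  &= \left(\frac{1}{2\pi\sigma_\gamma^2}\right)^{n} \int d^{2n}\gamma\;
     \exp\!\left[-|\gamma|^2\left(\frac{1}{2\sigma_\gamma^2} + \frac{2}{1+2\sigma^2}\right)\right]
   = \left(\frac{1+2\sigma^2}{1+2\sigma^2+4\sigma_\gamma^2}\right)^{n}.
\end{align}
```

The last expression coincides with the right-hand side of Eq. (S179), so the bound of Lemma S1 is saturated in this case.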

C. Lower bound for entanglement-free Gaussian schemes

In this section, we give a lower bound for a specific class of schemes, the Gaussian schemes, which may be of independent interest. An ancilla-free Gaussian scheme is specified by collections of adaptively chosen Gaussian input states and Gaussian measurements. Again, thanks to convexity, we consider only pure input states and rank-1 POVM measurements. A Gaussian input state can be expressed as $|A\rangle = \hat D(\omega)|\bar A\rangle$, where $|\bar A\rangle$ is a centered (i.e., zero-mean) Gaussian state. A Gaussian POVM can be written as
\[
\hat\Pi(\alpha) = |B\rangle\langle B| = \frac{1}{\pi^n}\hat D(\alpha)|\bar B\rangle\langle\bar B|\hat D^\dagger(\alpha), \tag{S180}
\]
for outcomes $\alpha \in \mathbb{C}^n$, where $|\bar B\rangle$ is a centered Gaussian state. We refer the readers to Refs. [1–3] for more details about Gaussian quantum information.

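Proposition S3. Let $n, \sigma, \kappa, \epsilon$ be positive numbers such that
\[
n \ge 8, \qquad \epsilon \le 0.24. \tag{S181}
\]
If there exists an entanglement-free Gaussian scheme that, after learning from $N$ copies of an $n$-mode random displacement channel $\Lambda \in \Lambda^{\epsilon,\sigma}_{3\text{-peak}}$ and then receiving a query $\beta \in \mathbb{C}^n$, can return an estimate $\tilde\lambda(\beta)$ of $\lambda(\beta)$ such that $|\tilde\lambda(\beta) - \lambda(\beta)| \le \epsilon$ with probability at least 2/3 for all $\beta$ such that $|\beta|^2 \le n\kappa$, then
\[
N \ge 0.01\,\epsilon^{-2} \min\left\{\left(1 + \frac{0.99\kappa}{\sigma^2}\right)^{n/2},\ \left(1 + \frac{1.98\kappa}{1+2\sigma^2}\right)^{n}\right\}. \tag{S182}
\]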

A few remarks before presenting the proof: When $\kappa = O(1)$ and $\sigma^2 \ll \kappa$, the second expression in the minimization dominates, and we recover the bound of Theorem S2. On the other hand, the bound for Gaussian schemes also holds for arbitrarily large $\sigma$, though the lower bound then enters a different branch and the separation from entanglement-assisted schemes becomes weaker.

Proof. Consider the same partially-revealed hypothesis-testing task and the same strategy used by
Bob in the proof of Theorem S2. Recall that the average TVD under the two hypotheses is lower
bounded by

Eγ TVD(p1 , p2,γ ) ≥ 0.1666. (S183)

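To upper bound the average TVD, recall the following bound derived in Eq. (S142),
\[
\mathbb{E}_\gamma \operatorname{TVD}(p_1, p_{2,\gamma}) \le \sum_{o_{1:N}} p_1(o_{1:N}) \sum_{k=1}^{N} 16\epsilon_0^2\, \mathbb{E}_\gamma \big|G^{o_{\le k}}_{\sigma}(\gamma)\big|^2, \tag{S184}
\]
with $G^{o_{\le k}}_{\sigma}$ defined as
\begin{align}
G^{o_{\le k}}_{\sigma}(\gamma) &= \frac{\int d^{2n}\beta\, e^{-\frac{|\beta-\gamma|^2}{2\sigma^2}}\, G^{o_{\le k}}(\beta)}{\int d^{2n}\beta'\, e^{-\frac{|\beta'|^2}{2\sigma^2}}\, G^{o_{\le k}}(\beta')}, \tag{S185}\\
G^{o_{\le k}}(\beta) &= \frac{\langle B^{o_{<k}}_{o_k}|\hat D^\dagger(\beta)|B^{o_{<k}}_{o_k}\rangle}{\langle B^{o_{<k}}_{o_k}|B^{o_{<k}}_{o_k}\rangle}\cdot\langle A^{o_{<k}}|\hat D(\beta)|A^{o_{<k}}\rangle. \tag{S186}
\end{align}

Note that this bound holds for any $\sigma$. Now we calculate the R.H.S. with Gaussian schemes. First compute $G^{o_{\le k}}(\beta)$,
\[
G^{o_{\le k}}(\beta) = \langle\bar B|\hat D^\dagger(\alpha)\hat D^\dagger(\beta)\hat D(\alpha)|\bar B\rangle\langle A|\hat D(\beta)|A\rangle = \langle\bar B|\hat D^\dagger(\beta)|\bar B\rangle\langle\bar A|\hat D(\beta)|\bar A\rangle\, e^{\beta^\dagger(\alpha-\omega) - (\alpha-\omega)^\dagger\beta}. \tag{S187}
\]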

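Here, without loss of generality, we can always write $|\bar A\rangle = \hat U_{{\rm BS}_A}\hat U_{{\rm sq}_A}|0\rangle$, where $\hat U_{{\rm BS}_A}$ represents the unitary operator for a beam-splitter network and $\hat U_{{\rm sq}_A}$ represents the product of single-mode squeezing operations. Similarly, $|\bar B\rangle = \hat U_{{\rm BS}_B}\hat U_{{\rm sq}_B}|0\rangle$.
To simplify $G^{o_{\le k}}(\beta)$, using $\hat a := (\hat x + i\hat p)/\sqrt{2}$, we can rewrite the displacement operator as
\[
\hat D(\beta) := \exp\left(\beta\hat a^\dagger - \beta^\dagger\hat a\right) = \exp\left(\sqrt{2}i \operatorname{Im}\beta\, \hat x - \sqrt{2}i \operatorname{Re}\beta\, \hat p\right) = \exp\left(\sqrt{2}i\, v\cdot\hat q\right), \tag{S188}
\]
where $\hat q := (\hat x_1, \dots, \hat x_n, \hat p_1, \dots, \hat p_n)^T$ and $v(\beta) := (\operatorname{Im}\beta_1, \dots, \operatorname{Im}\beta_n, \operatorname{Re}\beta_1, \dots, \operatorname{Re}\beta_n)^T$. And
\[
\langle 0|\hat D(\beta)|0\rangle = e^{-\frac{1}{2}|\beta|^2} = e^{-\frac{1}{2}|v|^2}. \tag{S189}
\]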

Now, let us introduce the symplectic matrix $S$ that describes the dynamics of quadrature operators under a Gaussian unitary operation $\hat U$:
\[
\hat U^\dagger \hat q_i \hat U = (S\hat q)_i. \tag{S190}
\]

Since the Gaussian unitary operation we consider is written as $\hat U_{\rm BS}\hat U_{\rm sq}$, the symplectic matrix can be decomposed as $S = S_{\rm BS}S_{\rm sq}$. Here, $S_{\rm BS}$ is an orthogonal matrix and $S_{\rm sq}$ can be explicitly written as $\operatorname{diag}(e^{r_1}, \dots, e^{r_n}, e^{-r_1}, \dots, e^{-r_n})$, where $r_1, \dots, r_n \ge 0$ represent the squeezing parameters for each mode. We use $r$ for the squeezing parameters of $|\bar A\rangle$ and $s$ for those of $|\bar B\rangle$. After the symplectic transformation $S$, the displacement operator transforms as
\[
\exp\left(\sqrt{2}i\, v^T\hat q\right) \to \exp\left(\sqrt{2}i\, v\cdot(S\hat q)\right) = \exp\left(\sqrt{2}i\, (S^T v)\cdot\hat q\right), \tag{S191}
\]
and
\[
\langle 0|\exp\left(\sqrt{2}i\, (S^T v)\cdot\hat q\right)|0\rangle = e^{-\frac{1}{2}|S^T v|^2}. \tag{S192}
\]

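Thus,
\[
\langle\bar A|\hat D(\beta)|\bar A\rangle = \langle 0|\hat U_{{\rm sq}_A}^\dagger \hat U_{{\rm BS}_A}^\dagger \hat D(\beta)\hat U_{{\rm BS}_A}\hat U_{{\rm sq}_A}|0\rangle = e^{-\frac{1}{2}|S_A^T v|^2}, \tag{S193}
\]
and
\[
\langle\bar B|\hat D^\dagger(\beta)|\bar B\rangle = \langle\bar B|\hat D(\beta)|\bar B\rangle = e^{-\frac{1}{2}|S_B^T v'|^2} = e^{-\frac{1}{2}|S_B^T v|^2}, \tag{S194}
\]
where $v := v(\beta)$. The first equality is due to the fact that $\langle\bar B|\hat D^\dagger(\beta)|\bar B\rangle$ is real. And we can write the phase factor as
\[
\exp\left(\beta^\dagger(\alpha-\omega) - (\alpha-\omega)^\dagger\beta\right) = \exp\left(2i\, v^T u\right), \tag{S195}
\]
where $u := (-\operatorname{Re}(\alpha_1-\omega_1), \dots, -\operatorname{Re}(\alpha_n-\omega_n), \operatorname{Im}(\alpha_1-\omega_1), \dots, \operatorname{Im}(\alpha_n-\omega_n))^T$. Thus,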

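\begin{align}
G^{o_{\le k}}(\beta) &= \langle\bar B|\hat D^\dagger(\beta)|\bar B\rangle\langle\bar A|\hat D(\beta)|\bar A\rangle\, e^{\beta^\dagger(\alpha-\omega) - (\alpha-\omega)^\dagger\beta} \tag{S196}\\
&= \exp\left(-\frac{1}{2}v^T\left(S_A S_A^T + S_B S_B^T\right)v + 2i\, v^T u\right) \tag{S197}\\
&:= \exp\left(-\frac{1}{2}v^T\Sigma v + 2i\, v^T u\right) \tag{S198}\\
&= \exp\left(-\frac{1}{2}(Ov)^T D (Ov) + 2i\, (Ov)\cdot(Ou)\right) \tag{S199}\\
&= \prod_{i=1}^{2n}\exp\left[-\frac{d_i}{2}\left(v_i' - \frac{2i}{d_i}u_i'\right)^2 - \frac{2u_i'^2}{d_i}\right], \tag{S200}
\end{align}
where $\Sigma := S_A S_A^T + S_B S_B^T > 0$ is diagonalized as $\Sigma = O^T D O$ with diagonal matrix $D = \operatorname{diag}(d_1, \dots, d_{2n}) > 0$, and $v' := Ov$, $u' := Ou$. Let us analyze the spectrum of $D$. Note that since $S_A S_A^T$ and $S_B S_B^T$ are physical covariance matrices, $S_A S_A^T \ge i\Omega$, $S_B S_B^T \ge i\Omega$, and thus $\Sigma \ge 2i\Omega$ [3], where
\[
\Omega = \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix} \otimes \mathbb{1}_M. \tag{S201}
\]

Hence, by the Williamson decomposition [3], the spectrum of $\Sigma$ is composed of pairs such that the product of the $i$th and $(i+n)$th eigenvalues of this matrix is no smaller than 4. Without loss of generality, we label the eigenvalues $d_1, \dots, d_{2n}$ in such a way that $d_i d_{i+n} \ge 4$ for all $1 \le i \le n$.
Now let us compute $G^{o_{\le k}}_{\sigma}(\gamma)$.
\[
G^{o_{\le k}}_{\sigma}(\gamma) = \frac{\int d^{2n}\beta\, e^{-\frac{|\beta-\gamma|^2}{2\sigma^2}}\, G^{o_{\le k}}(\beta)/(2\pi\sigma^2)^n}{\int d^{2n}\beta'\, e^{-\frac{|\beta'|^2}{2\sigma^2}}\, G^{o_{\le k}}(\beta')/(2\pi\sigma^2)^n}. \tag{S202}
\]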

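By defining $z := (\operatorname{Im}\gamma_1, \dots, \operatorname{Im}\gamma_n, \operatorname{Re}\gamma_1, \dots, \operatorname{Re}\gamma_n)$, and $z' := Oz$, we have
\begin{align}
&\frac{1}{(2\pi\sigma^2)^n}\int d^{2n}\beta\, e^{-\frac{|\beta-\gamma|^2}{2\sigma^2}}\, G^{o_{\le k}}(\beta) \tag{S203}\\
&= \frac{1}{(2\pi\sigma^2)^n}\int d^{2n}v' \prod_{i=1}^{2n}\exp\left[-\frac{(v_i' - z_i')^2}{2\sigma^2} - \frac{d_i}{2}\left(v_i' - \frac{2i}{d_i}u_i'\right)^2 - \frac{2u_i'^2}{d_i}\right] \tag{S204}\\
&= \prod_{i=1}^{2n}\left[\frac{1}{\sqrt{1 + d_i\sigma^2}}\exp\left(-\frac{2u_i'^2\sigma^2}{1 + d_i\sigma^2}\right)\exp\left(\frac{1}{2}\,\frac{-d_i z_i'^2 + 4i\, u_i' z_i'}{1 + d_i\sigma^2}\right)\right]. \tag{S205}
\end{align}

Therefore, we obtain
\[
G^{o_{\le k}}_{\sigma}(\gamma) = \prod_{i=1}^{2n}\exp\left[\frac{1}{2}\,\frac{-d_i z_i'^2 + 4i\, u_i' z_i'}{1 + d_i\sigma^2}\right], \quad\text{and}\quad \big|G^{o_{\le k}}_{\sigma}(\gamma)\big| = \prod_{i=1}^{2n}\exp\left[\frac{-d_i z_i'^2}{2(1 + d_i\sigma^2)}\right]. \tag{S206}
\]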

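Thus, after taking the average over $\gamma$, we obtain
\[
\mathbb{E}_\gamma \big|G^{o_{\le k}}_{\sigma}(\gamma)\big|^2 = \int d^{2n}z'\, \frac{e^{-\frac{|z|^2}{2\sigma_\gamma^2}}}{(2\pi\sigma_\gamma^2)^n}\prod_{i=1}^{2n}\exp\left[\frac{-d_i z_i'^2}{1 + d_i\sigma^2}\right] = \prod_{i=1}^{2n}\sqrt{\frac{1}{1 + 2d_i\sigma_\gamma^2/(1 + d_i\sigma^2)}}. \tag{S207}
\]

Substituting this back to Eq. (S184), we obtain the following upper bound for the average TVD
\[
\mathbb{E}_\gamma \operatorname{TVD}(p_1, p_{2,\gamma}) \le 16 N \epsilon_0^2 \prod_{i=1}^{2n}\sqrt{\frac{1}{1 + 2d_i\sigma_\gamma^2/(1 + d_i\sigma^2)}}, \tag{S208}
\]
which, combined with the lower bound Eq. (S183) and substituting $\epsilon_0 = \epsilon/0.98$, yields the following sample complexity bound
\[
N \ge 0.01\,\epsilon^{-2}\prod_{i=1}^{2n}\sqrt{1 + \frac{2d_i\sigma_\gamma^2}{1 + d_i\sigma^2}}. \tag{S209}
\]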

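To find a lower bound independent of the $d_i$’s, focus on the following product, for any $1 \le i \le n$,
\[
\left(1 + \frac{2d_i\sigma_\gamma^2}{1 + d_i\sigma^2}\right)\left(1 + \frac{2d_{i+n}\sigma_\gamma^2}{1 + d_{i+n}\sigma^2}\right). \tag{S210}
\]

This is an increasing function in $d_i$ and $d_{i+n}$. We know the spectrum satisfies $d_i d_{i+n} \ge 4$. Hence, we can lower bound it by setting $d_{i+n}/2 = 2/d_i := d > 0$, which leads to
\[
\left(1 + \frac{2d_i\sigma_\gamma^2}{1 + d_i\sigma^2}\right)\left(1 + \frac{2d_{i+n}\sigma_\gamma^2}{1 + d_{i+n}\sigma^2}\right) \ge \frac{(d + 2\sigma^2 + 4\sigma_\gamma^2)\big(1 + 2d(\sigma^2 + 2\sigma_\gamma^2)\big)}{(d + 2\sigma^2)(1 + 2d\sigma^2)}. \tag{S211}
\]

The R.H.S. is differentiable in $d$, with its only extreme value at $d = 1$ being
\[
\left(1 + \frac{4\sigma_\gamma^2}{1 + 2\sigma^2}\right)^{2}. \tag{S212}
\]

Meanwhile, when $d \to 0$ or $d \to \infty$, it becomes $1 + 2\sigma_\gamma^2/\sigma^2$. We thus have the following lower bound,
\[
\left(1 + \frac{2d_i\sigma_\gamma^2}{1 + d_i\sigma^2}\right)\left(1 + \frac{2d_{i+n}\sigma_\gamma^2}{1 + d_{i+n}\sigma^2}\right) \ge \min\left\{1 + \frac{2\sigma_\gamma^2}{\sigma^2},\ \left(1 + \frac{4\sigma_\gamma^2}{1 + 2\sigma^2}\right)^{2}\right\}, \tag{S213}
\]
which gives us the following sample complexity lower bound:
\[
N \ge 0.01\,\epsilon^{-2}\min\left\{\left(1 + \frac{2\sigma_\gamma^2}{\sigma^2}\right)^{n/2},\ \left(1 + \frac{4\sigma_\gamma^2}{1 + 2\sigma^2}\right)^{n}\right\}. \tag{S214}
\]

Substituting $2\sigma_\gamma^2 = 0.99\kappa$ completes the proof of Proposition S3.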
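The last steps of the proof can be checked numerically. The sketch below evaluates the right-hand side of Eq. (S211) as a function of d, compares it with the value at d = 1 in Eq. (S212) and with the limiting value 1 + 2σγ²/σ², and evaluates the bound of Eq. (S182). The function names `pair_product` and `gaussian_lower_bound` are hypothetical helpers introduced for illustration, and the parameter values are assumptions chosen to exercise both branches of the minimum.

```python
def pair_product(d, sigma, sigma_gamma):
    """Right-hand side of Eq. (S211) as a function of d > 0."""
    s2, g2 = sigma ** 2, sigma_gamma ** 2
    num = (d + 2 * s2 + 4 * g2) * (1 + 2 * d * (s2 + 2 * g2))
    den = (d + 2 * s2) * (1 + 2 * d * s2)
    return num / den

def gaussian_lower_bound(n, kappa, sigma, eps):
    """Right-hand side of Eq. (S182)."""
    branch_a = (1 + 0.99 * kappa / sigma ** 2) ** (n / 2)
    branch_b = (1 + 1.98 * kappa / (1 + 2 * sigma ** 2)) ** n
    return 0.01 * eps ** -2 * min(branch_a, branch_b)

kappa = 1.0
sigma_gamma = (0.99 * kappa / 2) ** 0.5  # from 2 sigma_gamma^2 = 0.99 kappa
for sigma in (0.3, 2.0):                 # small and large sigma
    at_one = pair_product(1.0, sigma, sigma_gamma)
    limit = 1 + 2 * sigma_gamma ** 2 / sigma ** 2
    print(sigma, at_one, limit, gaussian_lower_bound(8, kappa, sigma, 0.2))
```

For σ = 0.3 the value at d = 1 is the smaller of the two, so the second branch of Eq. (S182) is active and the Gaussian bound coincides with the bound of Theorem S2; for σ = 2 the limiting value is smaller and the first branch takes over.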

S4. GAUSSIAN TAIL EFFECT

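In this section, we show that the probability mass removed by truncating a multivariate normal distribution is smaller than 0.5 under our choice of parameters, which is used to derive the lower bound for entanglement-free schemes. Consider a multivariate normal distribution:
\[
q(x) = \left(\frac{1}{2\pi\sigma^2}\right)^{n}\exp\left(-\frac{|x|^2}{2\sigma^2}\right), \tag{S215}
\]
where $x \in \mathbb{R}^{2n}$. Note that although the main text considers $\gamma \in \mathbb{C}^n$, the two descriptions are equivalent. Now, we consider a truncated distribution with $|x|^2 \le R^2$ for a given $R$:
\begin{align}
\int_{|x|\le R} dx\, q(x) &= \int_{|x|\le R} dx \left(\frac{1}{2\pi\sigma^2}\right)^{n}\exp\left(-\frac{|x|^2}{2\sigma^2}\right) \tag{S216}\\
&= \left(\frac{1}{2\pi\sigma^2}\right)^{n}\int_0^R dr \int d\Omega_{2n}\, r^{2n-1}\exp\left(-\frac{r^2}{2\sigma^2}\right) \tag{S217}\\
&= 1 - \frac{\Gamma\left(n, \frac{R^2}{2\sigma^2}\right)}{\Gamma(n)}, \tag{S218}
\end{align}
where we have used the following integrals:
\[
\int d\Omega_n = \frac{2\pi^{n/2}}{\Gamma(n/2)}, \qquad \int_0^R dr\, r^{2n-1}\exp\left(-\frac{r^2}{2\sigma^2}\right) = 2^{n-1}\sigma^{2n}\left[\Gamma(n) - \Gamma\left(n, \frac{R^2}{2\sigma^2}\right)\right], \tag{S219}
\]
and
\[
\Gamma(n, x) = \int_x^\infty t^{n-1}e^{-t}\, dt \tag{S220}
\]
is the (upper) incomplete gamma function and $\Gamma(n) = \Gamma(n, 0)$. Therefore, the tail probability is given by $\Gamma\left(n, \frac{R^2}{2\sigma^2}\right)/\Gamma(n)$. In the main text and the proof of the sample complexity lower bound for entanglement-free schemes, we choose $2\sigma^2 = 0.99\kappa$ and $R^2 = \kappa n$. For our purpose, it suffices to show that $\frac{\Gamma(n, n/0.99)}{\Gamma(n)} \le 0.5$. To see this, we use the following inequality [14]: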

\[
\frac{\Gamma(n, kn)}{\Gamma(n)} \le \left(k e^{1-k}\right)^n, \qquad \forall\, k > 1. \tag{S221}
\]

First notice that for $k = 1/0.99$ and $n = 14000$, $(k e^{1-k})^n \le 0.492$. Now, for every $n < 14000$, one can numerically verify $\frac{\Gamma(n, n/0.99)}{\Gamma(n)} \le 0.5$ (see Fig. S5). For $n \ge 14000$, the upper bound $(k e^{1-k})^n$ monotonically decreases with $n$, so we also have $\frac{\Gamma(n, n/0.99)}{\Gamma(n)} \le 0.492$. Combining these two cases completes the proof.
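The two ingredients of this argument can be reproduced with a short script. The sketch below is an illustration, not the code used for Fig. S5: for integer n it evaluates Γ(n, x)/Γ(n) through the identity e^{-x} Σ_{k<n} x^k/k! (summed in log space), and it evaluates the bound of Eq. (S221) at k = 1/0.99. The helper names `upper_gamma_ratio` and `exponential_tail_bound` are hypothetical, and only a few sample values of n are checked here rather than every n below 14000.

```python
import math

def upper_gamma_ratio(n, x):
    """Gamma(n, x) / Gamma(n) for integer n >= 1 and x > 0, computed in log space."""
    logs = [-x + k * math.log(x) - math.lgamma(k + 1) for k in range(n)]
    m = max(logs)
    return math.exp(m) * sum(math.exp(v - m) for v in logs)

def exponential_tail_bound(n, k):
    """Right-hand side of Eq. (S221), (k e^{1-k})^n, evaluated via exp of the log."""
    return math.exp(n * (math.log(k) + 1.0 - k))

k = 1.0 / 0.99
print("bound at n = 14000:", exponential_tail_bound(14000, k))
for n in (1, 8, 100, 1000):
    print(n, upper_gamma_ratio(n, k * n))
```

Extending the loop to all n below 14000 would reproduce the numerical verification described above, at the cost of a longer run time.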

FIG. S5. Numerical verification that the Gaussian tail probability is upper bounded by 0.5 for n up to 14000.

[1] A. Ferraro, S. Olivares, and M. G. Paris, Gaussian states in continuous variable quantum information,
arXiv preprint quant-ph/0503237 (2005).
[2] C. Weedbrook, S. Pirandola, R. García-Patrón, N. J. Cerf, T. C. Ralph, J. H. Shapiro, and S. Lloyd,
Gaussian quantum information, Reviews of Modern Physics 84, 621 (2012).
[3] A. Serafini, Quantum continuous variables: a primer of theoretical methods (CRC press, 2017).
[4] H.-Y. Huang, R. Kueng, and J. Preskill, Information-theoretic bounds on quantum advantage in machine
learning, Physical Review Letters 126, 190505 (2021).
[5] D. Aharonov, J. Cotler, and X.-L. Qi, Quantum algorithmic measurement, Nature communications 13,
1 (2022).
[6] S. Chen, S. Zhou, A. Seif, and L. Jiang, Quantum advantages for Pauli channel estimation, Phys. Rev.
A 105, 032435 (2022).
[7] S. Chen, C. Oh, S. Zhou, H.-Y. Huang, and L. Jiang, Tight bounds on Pauli channel learning without
entanglement, arXiv preprint arXiv:2309.13461 (2023).
[8] S. Chen and W. Gong, Futility and utility of a few ancillas for Pauli channel learning, arXiv preprint
arXiv:2309.14326 (2023).
[9] H.-Y. Huang, M. Broughton, J. Cotler, S. Chen, J. Li, M. Mohseni, H. Neven, R. Babbush, R. Kueng,
J. Preskill, and J. R. McClean, Quantum advantage in learning from experiments, Science 376, 1182
(2022).
[10] K. Kornelson and D. Larson, Rank-one decomposition of operators and construction of frames,
Contemporary Mathematics 345, 203 (2004).
[11] K. R. Castleman, Digital image processing (Prentice Hall Press, 1996).
[12] S. Barnett and P. M. Radmore, Methods in theoretical quantum optics, Vol. 15 (Oxford University Press,
2002).
[13] C. M. Caves and K. Wódkiewicz, Fidelity of Gaussian channels, Open Systems & Information Dynamics
11, 309 (2004).
[14] M. Ghosh, Exponential tail bounds for chisquared random variables, Journal of Statistical Theory and
Practice 15, 1 (2021).

