Contextual Multi-Armed Bandits

Tyler Lu, Dávid Pál, Martin Pál

Abstract

We study contextual multi-armed bandit problems where the context comes from a metric space and the payoff satisfies a Lipschitz condition with respect to the metric. Abstractly, a contextual multi-armed bandit problem models a situation where, in a sequence of independent trials, an online algorithm chooses, based on a given context (side information), an action from a set of possible actions so as to maximize the total payoff of the chosen actions. The payoff depends on both the action chosen and the context. In contrast, context-free multi-armed bandit problems, a focus of much previous research, model situations where no side information is available and the payoff depends only on the action chosen. Our problem is motivated by sponsored web search, where the task is to display ads to a user of an Internet search engine based on her search query so as to maximize the click-through rate (CTR) of the ads displayed. We cast this problem as a contextual multi-armed bandit problem where queries and ads form metric spaces and the payoff function is Lipschitz with respect to both metrics. For any ε > 0 we present an algorithm with regret O(T^{(a+b+1)/(a+b+2)+ε}), where a, b are the covering dimensions of the query space and the ad space respectively. We prove a lower bound Ω(T^{(ã+b̃+1)/(ã+b̃+2)−ε}) for the regret of any algorithm, where ã, b̃ are the packing dimensions of the query space and the ad space respectively. For finite spaces or convex bounded subsets of Euclidean spaces, this gives almost matching upper and lower bounds.

1 INTRODUCTION

Internet search engines, such as Google, Yahoo! and Microsoft's Bing, receive revenue from advertisements shown alongside a user's query. Whenever a user decides to click on an ad displayed for a search query, the advertiser pays the search engine. Thus, part of the search engine's goal is to display ads that are most relevant to the user, in the hope of increasing the chance of a click and possibly increasing its expected revenue. In order to achieve this, the search engine has to learn over time which ads are the most relevant to display for different queries. On the one hand, it is important to exploit currently relevant ads; on the other hand, one should explore potentially relevant ads. This problem can be naturally posed as a multi-armed bandit problem with context. Here by context we mean a user's query. Each time a query x arrives and an ad y is displayed, there is an (unknown) probability µ(x, y) that the user clicks on the ad (for simplicity we assume that one ad is displayed per query). We call µ(x, y) the click-through rate (or CTR) of x and y.

We want to design an online algorithm which, given a query in each time step and a history of past queries and ad clicks, displays an ad so as to maximize the expected number of clicks. In our setting, we make a crucial yet very natural assumption: the spaces of queries and ads are endowed with metrics, and µ(x, y) satisfies a Lipschitz condition with respect to each coordinate. Informally, we assume that the CTRs of two similar ads for the same query are close, and that the CTRs of two similar queries for the same ad are also close. Lastly, we assume that the sequence of queries is fixed in advance by an adversary and revealed one query per time step (an oblivious adversary).
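One natural way to write the Lipschitz assumption just described (the formal model is given in section 1.2; the coordinate-wise form below, with Lipschitz constant 1, is a shorthand rather than the precise definition) is

    |µ(x, y) − µ(x′, y)| ≤ L_X(x, x′)   and   |µ(x, y) − µ(x, y′)| ≤ L_Y(y, y′)   for all x, x′ ∈ X and y, y′ ∈ Y,

where L_X and L_Y denote the metrics on queries and ads; together the two conditions imply |µ(x, y) − µ(x′, y′)| ≤ L_X(x, x′) + L_Y(y, y′).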
Clearly, the best possible algorithm, the Bayes optimal, displays, for a given query, the ad which has the highest CTR. Of course, in order to execute it the CTRs must be known. Instead, we are interested in algorithms that do not depend on knowledge of the CTRs and whose performance is still asymptotically the same as that of the Bayes optimal. More precisely, for any algorithm A, we consider the expected difference between the number of clicks that the Bayes optimal receives and the number A receives over T queries. This difference is called the regret of A and is denoted by R_A(T). An algorithm is said to be asymptotically Bayes optimal if the per-query regret R_A(T)/T approaches 0 as T → ∞ for any sequence of queries.

The standard measure of quality of an asymptotically Bayes optimal algorithm is the speed at which its per-round regret approaches zero. Equivalently, one measures the growth of the regret R_A(T) as T → ∞. The bounds are usually of the form R_A(T) = O(T^γ) for some γ < 1. Such regret bounds are the standard way of measuring the performance of algorithms for multi-armed bandit problems, for online learning problems and, more broadly, for reinforcement learning problems.

The main contributions of this paper are 1) a formal model of the Lipschitz contextual bandit problem on metric spaces, 2) a novel, conceptually simple and clean algorithm, which we call query-ad-clustering, and 3) lower bounds that show the algorithm is essentially optimal with respect to regret. In particular, the following theorem states our results in our contextual bandit model. Note that the covering dimension of a metric space is defined as the smallest d such that the number of balls of radius r required to cover the space is O(r^{−d}). The packing dimension is defined as the largest d̃ such that for any r there exists a subset of disjoint balls of radius r of size Ω(r^{−d̃}).

Theorem 1. Consider a contextual Lipschitz multi-armed bandit problem with query metric space (X, L_X) and ad metric space (Y, L_Y) of size at least 2. Let a, b be the covering dimensions of X, Y respectively, and let ã, b̃ be the packing dimensions of X, Y respectively. Then,

• For any γ > (a+b+1)/(a+b+2), the query-ad-clustering algorithm A has the property that there exist constants T_0, C such that for any instance µ, any T ≥ T_0 and any sequence of T queries the regret satisfies R_A(T) ≤ C · T^γ.

• For any γ < (ã+b̃+1)/(ã+b̃+2), there exist positive constants C, T_0 such that for any T ≥ T_0 and any algorithm A there exist an instance µ and a sequence of T queries such that the regret satisfies R_A(T) ≥ C · T^γ.

If the query space and the ad space are convex bounded subsets of Euclidean spaces, or are finite, then ã = a and b̃ = b (finite spaces have zero dimension) and the theorem provides matching upper and lower bounds.

The paper is organized as follows. In section 1.1 we discuss related work, and in section 1.2 we introduce our Lipschitz contextual multi-armed bandit model. Then we introduce the query-ad-clustering algorithm in section 2 and give an upper bound on its regret. In section 3 we present what is essentially a matching lower bound on the regret of any Lipschitz contextual bandit algorithm, showing that our algorithm is essentially optimal.

1.1 RELATED WORK

There is a body of relevant literature on context-free multi-armed bandit problems: the first bounds on the regret for the model with a finite action space were obtained in the classic paper by Lai and Robbins [1985]; a more detailed exposition can be found in Auer et al. [2002]. Auer et al. [2003] introduced the non-stochastic bandit problem, in which payoffs are adversarial, and provided regret-optimal algorithms for it. In recent years much work has been done on very large action spaces. Flaxman et al. [2005] considered a setting where the actions form a convex set and in each round a convex payoff function is adversarially chosen. Continuum action spaces and payoff functions satisfying (variants of) a Lipschitz condition were studied in Kleinberg [2005a,b] and Auer et al. [2007]. Most recently, metric action spaces where the payoff function is Lipschitz were considered by Kleinberg et al. [2008]. Inspired by their work, we also consider metric spaces. In a follow-up paper by Bubeck et al. [2008] the results of Kleinberg et al. [2008] are extended to more general settings.

Our model can be viewed as a direct and strict generalization of the classical multi-armed bandit problem of Lai and Robbins and of the bandit problem in continuum and general metric spaces as presented by Agrawal [1995] and Kleinberg et al. [2008]. These models can be viewed as a special case of our model where the query space is a singleton. Our upper and lower bounds on the regret apply to these models as well. See section 1.3 for a closer comparison with the model of Kleinberg et al. [2008].

Online learning with expert advice is a class of problems related to multi-armed bandits; see the book by Cesa-Bianchi and Lugosi [2006]. These can be viewed as multi-armed bandit problems with side information, but their structure is different from the structure of our model. The most relevant work is the Exp4 algorithm of Auer et al. [2003], where the experts are simply arbitrary multi-armed bandit algorithms and the goal is to compete against the best expert. In fact this setting and the Exp4 algorithm can be reformulated in our model, which is discussed further at the end of section 2.

We are aware of three papers that define a multi-armed bandit problem with side information. The first two are by Wang et al. [2005] and Goldenshluger and Zeevi [2007]; however, the models in these papers are very different from ours. The epoch-greedy algorithm proposed in Langford and Zhang [2007] pertains to a setting where contexts arrive i.i.d. and regret is defined relative to the best context-to-action mapping in some fixed class of such mappings. They upper bound the regret of epoch-greedy in terms of an exploitation parameter, which makes it hard to compare their bound with ours.
We first present the algorithm and then prove an O(T^γ) upper bound on its regret.

Before we state the algorithm we define several parameters that depend on (X, Y) and γ and fully specify the algorithm. Let a, b be the covering dimensions of X, Y respectively. We define a′, b′ so that a′ > a, b′ > b and γ > (a′+b′+1)/(a′+b′+2). We also let c, d be constants such that, for every r ∈ (0, 1], X can be covered by at most c · r^{−a′} balls of radius r and Y can be covered by at most d · r^{−b′} balls of radius r; such constants exist by the definition of the covering dimension.

In the analysis we use Hoeffding's inequality; a proof can be found in the book [Devroye and Lugosi, 2001, Chapter 2] or in the original paper by Hoeffding [1963].

Hoeffding's Inequality. Let X_1, X_2, ..., X_n be independent bounded random variables such that X_i, 1 ≤ i ≤ n, has support [a_i, b_i]. Then for the sum S = X_1 + X_2 + ... + X_n we have, for any u ≥ 0,

    Pr[ S − E[S] ≥ u ] ≤ exp( −2u² / Σ_{i=1}^{n} (b_i − a_i)² ).
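For orientation, here is a minimal Python sketch of a query-ad-clustering scheme built around these parameters. It is an illustrative reconstruction, not the paper's exact specification: the greedy covering routine, the UCB-style index, and the treatment of X and Y as finite lists are placeholders; the arguments a1, b1 play the role of a′, b′ above, and the per-phase radius r follows the values recalled in the proof of Lemma 6 below.

```python
import math
import random


def greedy_cover(points, dist, r):
    """Greedily pick centers so that every point is within distance r of some center."""
    centers = []
    for p in points:
        if all(dist(p, q) > r for q in centers):
            centers.append(p)
    return centers


def nearest(p, centers, dist):
    return min(centers, key=lambda q: dist(p, q))


def query_ad_clustering(queries, X, Y, dist_x, dist_y, mu, a1, b1, rng=random):
    """Play len(queries) rounds; mu is the unknown CTR, used only to draw Bernoulli clicks."""
    T = len(queries)
    total_clicks = 0
    t, phase = 1, 0
    while t <= T:
        # Phase i lasts for rounds 2^i, ..., 2^(i+1) - 1 and uses radius r.
        r = 2.0 ** (-phase / (a1 + b1 + 2))
        x_centers = greedy_cover(X, dist_x, r)    # query clusters X_1, ..., X_N
        y_centers = greedy_cover(Y, dist_y, r)    # candidate ads Y_0
        n = {(x0, y0): 0 for x0 in x_centers for y0 in y_centers}  # display counts
        s = dict(n)                               # click counts
        phase_end = min(T, 2 ** (phase + 1) - 1)
        while t <= phase_end:
            x = queries[t - 1]
            x0 = nearest(x, x_centers, dist_x)    # cluster of the current query

            def ucb(y0):
                # Optimistic index for ad y0 within cluster x0 (placeholder rule).
                if n[(x0, y0)] == 0:
                    return float("inf")
                mean = s[(x0, y0)] / n[(x0, y0)]
                return mean + math.sqrt(2.0 * math.log(t + 1) / n[(x0, y0)])

            y = max(y_centers, key=ucb)
            click = 1 if rng.random() < mu(x, y) else 0   # Bernoulli(mu(x, y)) feedback
            total_clicks += click
            n[(x0, y)] += 1
            s[(x0, y)] += click
            t += 1
        phase += 1
    return total_clicks
```

Keeping one index per (cluster, candidate ad) pair means the algorithm maintains only N · K statistics per phase; this is the structural saving, relative to running Exp4 over all K^N cluster-to-ad mappings, that is discussed at the end of section 2.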
Now suppose that the good event occurs. Let R̂ be the actual regret,

    R̂ = Σ_{t : 2^i ≤ t ≤ min(T, 2^{i+1}−1), x_t ∈ X_j} ( sup_{y′ ∈ Y} µ(x_t, y′) − µ(x_t, y_t) ).

Since during phase i the algorithm displays ads only from Y_0, the actual regret R̂ can be decomposed as a sum R̂ = Σ_{y ∈ Y_0} R̂_y, where R̂_y is the contribution to the regret from displaying the ad y, that is,

    R̂_y = Σ_{t : 2^i ≤ t ≤ min(T, 2^{i+1}−1), x_t ∈ X_j, y_t = y} ( sup_{y′ ∈ Y} µ(x_t, y′) − µ(x_t, y) ).

Fix y ∈ Y_0. Pick any ε > 0. Let y* be an ε-optimal ad for the query x_0, that is, y* is such that µ(x_0, y*) ≥ sup_{y ∈ Y} µ(x_0, y) − ε. Let y_0* be the optimal ad in Y_0 for the query x_0, that is, y_0* = argmax_{y ∈ Y_0} µ(x_0, y). The Lipschitz condition guarantees that for any x_t ∈ X_j

    sup_{y′ ∈ Y} µ(x_t, y′) ≤ sup_{y ∈ Y} µ(x_0, y) + r ≤ µ(x_0, y*) + r + ε ≤ µ(x_0, y_0*) + 2r + ε.

Equivalently,

    2 R_{t−1}(y) ≥ µ(x_0, y_0*) − µ(x_0, y) − 2r.

We substitute the definition of R_{t−1}(y) into this inequality and square both sides of the inequality. (Note that both sides are positive.) This gives an upper bound on n_T(y) = n_{t−1}(y) + 1:

    n_T(y) = n_{t−1}(y) + 1 ≤ 16 i / ( µ(x_0, y_0*) − µ(x_0, y) − 2r )².

Combining with (5) we have

    R̂_y ≤ n_T(y) [ µ(x_0, y_0*) − µ(x_0, y) + 3r ]
        ≤ n_T(y) [ µ(x_0, y_0*) − µ(x_0, y) − 2r ] + 5r n_T(y)
        ≤ 16 i / ( µ(x_0, y_0*) − µ(x_0, y) − 2r ) + 5r n_T(y).

Using the definition of a bad ad we get that

    ∀ y ∈ Y_bad :  R̂_y ≤ 16 i / r + 5r n_T(y).        (7)

Summing over all ads, both bad and good, we have

    R̂ = Σ_{y ∈ Y_good} R̂_y + Σ_{y ∈ Y_bad} R̂_y
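Summed over the at most K candidate ads in Y_0 (splitting into the good and the bad ads as above), these per-ad bounds are what yield, for cluster j in phase i, a per-cluster bound of the form

    R_{i,j}(T) ≤ 6 r n_j + K ( 16 i / r + 1 ),

where n_j is the number of queries from cluster X_j during the phase; this is the inequality invoked as "the preceding lemma" in the proof that follows.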
Proof. Let n_j denote the number of queries belonging to cluster X_j. Clearly n = Σ_{j=1}^{N} n_j. From the preceding lemma we have

    R_i(T) = Σ_{j=1}^{N} R_{i,j}(T) ≤ Σ_{j=1}^{N} [ 6 r n_j + K ( 16 i / r + 1 ) ]
           ≤ 6 r n + N K ( 16 i / r + 1 ).

Lemma 6. For any T ≥ 0, the regret of the query-ad-clustering algorithm is bounded as

    R_A(T) ≤ (24 + 64 c d log_2 T + 4 c d) · T^{(a′+b′+1)/(a′+b′+2)} = O(T^γ).

The lemma proves the first part of Theorem 1.

Proof. Let k be the last phase, that is, k is such that 2^k ≤ T < 2^{k+1}. In other words, k = ⌊log_2 T⌋. We sum the regret over all phases 0, 1, ..., k. We use the preceding lemma and recall that in phase i

    r = 2^{−i/(a′+b′+2)},   N = c · 2^{a′i/(a′+b′+2)},   K = d · 2^{b′i/(a′+b′+2)},   n ≤ 2^i.

We have

    R_A(T) = Σ_{i=0}^{k} R_i(T)
           ≤ Σ_{i=0}^{k} [ 6 · 2^{−i/(a′+b′+2)} · 2^i + c · 2^{a′i/(a′+b′+2)} · d · 2^{b′i/(a′+b′+2)} · ( 16 i / 2^{−i/(a′+b′+2)} + 1 ) ]
           ≤ Σ_{i=0}^{k} [ 6 · 2^{i(a′+b′+1)/(a′+b′+2)} + 16 i c d · 2^{i(a′+b′+1)/(a′+b′+2)} + c d · 2^{i(a′+b′)/(a′+b′+2)} ].

Bounding each of the three geometric sums by a constant multiple of its largest term (the middle sum picks up the extra factor i ≤ log_2 T) and using 2^k ≤ T yields the bound stated in the lemma.

While the query-ad-clustering algorithm achieves what turns out to be the optimal regret bound, we note that a modification of the Exp4 "experts" algorithm of Auer et al. [2003] achieves the same bound (but we discuss the problems with this algorithm below). Each expert is defined by a mapping f : {X_1, ..., X_N} → Y_0: given a query x ∈ X, the expert finds the appropriate cluster X_x and recommends f(X_x). There are E = ((1/ε)^b)^{(1/ε)^a} such experts (mappings), and one of them is ε-close to the Bayes optimal strategy. The regret bound of Auer et al. [2003] for Exp4 gives O(√(T |Y_0| log E)) with respect to the best expert, which in turn has regret at most εT with respect to the Bayes optimal strategy; setting ε = T^{−1/(a+b+2)} we retrieve the same regret upper bound as query-ad-clustering. However, the problem with this algorithm is that it must keep track of an extremely large number, E, of experts, while ignoring the structure of our model—it does not exploit the fact that a bandit algorithm can be run for each context "piece" as opposed to each expert.

3 A LOWER BOUND

In this section we prove, for any γ < (ã+b̃+1)/(ã+b̃+2), a lower bound Ω(T^γ) on the regret of any algorithm for a contextual Lipschitz MAB (X, Y) with ã = PACK(X, L_X) and b̃ = PACK(Y, L_Y). On the highest level, the main idea of the lower bound is a simple averaging argument. We construct several "hard" instances and we show that the average regret of any algorithm on those instances is Ω(T^γ).

Before we construct the instances we define several parameters that depend on (X, Y) and γ. We define a′, b′ so that a′ ∈ [0, ã], b′ ∈ [0, b̃] and γ = (a′+b′+1)/(a′+b′+2). Moreover, if ã > 0 we ensure that a′ ∈ (0, ã), and likewise if b̃ > 0 we ensure that b′ ∈ (0, b̃). Let c, d be constants such that for any r ∈ (0, 1] there exist 2r-separated subsets of X, Y of sizes at least c r^{−a′} and d r^{−b′} respectively. Existence of such constants is guaranteed by the definition of the packing dimension. We also use positive constants α, β, C, T_0 that can be expressed in terms of a′, b′, c, d only. We do not give the formulas for these constants; in principle they can be extracted from the proofs.

Hard instances: Let the time horizon T be given. The "hard" instances are constructed as follows. Let r = α · T^{−1/(a′+b′+2)} and let X_0 ⊆ X, Y_0 ⊆ Y be 2r-separated subsets of the sizes guaranteed above. For every mapping v ∈ Y_0^{X_0} we define an instance µ_v by

    µ_v(x, y) = 1/2 + max{ 0, r − L_Y(y, v(x_0)) − L_X(x, x_0) },

where x_0 ∈ X_0 is the point of X_0 closest to x. Furthermore, we assume that in each round t the payoff µ̂_t the algorithm receives lies in {0, 1}, that is, µ̂_t is a Bernoulli random variable with parameter µ_v(x_t, y_t).

Now, we choose a sequence of T queries. The sequence of queries consists of |X_0| subsequences, one for each x_0 ∈ X_0, concatenated together. For each x_0 ∈ X_0 the corresponding subsequence consists of M = ⌊T/|X_0|⌋ (or M = ⌊T/|X_0|⌋ + 1) copies of x_0. In Lemma 7 we lower bound the contribution of each subsequence to the total regret.
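To see how these choices produce the claimed rate, the following back-of-the-envelope calculation may help; it simply combines the bound of Lemma 7 below with |X_0| ≈ c r^{−a′}, |Y_0| ≈ d r^{−b′} and M ≈ T/|X_0|, absorbing all constant factors into C:

    Σ_{x_0 ∈ X_0} β √(|Y_0| M) ≈ |X_0| · β √(|Y_0| T / |X_0|) = β √(|Y_0| |X_0| T)
        ≈ β √(c d) · r^{−(a′+b′)/2} · √T
        = β √(c d) · α^{−(a′+b′)/2} · T^{1/2 + (a′+b′)/(2(a′+b′+2))}
        = C · T^{(a′+b′+1)/(a′+b′+2)} = C · T^γ.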
The proof of Lemma 7 is an adaptation of the proof of Theorem 6.11 from Cesa-Bianchi and Lugosi [2006, Chapter 6], a lower bound for the finitely-armed bandit problem. In Lemma 8 we sum the contributions together and give the final lower bound.

Lemma 7. For x_0 ∈ X_0 consider a sequence of M copies of the query x_0. Then for T ≥ T_0 and for any algorithm A, the average regret on this sequence of queries is lower bounded as

    R̄_{x_0} = (1/|Y_0|^{|X_0|}) Σ_{v ∈ Y_0^{X_0}} R_A^v(M) ≥ β √(|Y_0| M),

where R_A^v(M) denotes the regret on instance µ_v.

Proof. Deferred to the full version of the paper.

Lemma 8. For any algorithm A, there exist a v ∈ Y_0^{X_0}, an instance µ_v and a sequence of T ≥ T_0 queries on which the regret satisfies

    R_A(T) ≥ C · T^γ.

Proof. We use the preceding lemma and sum the regret over all x_0 ∈ X_0:

    sup_{v ∈ Y_0^{X_0}} R_A^v(T) ≥ (1/|Y_0|^{|X_0|}) Σ_{v ∈ Y_0^{X_0}} R_A^v(T)

Our model is a strict generalization of previously studied multi-armed bandit settings where no side information is given in each round. We believe that our model applies to many other real-life scenarios where additional information is available that affects the rewards of the actions.

We present a very natural and conceptually simple algorithm, called query-ad-clustering, which, roughly speaking, clusters the contexts into similar regions and runs a multi-armed bandit algorithm for each context cluster. When the query and ad spaces are endowed with a metric for which the reward function is Lipschitz, we prove an upper bound on the regret of query-ad-clustering and a lower bound on the regret of any algorithm, showing that query-ad-clustering is optimal. Specifically, the upper bound O(T^{(a+b+1)/(a+b+2)+ε}) depends on the covering dimensions of the query space (a) and the ad space (b), and the lower bound Ω(T^{(ã+b̃+1)/(ã+b̃+2)−ε}) depends on the packing dimensions of the spaces (ã, b̃). For bounded Euclidean spaces and finite sets, these dimensions are equal and imply nearly tight bounds on the regret. The lower bound can be strengthened to Ω_∞(T^γ) for any γ < max{ (a+b̃+1)/(a+b̃+2), (ã+b+1)/(ã+b+2) }. So, if either ã = a or b̃ = b, then we can still prove a lower bound that matches the upper bound. However, this lower bound holds "only" for infinitely many time horizons T (as opposed to all horizons). It seems that for Lipschitz contextual MABs where ã ≠ a and b̃ ≠ b one needs to craft a different notion of dimension, which would somehow capture both the covering and the packing behavior of the spaces.
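To make these rates concrete, a few instantiations of the upper-bound exponent (a+b+1)/(a+b+2) are worth recording (simple arithmetic on the exponent, included only as illustration):

• finite query and ad spaces: a = b = 0, exponent 1/2, i.e. regret O(T^{1/2+ε}), the familiar worst-case √T behaviour of finite-armed bandits up to the ε slack;
• a singleton query space and ads from [0, 1]: a = 0, b = 1, exponent 2/3, recovering the T^{2/3} rate of the continuum-armed bandit of Kleinberg [2005a];
• queries and ads both from [0, 1]: a = b = 1, exponent 3/4, i.e. regret O(T^{3/4+ε}).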
References

…vances in Neural Information Processing Systems 19 (NIPS 2007), pages 49–56. MIT Press, 2007.

Peter Auer, Nicolò Cesa-Bianchi, and Paul Fischer. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47(2-3):235–256, 2002.

Peter Auer, Nicolò Cesa-Bianchi, Yoav Freund, and Robert E. Schapire. The nonstochastic multiarmed bandit problem. SIAM Journal on Computing, 32(1):48–77, 2003.

Peter Auer, Ronald Ortner, and Csaba Szepesvári. Improved rates for the stochastic continuum-armed bandit problem. In Proceedings of the 20th Annual Conference on Learning Theory (COLT 2007), pages 454–468. Springer, 2007.

Sébastien Bubeck, Rémi Munos, Gilles Stoltz, and Csaba Szepesvári. Online optimization in X-armed bandits. In NIPS, pages 201–208, 2008.

Nicolò Cesa-Bianchi and Gábor Lugosi. Prediction, Learning, and Games. Cambridge University Press, 2006.

Luc Devroye and Gábor Lugosi. Combinatorial Methods in Density Estimation. Springer, 2001.

Eyal Even-Dar, Shie Mannor, and Yishay Mansour. Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems. Journal of Machine Learning Research, 7:1079–1105, 2006.

Abraham D. Flaxman, Adam T. Kalai, and H. Brendan McMahan. Online convex optimization in the bandit setting: gradient descent without a gradient. In Proceedings of the Sixteenth Annual ACM-SIAM Symposium on Discrete Algorithms (SODA 2005), pages 385–394. Society for Industrial and Applied Mathematics, Philadelphia, PA, USA, 2005.

Alexander Goldenshluger and Assaf Zeevi. Performance limitations in bandit problems with side observations. Manuscript, 2007.

Wassily Hoeffding. Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association, 58(301):13–30, 1963.

Robert D. Kleinberg. Nearly tight bounds for the continuum-armed bandit problem. In Lawrence K. Saul, Yair Weiss, and Léon Bottou, editors, Advances in Neural Information Processing Systems 17 (NIPS 2005), pages 697–704. MIT Press, 2005a.

Robert D. Kleinberg. Online Decision Problems with Large Strategy Sets. PhD thesis, Massachusetts Institute of Technology, June 2005b.

Robert D. Kleinberg, Aleksandrs Slivkins, and Eli Upfal. Multi-armed bandits in metric spaces. In Proceedings of the 40th Annual ACM Symposium on Theory of Computing (STOC 2008), pages 681–690. Association for Computing Machinery, 2008.

T. L. Lai and Herbert Robbins. Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 6(1):4–22, 1985.

John Langford. How do we get weak action dependence for learning with partial observations? Blog post: http://hunch.net/?p=421, September 2008.

John Langford and Tong Zhang. The epoch-greedy algorithm for multi-armed bandits with side information. In NIPS, 2007.

Richard S. Sutton and Andrew G. Barto. Reinforcement Learning. MIT Press, 1998.

Chih-Chun Wang, Sanjeev R. Kulkarni, and H. Vincent Poor. Bandit problems with side observations. IEEE Transactions on Automatic Control, 50(3):338–355, May 2005.