Lecture Notes CS:5360 Randomized Algorithms

1 What Is A Randomized Algorithm?
• Monte Carlo algorithms: these algorithms make errors, but with some probability that can be controlled. For example, we can say that a certain algorithm has a 1/10 or maybe a 1/100 probability of making an error. The runtime of a Monte Carlo algorithm can be either deterministic or a random variable.
In order to solve this problem, we can try to find an approximate median of L by using a randomized
step.
FUNCTION: randomizedPartition(L[1..n])
    p <- index chosen uniformly at random from {1,2,...,n}
    L_1 <- empty list; L_2 <- empty list
    for i <- 1 to n do:
        if L[i] <= L[p]:
            L_1 <- L_1 append L[i]
        else:
            L_2 <- L_2 append L[i]
    return (L_1, L_2)
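For concreteness, here is a small Python rendering of this pseudocode; the function name and the zero-based indexing are illustrative choices, not part of the original notes.

import random

def randomized_partition(L):
    """Split L around a uniformly random pivot element."""
    pivot = L[random.randrange(len(L))]  # index chosen uniformly from {0,...,n-1}
    L1 = [x for x in L if x <= pivot]    # elements at most the pivot (pivot included)
    L2 = [x for x in L if x > pivot]     # elements strictly greater than the pivot
    return L1, L2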
Notice that this does not always return a partition that satisfies both requirements above. In fact, it has a 2/3 chance of error: the partition is balanced only when the rank of the pivot L[p] lies in the middle third of the sorted order of L, which happens with probability 1/3.
Lemma 1 The function randomizedPartition runs in O(n) time with an error probability of 2/3.
Using randomizedPartition as a subroutine we can design a Las Vegas algorithm for the
BalancedPartition problem and we can also design a Monte Carlo algorithm with a much
smaller error probability. First, we state the Las Vegas algorithm.
FUNCTION: randomizedPartitionLV(L[1..n])
    repeat:
        (L_1, L_2) <- randomizedPartition(L)
    until: |L|/3 <= |L_1| <= 2|L|/3
    return (L_1, L_2)
This algorithm will not make any errors, but its runtime is a random variable. This is because with probability 1/3 the algorithm performs exactly one iteration of the repeat-until loop, with probability (2/3)(1/3) it performs exactly two iterations, and in general it performs exactly i iterations with probability (2/3)^{i-1}(1/3).
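Since each iteration independently succeeds with probability 1/3, the number of iterations is a geometric random variable, and its expectation gives the bound claimed in Theorem 2 below:
\[
\mathbb{E}[\text{number of iterations}] \;=\; \sum_{i \ge 1} i \left(\frac{2}{3}\right)^{i-1} \frac{1}{3} \;=\; \frac{1}{1/3} \;=\; 3,
\]
so the expected runtime is 3 · O(n) = O(n).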
FUNCTION: randomizedPartitionMC(L[1..n])
    for i <- 1 to k do:
        (L_1, L_2) <- randomizedPartition(L)
        if (|L|/3 <= |L_1| <= 2|L|/3) then:
            return (L_1, L_2)
    endfor
    return "Failed"
Notice that we are independently repeating the randomizedPartition algorithm k times. This
process of increasing the probability of correctness is called probability amplification.
Theorem 2 BalancedPartition can be solved by a Las Vegas algorithm in expected O(n) time.
Theorem 3 BalancedPartition can be solved by a Monte Carlo algorithm in O(kn) time with probability 1 − (2/3)^k.
• To improve memory usage. Random sampling as a way of sparsifying the input and then working with this smaller input is a common technique.
• To make algorithms simpler. For example, see Karger’s min-cut algorithm in the next lecture.
4 Classification of Problems Based on Randomization
• P = the class of decision problems (problems with boolean answers) that can be solved in
polynomial time. We typically say that these can be solved efficiently.
• RP (randomized polynomial) = the class of decision problems L such that L can be solved by a polynomial time algorithm A with the property:
– If x ∈ L, then Pr(A(x) = 1) ≥ 1/2.
– If x ∉ L, then Pr(A(x) = 0) = 1.
Note that in this definition the algorithm A has one-sided error, only for “yes” instances.
Also, we clearly see that P ⊆ RP . Finally, the choice of the constant 1/2 in the above
definition is somewhat arbitrary. By using probability amplification (see below), we can
drive the error probability down quite efficiently.
FUNCTION: amplifiedA(x)
    for i <- 1 to k do:
        if A(x) = 1 then:
            return 1
    endfor
    return 0
In this use of probability amplification, when we input a “yes” instance, each independent run of A fails to output 1 with probability at most 1/2, so the probability of the output being incorrect is at most 2^{-k}. Hence Pr(amplifiedA(x) = 1) ≥ 1 − 2^{-k}.
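Spelled out, this bound uses the independence of the k runs:
\[
\Pr(\text{amplifiedA}(x) = 0 \mid x \in L) \;=\; \prod_{i=1}^{k} \Pr(\text{run } i \text{ outputs } 0) \;\le\; \left(\frac{1}{2}\right)^{k}.
\]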
• coRP = {L | L̄ ∈ RP} = the class of decision problems L such that L can be solved by a polynomial time algorithm A with the property:
– If x ∈ L, then Pr(A(x) = 1) = 1.
– If x ∉ L, then Pr(A(x) = 0) ≥ 1/2.
• BPP (bounded error probabilistic polynomial) = the class of decision problems L such that L has a polynomial time algorithm A with the property:
– If x ∈ L, then Pr(A(x) = 1) ≥ 3/4.
– If x ∉ L, then Pr(A(x) = 0) ≥ 3/4.
(The constant 3/4 is again somewhat arbitrary; any constant strictly greater than 1/2 suffices.) Notice that for problems in BPP, we can make errors on both positive and negative instances of L.
Not much is known about the relationship between P, RP, coRP, and BPP. However – maybe somewhat surprisingly at first glance – many theoretical computer scientists believe the following conjecture.
Conjecture 4 BPP = P.
Figure 1: A Venn diagram detailing various complexity classes.
The point here is that randomization is not expected to help in a “gross” sense, i.e., it will not help us solve in polynomial time a problem that cannot be solved in polynomial time by deterministic means. However, it can improve a running time that is a high-degree polynomial, e.g., O(n^8), to a running time that is a low-degree polynomial, e.g., O(n^2). This conjecture is a major open problem in theoretical computer science.
There is at least one well-known problem that is not known to be in P but is in coRP: the Polynomial Identity Testing problem.
Lecture Notes CS:5360 Randomized Algorithms
Lecture 2: Aug 23, 2018
Scribe: Geoff Converse
Randomized PIT
In the Polynomial Identity Testing (PIT) problem, we are given two polynomials P and Q, each of degree at most d and each efficiently evaluable at any point, and we must decide whether P ≡ Q.
(1) Pick a number t uniformly at random from {1,...,100d}
(2) Evaluate P(t), Q(t)
(3) If P(t) = Q(t)
return YES
else
return NO
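As an illustrative Python sketch, suppose P and Q are given as functions we can evaluate at integer points (this representation and the name randomized_PIT are hypothetical choices made here for concreteness):

import random

def randomized_PIT(P, Q, d):
    """One-sided identity test for polynomials of degree at most d."""
    t = random.randint(1, 100 * d)   # step (1): uniform from {1, ..., 100d}
    if P(t) == Q(t):                 # steps (2)-(3): one evaluation of each
        return "YES"                 # may be wrong, with probability <= 1/100
    return "NO"                      # always correct: P(t) != Q(t) proves P is not Q

For example, with P = lambda x: (x + 1) ** 2 and Q = lambda x: x * x + 2 * x + 1, the identical polynomials make randomized_PIT(P, Q, 2) return "YES" on every run.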
The runtime of this algorithm is O(d) because it takes O(d) time to evaluate P(t) and Q(t). Note that t can be generated by looking at O(log_2(100d)) = O(log d) random bits. If we assume that each random bit can be generated in O(1) time, then Step (1) takes only O(log d) time.
To analyze the error probability of this algorithm, just look at the two possible cases.
• If P ≡ Q, then P(t) = Q(t) for every choice of t, so the algorithm always returns YES and never errs.
• If P ≢ Q, the analysis is slightly more involved. Note that P(t) = Q(t) iff t is a root of P(x) − Q(x). Since P(x) − Q(x) is a nonzero polynomial of degree at most d, by the Fundamental Theorem of Algebra it has at most d roots. Then Pr(P(t) = Q(t)) ≤ d/(100d) = 1/100. So for P ≢ Q, the algorithm returns NO with probability ≥ 99/100.
Definition 6 Events $E_1, \ldots, E_k$ are mutually independent iff for any subset $I \subseteq \{1, 2, \ldots, k\}$, $\Pr\big(\bigcap_{i \in I} E_i\big) = \prod_{i \in I} \Pr(E_i)$.
We have already used the notion of mutual independence in analyzing algorithms that amplify correctness probability by independent repetitions. In some situations, requiring or expecting mutual independence is too much and a weaker notion of independence suffices.
Definition 7 Events $E_1, \ldots, E_k$ exhibit p-wise independence iff for any subset $I \subseteq \{1, 2, \ldots, k\}$ such that $|I| \le p$, $\Pr\big(\bigcap_{i \in I} E_i\big) = \prod_{i \in I} \Pr(E_i)$.
When p = 2, the independence we get is called pairwise independence. We will encounter this later.
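A standard example showing that pairwise independence is strictly weaker than mutual independence: flip two fair coins and let E1 = “the first coin is heads”, E2 = “the second coin is heads”, and E3 = “the two coins disagree”. Any two of these events are independent, but all three together are not. The brute-force check below is an illustrative Python sketch:

from itertools import product

# Sample space: two independent fair coin flips; each outcome has probability 1/4.
outcomes = list(product([0, 1], repeat=2))

E1 = {(a, b) for (a, b) in outcomes if a == 1}   # first coin is heads
E2 = {(a, b) for (a, b) in outcomes if b == 1}   # second coin is heads
E3 = {(a, b) for (a, b) in outcomes if a != b}   # the two flips disagree

def pr(event):
    return len(event) / len(outcomes)

# Pairwise independent: Pr(Ei ∩ Ej) = Pr(Ei) Pr(Ej) for every pair i != j.
assert pr(E1 & E2) == pr(E1) * pr(E2)
assert pr(E1 & E3) == pr(E1) * pr(E3)
assert pr(E2 & E3) == pr(E2) * pr(E3)

# Not mutually independent: Pr(E1 ∩ E2 ∩ E3) = 0, yet the product is 1/8.
assert pr(E1 & E2 & E3) != pr(E1) * pr(E2) * pr(E3)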
Recall the definition of conditional probability:
\[
\Pr(E_1 \mid E_2) = \frac{\Pr(E_1 \cap E_2)}{\Pr(E_2)} \quad \text{if } \Pr(E_2) \neq 0.
\]
This implies that if $\Pr(E_2) \neq 0$, then $\Pr(E_1 \cap E_2) = \Pr(E_1 \mid E_2) \cdot \Pr(E_2)$. More generally, by repeatedly applying the definition,
\[
\Pr(E_1 \cap E_2 \cap \cdots \cap E_k) = \Pr\Big(E_1 \,\Big|\, \bigcap_{j=2}^{k} E_j\Big) \cdot \Pr\Big(E_2 \,\Big|\, \bigcap_{j=3}^{k} E_j\Big) \cdots \Pr(E_{k-1} \mid E_k) \cdot \Pr(E_k).
\]
We will now use the above formula in the analysis of Karger’s min-cut algorithm. There are many
ways of solving the min-cut problem in polynomial time, but Karger’s algorithm showcases the
simplicity and elegance one gets by using randomization.
Figure 2: An example of a partition that solves the min-cut problem. Here the size of the min-cut is 2. Notice that the size of the min-cut is always ≤ the minimum degree, as we can always choose S to contain just one vertex.
Note that after each contract operation, the number of vertices decreases by 1. Therefore, the final graph $G_{n-2}$ has only two vertices. This algorithm does not always return the optimal solution, as demonstrated in Figure 4.
Figure 3: An example of the contract operation in action. Notice that contracting a simple graph can produce a multigraph.
Figure 4: Karger’s min-cut algorithm is not always correct. The original graph has a min-cut of size 2, but after one iteration, the min-cut size equals 3.
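Since the pseudocode for Karger’s algorithm itself does not appear above, here is a minimal Python sketch of the contraction process, assuming a connected multigraph given as a list of edge pairs over vertices 0, ..., n−1; the function name and the union-find bookkeeping are illustrative implementation choices.

import random

def karger_min_cut(n, edges):
    """One run of Karger's contraction algorithm.

    n: number of vertices, labeled 0..n-1
    edges: list of (u, v) pairs; parallel edges allowed (multigraph)
    Assumes the multigraph is connected.
    Returns the number of edges crossing the final 2-way partition.
    """
    parent = list(range(n))  # union-find: tracks which vertices have been merged

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path compression
            x = parent[x]
        return x

    vertices = n
    edges = list(edges)
    while vertices > 2:
        u, v = random.choice(edges)   # pick a uniformly random remaining edge
        ru, rv = find(u), find(v)
        if ru == rv:
            edges.remove((u, v))      # self-loop after earlier contractions: discard
            continue
        parent[ru] = rv               # contract: merge the two super-vertices
        vertices -= 1

    # Edges whose endpoints lie in different super-vertices form the cut.
    return sum(1 for (u, v) in edges if find(u) != find(v))

A single run finds a particular min-cut with probability at least 1/(n choose 2), so repeating the run many times and taking the smallest cut found, exactly the probability-amplification idea from Section 1, drives the failure probability down.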