Streaming Algorithms: CS6234 Advanced Algorithms February 10 2015

The document summarizes algorithms for processing data streams with limited memory. It begins by introducing the stream model and objectives of computing functions over data streams using sublinear memory. It then overviews various streaming algorithms including the Count-Min Sketch, Bloom Filter, and AMS Sketch. The document specifically describes the Bloom Filter algorithm which uses hash functions to probabilistically determine set membership with false positives but no false negatives using sublinear space.


Streaming Algorithms

CS6234 Advanced Algorithms


February 10 2015

1
The stream model
• Data sequentially enters at a rapid rate from one or more inputs

• We cannot store the entire stream

• Processing in real-time

• Limited memory (usually sublinear in the size of the stream)

• Goal: Compute a function of the stream, e.g., median, number of
distinct elements, longest increasing subsequence

• An approximate answer is usually preferable

2
Overview
Counting bits with DGIM algorithm

Bloom Filter

Count-Min Sketch

Approximate Heavy Hitters

AMS Sketch

AMS Sketch Applications

3
Counting bits with DGIM
algorithm
Presented by
Dmitrii Kharkovskii

4
Sliding windows
• A useful model: queries are about a window of length N

• The N most recent elements received (or last N time units)

• Interesting case: N is still so large that it cannot be stored

• Or, there are so many streams that windows for all cannot be stored

5
Problem description
• Problem
• Given a stream of 0’s and 1’s
• Answer queries of the form “how many 1’s in the last k bits?” where k ≤ N
• Obvious solution
• Store the most recent N bits (i.e., window size = N)
• When a new bit arrives, discard the (N+1)st most recent bit
• Real Problem
• Slow: need to scan k bits to answer a query
• What if we cannot afford to store N bits?
• Estimate with an approximate answer
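The obvious solution above can be sketched in a few lines (an illustrative Python snippet, not part of the original slides):

```python
from collections import deque

class ExactWindowCounter:
    """The obvious solution: store the last N bits, using O(N) space."""
    def __init__(self, N):
        self.window = deque(maxlen=N)  # bits older than N fall off automatically

    def update(self, bit):
        self.window.append(bit)

    def count(self, k):
        # O(k) scan of the k most recent bits
        return sum(list(self.window)[-k:])
```

This is exact but pays O(N) space and O(k) query time, which motivates DGIM.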

6
Datar-Gionis-Indyk-Motwani Algorithm (DGIM)
 Overview

• Approximate answer

• Uses O(log² N) of memory

• Performance guarantee: error no more than 50%

• Possible to decrease the error to any fraction ε > 0 with O((1/ε) log² N) memory

• Possible to generalize to streams of positive integers

7
Main idea of the algorithm

Represent the window as a set of exponentially growing non-overlapping buckets

8
Timestamps
• Each bit in the stream has a timestamp - its position in the stream from the
beginning.

• Record timestamps modulo N (the window size) - uses O(log N) bits

• Store the most recent timestamp to identify the position of any other bit in the
window

9
Buckets
• Each bucket has two components:

• Timestamp of its most recent end. Needs O(log N) bits

• Size of the bucket - the number of ones in it.

• Size is always a power of 2, i.e., 2^j.

• To store the exponent j we need O(log log N) bits

• Each bucket needs O(log N) bits

10
Representing the stream by buckets
• The right end of a bucket is always a position with a 1.
• Every position with a 1 is in some bucket.
• Buckets do not overlap.
• There are one or two buckets of any given size, up to some maximum size.
• All sizes must be a power of 2.
• Buckets cannot decrease in size as we move to the left (back in time).

11
Updating buckets when a new bit arrives
• Drop the oldest bucket if it no longer overlaps the window

• If the current bit is zero, no changes are needed

• If the current bit is one

• Create a new bucket with it. Size = 1, timestamp = current time modulo N.

• If there are 3 buckets of size 1, merge two oldest into one of size 2.

• If there are 3 buckets of size 2, merge two oldest into one of size 4.

• ...

12
Example of updating process

13
Query Answering
How many ones are in the most recent k bits?

• Find all buckets overlapping the last k bits

• Sum the sizes of all but the oldest one

• Add half of the size of the oldest one


Example: Ans = 1 + 1 + 2 + 4 + 4 + 8 + 8/2 = 24
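The update and query rules above can be sketched as follows (an illustrative Python implementation, not the lecture's code; for simplicity timestamps are kept absolute rather than modulo N):

```python
class DGIM:
    """DGIM over a window of N bits. Buckets are (timestamp, size), newest first."""
    def __init__(self, N):
        self.N = N          # window size
        self.t = 0          # current time
        self.buckets = []   # list of (timestamp, size), newest first

    def update(self, bit):
        self.t += 1
        # drop the oldest bucket once it slides out of the window
        if self.buckets and self.buckets[-1][0] <= self.t - self.N:
            self.buckets.pop()
        if bit == 1:
            self.buckets.insert(0, (self.t, 1))
            # merge whenever three buckets share a size: combine the two oldest
            i = 0
            while i + 2 < len(self.buckets):
                if self.buckets[i][1] == self.buckets[i + 2][1]:
                    ts = self.buckets[i + 1][0]       # more recent end survives
                    size = self.buckets[i + 1][1] * 2
                    self.buckets[i + 1:i + 3] = [(ts, size)]
                else:
                    i += 1

    def count(self, k):
        """Estimate the number of 1s among the last k bits (k <= N)."""
        total, oldest = 0, 0
        for ts, size in self.buckets:
            if ts > self.t - k:   # bucket overlaps the last k bits
                total += size
                oldest = size     # last matching bucket is the oldest one
        return total - oldest + oldest / 2
```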

14
Memory requirements
• O(log N) distinct bucket sizes, at most two buckets per size, and O(log N) bits
per bucket: O(log² N) bits in total

15
Performance guarantee
• Suppose the last (oldest) bucket has size 2^j.

• By taking half of it, the maximum error is 2^(j-1)

• There is at least one bucket of every size less than 2^j

• The true sum is at least 1 + 2 + 4 + … + 2^(j-1) = 2^j - 1

• The first bit of the last bucket is always equal to 1.

• Error is at most 50%

16
References

J. Leskovec, A. Rajaraman, J. Ullman. “Mining of Massive Datasets”.


Cambridge University Press

18
Bloom Filter

Presented by-
Naheed Anjum Arafat

19
Motivation:
The “Set Membership” Problem
• x: an element
• S: a finite set of elements

• Input: x, S
• Output:
• True (if x in S)
• False (if x not in S)

Streaming constraints:
• Limited space per item
• Limited processing time per item
• Approximate answer based on a summary/sketch of the data stream in memory

Solution (non-streaming): binary search on a sorted array of size |S|. Runtime complexity: O(log |S|)
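The baseline binary-search solution takes a few lines (an illustrative Python snippet, not from the slides):

```python
import bisect

def member(sorted_s, x):
    """Exact membership by binary search: O(log |S|) time, O(|S|) space."""
    i = bisect.bisect_left(sorted_s, x)
    return i < len(sorted_s) and sorted_s[i] == x
```

The Bloom filter trades this exactness for far less space.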

20
Bloom Filter

• Consists of
• a vector of n Boolean values, initially all set false (space: O(n))
• k independent and uniform hash functions h1, …, hk,
each outputting a value within the range {0, 1, … , n-1}

F F F F F F F F F F
0 1 2 3 4 5 6 7 8 9

n = 10
21
Bloom Filter
• For each element s ∈ S, the Boolean values at positions h1(s), h2(s), …, hk(s)
are set true.
• Complexity of insertion: O(k)

Example (k = 3): s1 hashes to positions 1, 4, and 6, so those entries become T:

F T F F T F T F F F
0 1 2 3 4 5 6 7 8 9

k = 3
22
Bloom Filter
• For each element s ∈ S, the Boolean values at positions h1(s), h2(s), …, hk(s)
are set true.
• Note: a particular Boolean value may be set to true several times.

Example (k = 3): s2 hashes to positions 4, 7, and 9; position 4 was already set by s1:

F T F F T F T T F T
0 1 2 3 4 5 6 7 8 9

k = 3
23
Algorithm to Approximate Set Membership Query
Input: x (may or may not be an element of S)
Output: Boolean
Runtime complexity: O(k)

For all i ∈ {1, 2, …, k}
    if B[hi(x)] is False
        return False
return True

Example (k = 3): every probed position of x is already set, so the query returns True:

F T F F T F T T F T
0 1 2 3 4 5 6 7 8 9
24
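The insertion and query procedures can be sketched together (an illustrative Python implementation; simulating the k independent hash functions with salted SHA-256 is an implementation choice, not from the slides):

```python
import hashlib

class BloomFilter:
    """Bloom filter with n bits and k hash functions (illustrative sketch)."""
    def __init__(self, n, k):
        self.n, self.k = n, k
        self.bits = [False] * n

    def _hashes(self, x):
        # derive k hash values by salting SHA-256 with the function index
        for i in range(self.k):
            h = hashlib.sha256(f"{i}:{x}".encode()).digest()
            yield int.from_bytes(h[:8], "big") % self.n

    def add(self, x):
        # O(k) insertion: set the k probed bits
        for pos in self._hashes(x):
            self.bits[pos] = True

    def __contains__(self, x):
        # O(k) membership query: all k probed bits must be set
        return all(self.bits[pos] for pos in self._hashes(x))
```

Added elements are always reported present (no false negatives); a never-added element is reported present only if all k of its bits happen to be set.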
Algorithm to Approximate Set Membership Query

False Positive!!

Example (k = 3): x was never inserted, but it hashes to positions 1, 6, and 9,
all of which were set by s1 and s2, so the filter wrongly answers True.

F T F F T F T T F T
0 1 2 3 4 5 6 7 8 9

k = 3
25
Error Types

• False Negative – Answering “is not there” on an element which “is there”
• Never happens for Bloom Filter

• False Positive – Answering “is there” for an element which “is not there”
• Might happen. How likely?

26
Probability of false positives
S1 S2

F T F T F F T F T F

n = size of table
m = number of items
k = number of hash functions

Consider a particular bit j, 0 ≤ j ≤ n−1
Probability that one hash function does not set bit j after hashing 1 item: 1 − 1/n
Probability that one hash function does not set bit j after hashing m items: (1 − 1/n)^m

27
Probability of false positives

n = size of table
m = number of items
k = number of hash functions

Probability that none of the k hash functions sets bit j after hashing m items: (1 − 1/n)^km

We know that (1 − 1/n)^n ≈ e^−1, so:
(1 − 1/n)^km ≈ e^−km/n

28
Probability of false positives

n = size of table
m = number of items
k = number of hash functions

Probability that bit j is not set: ≈ e^−km/n

The probability of having all k bits of a new element already set
(approximate probability of a false positive): (1 − e^−km/n)^k

For fixed m, n, which value of k will minimize this bound?  kopt = (n/m) ln 2

n/m = bits per item

With k = kopt, the probability of a false positive is ≈ (0.6185)^(n/m)
29
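A quick numeric check of the optimum (illustrative Python; n = 1000 and m = 100 are arbitrary example values, not from the slides):

```python
import math

# For fixed n, m the false-positive bound (1 - e^{-km/n})^k
# is minimized near k = (n/m) ln 2.
n, m = 1000, 100

def fp_bound(k):
    return (1 - math.exp(-k * m / n)) ** k

k_opt = (n / m) * math.log(2)            # continuous optimum, about 6.93
best = min(range(1, 20), key=fp_bound)   # integer k minimizing the bound
```

With 10 bits per item the best integer choice is k = 7, matching the rounded formula.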
Bloom Filters: cons

• Has a (small) false positive probability


• Cannot handle deletions
• The size of the bit vector has to be set a priori in order to maintain a
predetermined FP rate. Resolved in “Scalable Bloom Filters” –
Almeida, Paulo; Baquero, Carlos; Preguiça, Nuno; Hutchison, David (2007), "Scalable Bloom
Filters", Information Processing Letters 101 (6): 255–261

30
References

• https://en.wikipedia.org/wiki/Bloom_filter
• Graham Cormode, Sketch Techniques for Approximate Query Processing, AT&T Research
• Michael Mitzenmacher, Compressed Bloom Filters, Harvard University, Cambridge

31
Count-Min Sketch

Erick Purwanto
A0050717L
Motivation: Count-Min Sketch
• Implemented in real systems
– AT&T: network switches analyze network traffic using limited memory
– Google: implemented on top of the MapReduce parallel processing infrastructure
• Simple, and used to solve other problems
– Heavy Hitters by Joseph
– Second Moment, AMS Sketch by Manupa
– Inner Product, Self Join by Sapumal
Frequency Query
• Given a stream of updates (increments) to a frequency vector of length m
– we want to know, at each time, the frequency f_j of any item j
– assume f_j ≥ 0

• Trivial if we keep a count array of length m
– we want sublinear space

• Goal: probabilistically approximately correct answers
Count-Min Sketch
• Assumption:
– a family of pairwise-independent hash functions
– sample d hash functions h_i : [1, m] → [1, w]

• Use: d independent hash functions and a d × w integer array CM[ ]


Count-Min Sketch
• Algorithm to Update:
– Inc(j): for each row i, CM[i, h_i(j)] += 1

(figure: item j is hashed by h_1, …, h_d; one counter per row is incremented)
Count-Min Sketch
• Algorithm to estimate a Frequency Query:
– Count(j): f̂_j = min over rows i of CM[i, h_i(j)]

(figure: item j is hashed by h_1, …, h_d; the minimum of the d probed counters is returned)
Collision
• Entry CM[i, h_i(j)] is an estimate of the frequency of item j at row i
– it overcounts when other items collide into the same cell, e.g. when several
of the stream items 3, 5, 5, 8, 5, 2, 5 hash to one cell of the row

• Let f_j be the frequency of item j, and let the random variable X_ij be the
total frequency of all other items j' ≠ j with h_i(j') = h_i(j)
Count-Min Sketch Analysis

(figure: row i, cell h_i(j) among cells 1 … w)

• Estimated frequency of j at row i: CM[i, h_i(j)] = f_j + X_ij ≥ f_j
Count-Min Sketch Analysis
• Let ε be the approximation error, and set w = ⌈e/ε⌉
• The expectation of the other items’ contribution:
E[X_ij] ≤ ‖f‖₁ / w ≤ (ε/e)‖f‖₁
Count-Min Sketch Analysis
• Markov Inequality: Pr[X ≥ a] ≤ E[X]/a for nonnegative X
• Probability that one row’s estimate is far from the true value:
Pr[X_ij ≥ ε‖f‖₁] ≤ E[X_ij]/(ε‖f‖₁) ≤ 1/e
Count-Min Sketch Analysis
• Let δ be the failure probability, and set d = ⌈ln(1/δ)⌉
• The rows are independent, so the probability that the final estimate is far
from the true value: Pr[f̂_j ≥ f_j + ε‖f‖₁] ≤ (1/e)^d ≤ δ
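The parameter choice can be computed directly (illustrative Python; the ε and δ values are arbitrary examples):

```python
import math

# Standard CM sketch sizing: w = ceil(e / eps) bounds the per-row error by
# eps * ||f||_1 with probability >= 1 - 1/e; d = ceil(ln(1 / delta)) rows
# drive the overall failure probability down to delta.
eps, delta = 0.01, 0.001
w = math.ceil(math.e / eps)         # counters per row
d = math.ceil(math.log(1 / delta))  # number of rows
```

Here w = 272 and d = 7, so 272 × 7 counters suffice regardless of the domain size m.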
Count-Min Sketch
• Result
– dynamic data structure CM supporting item frequency queries
– set w = ⌈e/ε⌉ and d = ⌈ln(1/δ)⌉
– with probability at least 1 − δ:  f_j ≤ f̂_j ≤ f_j + ε‖f‖₁

– sublinear space O((1/ε) log(1/δ)): depends on neither m nor the stream length


– running time O(log(1/δ)) per update and frequency query
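A minimal sketch of the data structure (illustrative Python; building the pairwise-independent hash family on top of Python's built-in hash() is an implementation choice, not from the slides):

```python
import random

class CountMinSketch:
    """Count-Min sketch: d rows of w counters, one pairwise-independent
    hash per row. Standard sizing: w = ceil(e/eps), d = ceil(ln(1/delta))."""
    def __init__(self, w, d, seed=0):
        rng = random.Random(seed)
        self.w, self.d = w, d
        self.table = [[0] * w for _ in range(d)]
        self.p = 2_147_483_647  # prime modulus for the hash family
        self.ab = [(rng.randrange(1, self.p), rng.randrange(self.p))
                   for _ in range(d)]

    def _hash(self, i, x):
        # pairwise-independent hash: ((a*x + b) mod p) mod w
        a, b = self.ab[i]
        return ((a * hash(x) + b) % self.p) % self.w

    def update(self, x, c=1):
        # O(d) per update: increment one counter per row
        for i in range(self.d):
            self.table[i][self._hash(i, x)] += c

    def query(self, x):
        # min over rows; never underestimates the true frequency
        return min(self.table[i][self._hash(i, x)] for i in range(self.d))
```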
Approximate Heavy Hitters

TaeHoon Joseph, Kim


Count-Min Sketch (CMS)
• Update takes O(d) time

– updates d values, one per row

• Query takes O(d) time

– returns the minimum of d values

Heavy Hitters Problem
• Input:
– An array of length n with distinct items

• Objective:
– Find all items that occur more than n/k times in the array
• there can be at most k such items

• Parameter: k

Heavy Hitters Problem: Naïve Solution
• The trivial solution is to use an array
1. Store all distinct items and each item’s frequency
2. Find all items that have frequencies > n/k
ε-Heavy Hitters Problem (ε-HH)
• Relax the Heavy Hitters Problem

• Requires sub-linear space


– cannot solve the exact problem
– parameters: k and ε

1. Returns every item that occurs more than n/k times
2. May also return some items that occur more than n/k − εn times
– Count-Min sketch
Naïve Solution using CMS

(figure: each item j from the stream of n items is hashed into the d × w CM sketch)
Naïve Solution using CMS
• Query the frequency of every possible item
– Return items with estimated frequency ≥ n/k

– slow: one query per item in the domain
Better Solution
• Use CMS to store the frequencies

• Use t/k as the threshold after seeing t items

• Use a MinHeap to store potential heavy hitters


– insert new items into the MinHeap when their estimated frequency reaches the threshold
– delete old items from the MinHeap when their estimated frequency falls below the threshold
ε-Heavy Hitters Problem (ε-HH)
1. Returns every item that occurs more than n/k times
2. May also return some items that occur more than n/k − εn times
– choosing the CMS error ε small enough makes the count estimates accurate
enough for this guarantee
Algorithm: Approximate Heavy Hitters
Input: stream of n items, parameter k

For each item x_t (t = 1, 2, …, n):


1. Update the Count-Min Sketch with x_t
2. Compare the estimated frequency f̂(x_t) with t/k
3. if f̂(x_t) ≥ t/k:
   insert or update x_t in the MinHeap
4. remove any value in the MinHeap with estimated frequency < t/k

Return the items in the MinHeap as the Heavy Hitters
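The heap bookkeeping above can be sketched as follows (illustrative Python; exact dictionary counts stand in for the Count-Min sketch so the heap logic, which is the point of this slide, stays easy to follow):

```python
import heapq
from collections import defaultdict

def approximate_heavy_hitters(stream, k):
    """Return candidate items occurring more than len(stream)/k times."""
    counts = defaultdict(int)   # in the real algorithm: a Count-Min sketch
    heap = []                   # entries are (estimated_count, item)
    in_heap = set()
    for t, x in enumerate(stream, start=1):
        counts[x] += 1                       # sketch update
        if counts[x] >= t / k and x not in in_heap:
            heapq.heappush(heap, (counts[x], x))
            in_heap.add(x)
        # evict items whose (possibly stale) count fell below the threshold
        while heap and heap[0][0] < t / k:
            _, y = heapq.heappop(heap)
            if counts[y] >= t / k:           # still qualifies: refresh count
                heapq.heappush(heap, (counts[y], y))
            else:
                in_heap.discard(y)
        # (heap counts may be stale; the eviction loop refreshes them lazily)
    return {x for _, x in heap}
```

Because heap entries carry the count at insertion time, the eviction loop refreshes any entry whose stale count drops under the rising threshold t/k.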


EXAMPLES

(worked-example slides, shown as figures in the deck: a stream beginning
4, 2, 6, 9, 3, 4, … is fed into the d × w CM sketch. After each arrival the
item’s counters are incremented and its estimated frequency is compared with
the current threshold; the Min-Heap of {count : item} candidates, e.g. {1:4},
{1:2}, {1:6}, {1:9}, grows to entries such as {2:4} and later {16:4}, {20:9},
{23:6}, while items whose estimates fall below the rising threshold are
evicted.)
Analysis
• Because the stream length n is unknown in advance, possible heavy hitters are
computed and stored as every new item comes in
• Maintaining the heap requires extra O(log k) time per item
AMS Sketch : Estimate
Second Moment
Dissanayaka Mudiyanselage Emil Manupa Karunaratne
The Second Moment
• Stream: item j appears f_j times
• The Second Moment: F₂ = Σ_j f_j²

• The trivial solution would be: maintain a histogram of size n and return the sum of
squares
• It is not feasible to maintain that large an array, therefore we intend to find an
approximation algorithm that achieves sub-linear space complexity with bounded
errors
• The algorithm will give an estimate within ε relative error with δ failure probability.
(Two parameters)
The Method

(figure: item j is hashed to one bucket per row; d rows, each update adds ±1)

• j is the next item in the stream.

• 2-wise independent hash functions h_1, …, h_d find the bucket for each row

• After finding the bucket, 4-wise independent hash functions g_1, …, g_d decide
inc/dec: g_i(j) ∈ {+1, −1}

• In summary: for each row i, CM[i, h_i(j)] += g_i(j)
The Method

• Calculate each row estimate: R_i = Σ_b CM[i, b]²

• Final estimate: the median of R_1, …, R_d
• Choose w = O(1/ε²) and d = O(log(1/δ)); by doing so it will give an estimate
with ε relative error and δ failure probability
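The method above can be sketched as follows (illustrative Python, not the lecture's code; the polynomial hash families are the standard constructions described in the later "Hash functions" slides):

```python
import random
import statistics

class AMSSketch:
    """Fast-AMS (tug-of-war) sketch for the second moment F2:
    d rows x w buckets; h_k picks a bucket, g_k gives a +/-1 sign."""
    def __init__(self, w, d, seed=0):
        rng = random.Random(seed)
        self.w, self.d = w, d
        self.p = 2_147_483_647  # prime field for the hash families
        self.table = [[0] * w for _ in range(d)]
        # per row: 2 coefficients for the bucket hash h, 4 for the sign hash g
        self.coef = [[rng.randrange(1, self.p) for _ in range(6)]
                     for _ in range(d)]

    def _bucket(self, k, j):
        # pairwise-independent h_k: ((a*j + b) mod p) mod w
        a, b = self.coef[k][0], self.coef[k][1]
        return ((a * j + b) % self.p) % self.w

    def _sign(self, k, j):
        # 4-wise independent g_k via a degree-3 polynomial mod p, mapped to +/-1
        c = self.coef[k]
        v = (c[2] * j**3 + c[3] * j**2 + c[4] * j + c[5]) % self.p
        return 1 if v % 2 == 0 else -1

    def update(self, j, c=1):
        # for each row k: CM[k, h_k(j)] += c * g_k(j)
        for k in range(self.d):
            self.table[k][self._bucket(k, j)] += c * self._sign(k, j)

    def estimate_f2(self):
        # row estimate: sum of squared buckets; final estimate: median of rows
        return statistics.median(sum(v * v for v in row) for row in self.table)
```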
Why should this method give F2?

(figure: d = 8 log(1/δ) rows; row k adds g_k(j) for item j)

• In the kth row, each bucket b holds Σ over items j with h_k(j) = b of g_k(j) f_j

• Estimate of F2 from the kth row: R_k = sum of squared buckets
• Expanding each row gives: R_k = Σ_j f_j² + Σ_{i≠j colliding} g_k(i) g_k(j) f_i f_j
• First part: exactly F2
• Second part: g(i)g(j) can be +1 or -1 with equal probability, therefore
the expectation is 0.
What guarantee can we give about the accuracy?
• The variance of R_k, a row estimate, is caused by hashing collisions.
• Given the independence of the hash functions, we can safely
state the variance is bounded by Var[R_k] ≤ 2F₂²/w
• Using the Chebyshev Inequality:
Pr[|R_k − F₂| ≥ εF₂] ≤ Var[R_k]/(ε²F₂²) ≤ 2/(wε²)
• Let us assign w = 8/ε², so each row estimate fails with probability at most 1/4

• Still, this failure probability is only linear in 1/w: driving it down to δ
this way would cost too much space


What guarantee can we give about the accuracy?
• We had d hash functions, producing the estimates R1, R2, …, Rd.
• The median being wrong means half of the estimates are wrong
• These are d independent estimates, like coin tosses, which have an
exponentially decaying probability of half landing the same (wrong) way.
• They obey stronger bounds, Chernoff Bounds:
with d = 8 log(1/δ) rows, Pr[the median is off by more than εF₂] ≤ δ
Space and Time Complexity
• E.g., in order to achieve δ = e^−10 failure probability, only 8 × 10
= 80 rows are required
• Space complexity is O((1/ε²) log(1/δ)).
• Time complexity will be explained later along with the application
AMS Sketch and Applications

Sapumal Ahangama
Hash functions
• h_k maps the input domain uniformly onto w buckets
• h_k should come from a pairwise independent family of hash functions, to cancel
out product terms
– Ex: the family h(x) = ((a·x + b) mod p) mod w
– for a and b chosen uniformly from the prime field Z_p
Hash functions
• g_k maps elements from the domain uniformly onto {+1, −1}
• g_k should be four-wise independent

• Ex: the family of degree-3 polynomials
g(x) = 2·(((a·x³ + b·x² + c·x + d) mod p) mod 2) − 1

– for a, b, c, d chosen uniformly from the prime field Z_p.
Hash functions
• These hash functions can be computed very quickly, faster even than
more familiar (cryptographic) hash functions
• For scenarios which require very high throughput, efficient
implementations are available for hash functions,
– Based on optimizations for particular values of p, and partial precomputations

– Ref: M. Thorup and Y. Zhang. Tabulation based 4-universal hashing with


applications to second moment estimation. In ACM-SIAM Symposium on
Discrete Algorithms, 2004
Time complexity - Update
• The sketch is initialized by picking the hash functions to use,
and initializing the array of counters to all zeros
• For each update operation, the item is mapped to an entry in
each row based on the hash functions h_k, multiplied by the
corresponding value of g_k
• Processing each update therefore takes O(d) time
– since each hash function evaluation takes constant time.
Time complexity - Query
• Found by taking the sum of the squares of each row of the
sketch in turn, and taking the median of these sums.
– That is, for each row k, compute R_k = Σ_b CM[k, b]²
– Take the median of the d such estimates

• Hence the query time is linear in the size of the sketch, O(d·w)
Applications - Inner product
• The AMS sketch can be used to estimate the inner product
between a pair of vectors
• Given two frequency distributions f and f′

• The AMS sketch based estimator is an unbiased estimator for the


inner product f · f′ of the vectors
Inner Product
• Two sketches A and B
• Formed with the same parameters and using the same hash
functions (same h_k and g_k)
• The row estimate is the inner product of the rows: R_k = Σ_b A[k, b]·B[k, b]
Inner Product
• Expanding: R_k = Σ_j f_j f′_j + cross-terms from pairs i ≠ j with h_k(i) = h_k(j)

• This shows that the estimate gives f · f′ with additional cross-terms due


to collisions of items under h_k
• The expectation of these cross terms is zero
– Over the choice of the hash functions, as the g function is equally
likely to add as to subtract any given term.
Inner Product – Join size estimation
• Inner product has a natural interpretation, as the size of the
equi-join between two relations…
• In SQL:
SELECT COUNT(*) FROM D, D’ WHERE D.id = D’.id
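Using the same hash functions for both sketches, as the slides require, the inner-product estimator can be sketched as (illustrative Python; the example frequency vectors are arbitrary):

```python
import random
import statistics

def make_row(w, seed):
    """One AMS row: a bucket hash h and a +/-1 sign hash g (illustrative)."""
    rng = random.Random(seed)
    p = 2_147_483_647
    a, b = rng.randrange(1, p), rng.randrange(p)
    c = [rng.randrange(1, p) for _ in range(4)]
    bucket = lambda j: ((a * j + b) % p) % w
    sign = lambda j: 1 if (c[0]*j**3 + c[1]*j**2 + c[2]*j + c[3]) % p % 2 == 0 else -1
    return bucket, sign

def sketch(freqs, rows, w):
    """Sketch a frequency vector using the given (shared) hash rows."""
    out = []
    for bucket, sign in rows:
        row = [0] * w
        for j, f in freqs.items():
            row[bucket(j)] += sign(j) * f
        out.append(row)
    return out

w = 1024
rows = [make_row(w, s) for s in range(5)]  # same hashes for both sketches
f1 = {1: 3, 2: 4, 5: 2}                    # frequency distribution D
f2 = {2: 1, 5: 6, 9: 7}                    # frequency distribution D'
A, B = sketch(f1, rows, w), sketch(f2, rows, w)
# row estimate = dot product of matching rows; final estimate = median
est = statistics.median(sum(x * y for x, y in zip(ra, rb))
                        for ra, rb in zip(A, B))
```

With no collisions a row's dot product is exactly the true inner product (here 4·1 + 2·6 = 16), since the g² terms cancel to 1.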
Example
UPDATE(23, 1)

The sketch starts as a d = 3 by w = 8 array of zeros. Item 23 is mapped to one
column per row by h1, h2, h3, and g gives the sign of the increment:

     1   2   3   4   5   6   7   8
 1   0   0  -1   0   0   0   0   0
 2  -1   0   0   0   0   0   0   0
 3   0   0   0   0   0   0  +1   0

Example
UPDATE(99, 2)

Item 99 is then added with increment 2 (note that row 2 hashes 99 to the same
column as 23, so that counter moves from -1 to -3):

     1   2   3   4   5   6   7   8
 1   0   0  -1   0  +2   0   0   0
 2  -3   0   0   0   0   0   0   0
 3   0   0  +2   0   0   0  +1   0

d = 3, w = 8
