Open navigation menu
Close suggestions
Search
Search
en
Change Language
Upload
Sign in
Sign in
Download free for days
0 ratings
0% found this document useful (0 votes)
78 views
Design and Analysis of Algorithms Lecture Notes by Sandeep Sen
useful libks
Uploaded by
Manmatha Roy
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content,
claim it here
.
Available Formats
Download as PDF or read online on Scribd
Download now
Download
Save Design and Analysis of Algorithms Lecture Notes by... For Later
Download
Save
Save Design and Analysis of Algorithms Lecture Notes by... For Later
0%
0% found this document useful, undefined
0%
, undefined
Embed
Share
Print
Report
0 ratings
0% found this document useful (0 votes)
78 views
Design and Analysis of Algorithms Lecture Notes by Sandeep Sen
useful libks
Uploaded by
Manmatha Roy
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content,
claim it here
.
Available Formats
Download as PDF or read online on Scribd
Download now
Download
Save Design and Analysis of Algorithms Lecture Notes by... For Later
Carousel Previous
Carousel Next
Download
Save
Save Design and Analysis of Algorithms Lecture Notes by... For Later
0%
0% found this document useful, undefined
0%
, undefined
Embed
Share
Print
Report
Download now
Download
You are on page 1
/ 3
Search
Fullscreen
‘A collection of links for streaming algorithms and data structures gistfilet.md 1. General Background and Overview * Probabilistic Date Structures for Web Analytics and Data Mining : A great overview of the space of probabilistic data structures and how they are used in approximation algorithm implementation. * Models and Issues in Data Stream Systems * Philippe Figjolet’s contribution to streaming algorithms : A presentation by Jérémie Lumbroso that visits some of the hostorical perspectives and how it all began with Flajolet * Approximate Frequency Counts over Data Streams by Gurmeet Singh Manku & Rajeev Motwani : One of the early papers on the subject. * Methods for Finding Frequent Items in Data Streams by Graham Cormode & Marios Hadjieleftheriou + The space complexity of approximating the frequency moments by Noga Alon, Yossi Matias, Mario Szegedy : one of the most influential papers introducing succinctness in computing frequency moments * Cuckoo Filter: Practically Better Than Bloom by Bin Fan, David G. Andersen, Michael Kaminsky, Michael D. Mitzenmacher + A Simple Algorithm for Finding Frequent Elements in Streams and Bags by Karp, Shenker and Papadimitriou : used in Spark to find frequent items 2. Hyperloglog and MinHash : Implementation of a form of hyperloglog and adding capabilities of MinHash algorithm on to it which would enable to perform set intersections. While it does require extra processing power to deal with collecting all the minima, it's possible to get satisfactory performance out of the structure for a relatively low storage or memory footprint” 3. Streaming/Sketching Conference from AK Tech : Contains links to videos and slides from the speakers like Muthukrishnan who spoke about Count Min Sketch 4. Q-digest Medians and Beyond: New Aggregation Techniques for Sensor Networks : The paper that Introduced q-digest for range queries and quantile approximation Blog post on q-digest Blog post on approximate quantiles The Art of Approximating Distributions : Histograms end Quantiles at Scale - an alternative approach to q-digest 5. t digest : A new data structure for accurate on-line accumulation of rank-based statistics such as quantiles and trimmed means. Ted Dunning's variant of Q-digest that does someimprovements 6. Implementations * streanvlib : A collection of Stream summarization and cardinality estimation algorithms like CM Sketch, Hyperlogiog, Bloom Filters * Algebird from Twitter * streamDM - Data Mining for Spark Streaming © Sketching library from Yahoo 7. Count-Min Sketch * An Improved Data Stream Summary: The Count-Min Sketch and its Applications - Cormode & Muthukrishnan : The paper that introduced count min sketch * Collection of information on Count Min Sketch * Count Min Sketch by Cormode : Introductory paper * Streaming Algorithms and Sketches - Count Min Sketch on AK Tech Blog + Muthukrishnan telking on Count Min Sketch at AK Tech conference * Sketch Techniques for Approximate Query Processing by Cormode + Sketching data structures - a good overview of Bloom Filters and Count Min Sketch * Sketching can improve linear regression and the talk by David * A Framework for Clustering Massive-Domain Data Streams by Charu Aggarwal * Streaming Anomaly Detection Using Randomized Matrix Sketching by Huang & Kasiviswanathan + Time adaptive sketches for summarizing data streams by Anshumali Shrivastava et. al from Microsoft Research 8. Surveys * References for Data Stream Algorithms by Graham Cormode : an exhaustive set of references with explanations * Data Streams- Algorithms and Applications by S. Muthukrishnan : This is an excellent monograph with surveys of all algorithms related to data streams. Also a free copy of the book is available from Muthu's web site at http://www.cs.rutgers.edu/~muthu/ ‘* Synopses for Massive Data: Samples, Histograms, Wavelets, Sketches by Graham Cormodet, Minos Garofalakis, Peter J. Haas and Chris Jermaine . Describes basic principles and recent developments in approximate query processing. It focuses on four key synopses: random samples, histograms, wavelets, and sketches. It considers issues such as accuracy, space and time efficiency, optimality, practicality, range of applicability, error bounds on query answers, and incremental maintenance. It also discusses the trade-offs between the different synopsis types. 10. Distributed Streams Algorithms for Sliding Windows by Phillip B. Gibbons and Srikanta Tirthapura 11. Frugal Streaming12. A Framework for Clustering Massive-Domain Data Streams by Charu C. Aggarwal 13. A framework for clustering evolving data streams by Charu C. Aggarwal et. al. 14, Unsupervised Feature Selection on Data Streams by Hao Huang 15. Presentations Spark Streaming Use Cases by Paco Nathan Tiny Batches in the wine, Shiny new bits in Spark Streaming by Paco Nathan Real time Data Analysis Patterns by Mikio Braun Streaming Big Data with Apache Spark, Katka and Cassandra by Helena Edelson Streaming Data Analysis and Online Learning by John Myles White Algebra for Analytics by Oscar Boykin @posco 16. Courses Alex Smola course at Berkeley SML: Data Streams Piotr Indyk course at MIT Sketching, Streaming and Sub-linear Space Algorithms Andrew McGregor course at UMass on Advanced Algorithms Amit Chakrabarti course at Dartmouth on Date Stream Algorithms and the entire course notes ina single document Moses Charikar course CS369G: Algorithmic Techniques for Big Data at Stanford Spring Quarter 2016 Piotr Indyk (MIT) and Nelson (Harvard) course on Sketching Algorithms for Big Data - Fall 2017 Nelson's course on Algorithms for Big Data at Harvard in Fall 2015. This includes videos as well. 17. Incremental Learning with Decision Trees for Steamed Data Mining High-Speed Data Streams (Hoeffding Trees) by Pedro Domingos and Geoff Hulten Mining Time Changing Data Streams by G. Hulten, L. Spencer, and P. Domingos. Comprehensive study on techniques of Incremental learning with decision trees for streamed data by Prerana Gupta, Amit Thakkar, Amit Ganatra Use of Hoeffding trees in concept based data stream mining by Hoeglinger, S. and Pears, R. 18, Clustering Data Seams Clustering Data Streams: Theory and Practice by Sudipto Guha, Adam Meyerson, Nina Mishra, Rajeev Motwani and Liadan O'Callaghan Online clustering of data streams by J. Beringer, E. Hillermeier Conquering the divide: Continuous clustering of distributed data streams (2007) by Graham Cormode Clustering on Streams by Suresh Venkat 19. Books
You might also like
Unit 4 Notes PDF
PDF
100% (2)
Unit 4 Notes PDF
27 pages
CS85: Data Stream Algorithms Lecture Notes, Fall 2009: Amit Chakrabarti Dartmouth College
PDF
No ratings yet
CS85: Data Stream Algorithms Lecture Notes, Fall 2009: Amit Chakrabarti Dartmouth College
61 pages
BigData_Mod2
PDF
No ratings yet
BigData_Mod2
12 pages
Data Streams: Models and Algorithms
PDF
No ratings yet
Data Streams: Models and Algorithms
372 pages
Unit 3
PDF
No ratings yet
Unit 3
30 pages
a.
PDF
No ratings yet
a.
3 pages
Mining Data Streams
PDF
No ratings yet
Mining Data Streams
17 pages
Mod4_DWDM_BTECH
PDF
No ratings yet
Mod4_DWDM_BTECH
9 pages
Streaming Algorithms: Ajinkya Potdar Hemanga Krishna Borah
PDF
No ratings yet
Streaming Algorithms: Ajinkya Potdar Hemanga Krishna Borah
47 pages
Mining Data Streams
PDF
No ratings yet
Mining Data Streams
37 pages
Swe2011 Bda - III
PDF
No ratings yet
Swe2011 Bda - III
50 pages
Stream Computing Methods
PDF
No ratings yet
Stream Computing Methods
35 pages
BDA Mod 3
PDF
No ratings yet
BDA Mod 3
57 pages
Big Data Analytics For R-2017 by ArunPrasath S., Sriram Kumar K., Krishna Sankar P.
PDF
No ratings yet
Big Data Analytics For R-2017 by ArunPrasath S., Sriram Kumar K., Krishna Sankar P.
7 pages
Bda Mid Ans
PDF
No ratings yet
Bda Mid Ans
18 pages
Big Data Analytics Unit-2
PDF
No ratings yet
Big Data Analytics Unit-2
11 pages
Methodologies for Stream Data Processing and Stream Data Systems
PDF
No ratings yet
Methodologies for Stream Data Processing and Stream Data Systems
20 pages
DWDM - Unit - VII
PDF
No ratings yet
DWDM - Unit - VII
42 pages
BDA GTU Study Material Presentations Unit-4 29092021094703AM
PDF
No ratings yet
BDA GTU Study Material Presentations Unit-4 29092021094703AM
33 pages
unit-3 notes
PDF
No ratings yet
unit-3 notes
10 pages
Unit 2 BD Mining Data Streams
PDF
No ratings yet
Unit 2 BD Mining Data Streams
34 pages
Bigdata-Mining Data Streams
PDF
No ratings yet
Bigdata-Mining Data Streams
19 pages
Big Data Analytics - Unit 2 Notes
PDF
No ratings yet
Big Data Analytics - Unit 2 Notes
44 pages
Unit2 Bda
PDF
No ratings yet
Unit2 Bda
293 pages
Uint 4miningdatastream 230810162429 9d7c02a7
PDF
No ratings yet
Uint 4miningdatastream 230810162429 9d7c02a7
11 pages
Big Data 3rd Unit
PDF
No ratings yet
Big Data 3rd Unit
16 pages
Crash Course On Data Stream Algorithms: Part I: Basic Definitions and Numerical Streams
PDF
No ratings yet
Crash Course On Data Stream Algorithms: Part I: Basic Definitions and Numerical Streams
76 pages
Bigdata Unit II
PDF
No ratings yet
Bigdata Unit II
19 pages
Bda Sem 7 Book
PDF
No ratings yet
Bda Sem 7 Book
188 pages
BDA
PDF
No ratings yet
BDA
6 pages
Stream
PDF
No ratings yet
Stream
30 pages
Data Stream Mg
PDF
No ratings yet
Data Stream Mg
528 pages
3. Unit 3 - BD - Streaming
PDF
No ratings yet
3. Unit 3 - BD - Streaming
42 pages
5.1 Mining Data Streams
PDF
No ratings yet
5.1 Mining Data Streams
16 pages
TRabl StreamProcessing
PDF
No ratings yet
TRabl StreamProcessing
79 pages
Module II
PDF
No ratings yet
Module II
22 pages
Data Analytics Assignment
PDF
No ratings yet
Data Analytics Assignment
20 pages
Mining Techniques for Streaming Data
PDF
No ratings yet
Mining Techniques for Streaming Data
14 pages
Mining Data Streams
PDF
No ratings yet
Mining Data Streams
33 pages
An Improved Data Stream Summary
PDF
No ratings yet
An Improved Data Stream Summary
18 pages
An Improved Data Stream Summary: The Count-Min Sketch and Its Applications
PDF
No ratings yet
An Improved Data Stream Summary: The Count-Min Sketch and Its Applications
18 pages
Big Data Unit III
PDF
No ratings yet
Big Data Unit III
20 pages
Dwdm Unit 5 Part One
PDF
No ratings yet
Dwdm Unit 5 Part One
29 pages
Kafka
PDF
No ratings yet
Kafka
78 pages
It6006 Data Analytics Syllabus
PDF
No ratings yet
It6006 Data Analytics Syllabus
1 page
Stream Processing and Analytics Handout
PDF
No ratings yet
Stream Processing and Analytics Handout
8 pages
Bigdata Unit-Ii
PDF
No ratings yet
Bigdata Unit-Ii
33 pages
Data Stream Clustering
PDF
No ratings yet
Data Stream Clustering
3 pages
Unit 3-6
PDF
No ratings yet
Unit 3-6
14 pages
Data Analytics Unit 3
PDF
No ratings yet
Data Analytics Unit 3
14 pages
Unit 2
PDF
No ratings yet
Unit 2
23 pages
stream-data
PDF
No ratings yet
stream-data
70 pages
Streaming Algorithms For Data in Motion
PDF
No ratings yet
Streaming Algorithms For Data in Motion
11 pages
Sketch Techniques For Approximate Query Processing
PDF
No ratings yet
Sketch Techniques For Approximate Query Processing
67 pages
Big Data IV Nit
PDF
No ratings yet
Big Data IV Nit
15 pages
Machine Learning For Data Streams With Practical Examples In Moa Adaptive Computation And Machine Learning Series Albert Bifet instant download
PDF
No ratings yet
Machine Learning For Data Streams With Practical Examples In Moa Adaptive Computation And Machine Learning Series Albert Bifet instant download
79 pages
DA
PDF
No ratings yet
DA
1 page