

Distributed Hash Tables


Mike Freedman
COS 461: Computer Networks
Lectures: MW 10-10:50am in Architecture N101

http://www.cs.princeton.edu/courses/archive/spr13/cos461/
Scalable algorithms for discovery
[Figure: origin server with several CDN server caches]

• If many nodes are available to cache, which one should a file be assigned to?

• If content is cached in some node, how can we discover where it is located, avoiding a centralized directory or all-to-all communication?

Akamai CDN: hashing to assign responsibility within a cluster

Today: What if you don't know the complete set of nodes?

Partitioning Problem
• Consider the problem of data partitioning:
– Given document X, choose one of k servers to use

• Suppose we use modulo hashing


– Number servers 1..k
– Place X on server i = (X mod k)
• Problem? Data may not be uniformly distributed

– Place X on server i = hash (X) mod k


• Problem? What happens if a server fails or joins (k → k±1)?
• Problem? What if different clients have different estimates of k?
• Answer: All entries get remapped to new nodes! (see the sketch below)
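A minimal sketch (not from the slides) of why modulo hashing is fragile: with placement by hash(X) mod k, changing k remaps almost every key. The key names and the choice k = 10 are illustrative.

    # Fraction of keys that move when the server count goes from k to k+1
    # under "server = hash(key) mod k" placement.
    import hashlib

    def server_for(key: str, k: int) -> int:
        digest = int(hashlib.sha1(key.encode()).hexdigest(), 16)
        return digest % k

    keys = [f"doc-{i}" for i in range(10_000)]
    k = 10
    moved = sum(server_for(key, k) != server_for(key, k + 1) for key in keys)
    print(f"{moved / len(keys):.0%} of keys remapped")  # expect roughly 1 - 1/(k+1), about 91%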

Consistent Hashing
[Figure: ring of nodes; a blue node issues insert(key1, value) and lookup(key1), which reach the red node storing key1 = value among keys key1, key2, key3]

• Consistent hashing partitions key-space among nodes


• Contact appropriate node to lookup/store key
– Blue node determines red node is responsible for key1

– Blue node sends lookup or insert to red node



Consistent Hashing

[Figure: nodes at ring positions 0000, 0010, 0110, 1010, 1100, 1110, 1111; keys URL1, URL2, URL3 hash to 0001, 0100, 1011 and are assigned to the nearest nodes]

• Partitioning key-space among nodes


– Nodes choose random identifiers: e.g., hash(IP)
– Keys randomly distributed in ID-space: e.g., hash(URL)
– Keys assigned to node “nearest” in ID-space
– Spreads ownership of keys evenly across nodes

Consistent Hashing
[Figure: hash ring with positions 0–15 and a bucket at 4]

• Construction
– Assign n hash buckets to random points on a mod 2^k circle; hash key size = k
– Map each object to a random position on the circle
– Hash of object = closest clockwise bucket
– successor(key) → bucket (see the sketch below)


• Desired features
– Balanced: No bucket has disproportionate number of objects
– Smoothness: Addition/removal of bucket does not cause
movement among existing buckets (only immediate buckets)
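A minimal sketch of the construction above, assuming SHA-1 positions on a mod-2^160 ring; the HashRing class and the bucket/key names are illustrative, not from the slides.

    # Consistent hashing: buckets sit at hashed points on a mod-2^160 ring and
    # each key is owned by the closest clockwise bucket (its successor).
    import bisect
    import hashlib

    RING = 2 ** 160  # SHA-1 gives 160-bit positions

    def h(name: str) -> int:
        return int(hashlib.sha1(name.encode()).hexdigest(), 16) % RING

    class HashRing:
        def __init__(self, buckets):
            # Sorted (position, bucket) pairs around the ring.
            self.points = sorted((h(b), b) for b in buckets)

        def successor(self, key: str) -> str:
            pos = h(key)
            idx = bisect.bisect_left(self.points, (pos,))
            if idx == len(self.points):  # wrapped past the top of the ring
                idx = 0
            return self.points[idx][1]

    ring = HashRing(["node-A", "node-B", "node-C", "node-D"])
    print(ring.successor("http://example.com/index.html"))

Adding or removing a bucket only changes ownership of the keys between it and its predecessor, which is the smoothness property described above.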

Consistent hashing and failures


• Consider a network of n nodes
• If each node has 1 bucket
– Owns 1/n-th of the keyspace in expectation
– Says nothing of request load per bucket

• If a node fails:
(A) Nobody owns the keyspace   (B) Keyspace assigned to a random node
(C) Successor owns the keyspace   (D) Predecessor owns the keyspace

• After a node fails:
(A) Load is equally balanced over all nodes
(B) Some node has disproportionate load compared to others

Consistent hashing and failures


• Consider a network of n nodes
• If each node has 1 bucket
– Owns 1/n-th of the keyspace in expectation
– Says nothing of request load per bucket

• If a node fails:
– Its successor takes over the bucket
– Achieves the smoothness goal: only a localized shift, not O(n)
– But now the successor owns 2 buckets: keyspace of size 2/n

• Instead, if each node maintains v random nodeIDs, not 1
– “Virtual” nodes spread over the ID space, each of size 1/vn
– Upon failure, v successors take over, each now stores (v+1)/vn (see the sketch below)
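A minimal extension of the HashRing sketch above to virtual nodes; v = 4 and the "name#i" encoding of virtual IDs are illustrative choices.

    # Virtual nodes: each physical node owns V points on the ring, so when it
    # fails its keyspace is split across roughly V different successors.
    V = 4  # virtual IDs per physical node (illustrative)

    def virtual_ring(nodes):
        return HashRing([f"{n}#{i}" for n in nodes for i in range(V)])

    ring = virtual_ring(["node-A", "node-B", "node-C"])
    virtual_owner = ring.successor("http://example.com/index.html")
    physical_owner = virtual_owner.split("#")[0]  # strip the virtual-ID suffix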

Consistent hashing vs. DHTs


                             Consistent Hashing   Distributed Hash Tables
Routing table size           O(n)                  O(log n)
Lookup / Routing             O(1)                  O(log n)
Join/leave: routing updates  O(n)                  O(log n)
Join/leave: key movement     O(1)                  O(1)

Distributed Hash Table

[Figure: keyspace leaves 0000 0010 0110 1010 1100 1110 1111, with keys at 0001, 0100, 1011]

• Nodes’ neighbors selected from a particular distribution
– Visualize the keyspace as a tree, organized by distance from a node
– At least one neighbor known per subtree of increasing size/distance from the node
• Route greedily towards the desired key via overlay hops

The Chord DHT


• Chord ring: ID space mod 2^160
– nodeid = SHA1(IP address, i) for i = 1..v virtual IDs
– keyid = SHA1(name) (see the sketch below)

• Routing correctness:
– Each node knows successor and
predecessor on ring

• Routing efficiency:
– Each node knows O(log n) well-
distributed neighbors
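A minimal sketch of this ID assignment; how (IP address, i) is serialized before hashing is my assumption (the slide doesn't specify), and the example IP and key name are made up.

    # Chord-style 160-bit IDs via SHA-1 (SHA-1 already yields exactly 160 bits,
    # so the modulo below never changes the value).
    import hashlib

    RING = 2 ** 160

    def chord_id(data: str) -> int:
        return int(hashlib.sha1(data.encode()).hexdigest(), 16) % RING

    v = 3  # virtual IDs per host (illustrative)
    node_ids = [chord_id(f"128.112.7.33:{i}") for i in range(1, v + 1)]
    key_id = chord_id("lec14-dhts.ppt")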

Basic lookup in Chord


lookup(id):
  if (id > pred.id && id <= my.id)
    return my.id
  else
    return succ.lookup(id)

• Route hop by hop via successors
– O(n) hops to find destination id (a runnable sketch follows)
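A runnable sketch of the pseudocode above; unlike the slide's simplified test, in_interval below also handles the interval that wraps past ID 0. The class and helper names are mine, not Chord's API.

    # Successor-only Chord lookup: O(n) hops around the ring.
    def in_interval(x, a, b):
        # True if x lies in the ring interval (a, b], handling wraparound past 0.
        if a < b:
            return a < x <= b
        return x > a or x <= b

    class Node:
        def __init__(self, node_id):
            self.id = node_id
            self.pred = self.succ = self  # re-linked as nodes join the ring
            self.data = {}                # local key/value store (used later)

        def lookup(self, key_id):
            if in_interval(key_id, self.pred.id, self.id):
                return self                  # this node is responsible for key_id
            return self.succ.lookup(key_id)  # otherwise forward clockwise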

Efficient lookup in Chord


lookup(id):
  if (id > pred.id && id <= my.id)
    return my.id
  else
    // fingers() by decreasing distance
    for finger in fingers():
      if id >= finger.id
        return finger.lookup(id)
    return succ.lookup(id)

• Route greedily via distant “finger” nodes
– O(log n) hops to find destination id

Building routing tables

For i in 1...log n:
  finger[i] = successor( (my.id + 2^i) mod 2^160 )

(a sketch of building and using fingers follows)
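A minimal sketch of finger construction and finger-based routing, reusing RING and the Node/in_interval sketch above. It cheats by using a global view of all nodes to compute successors (real Chord finds fingers via lookups), and its 0..159 indexing differs cosmetically from the slide's 1..log n loop.

    # Build finger[i] = successor((my.id + 2^i) mod 2^160), then route greedily.
    def build_fingers(node, all_nodes, m=160):
        ids = sorted(n.id for n in all_nodes)
        by_id = {n.id: n for n in all_nodes}

        def successor_of(point):
            for i in ids:
                if i >= point:
                    return by_id[i]
            return by_id[ids[0]]  # wrapped past the top of the ring

        node.fingers = [successor_of((node.id + 2 ** i) % RING) for i in range(m)]

    def finger_lookup(node, key_id):
        if in_interval(key_id, node.pred.id, node.id):
            return node
        # Farthest finger first: take the farthest finger that does not overshoot
        # the key, i.e., still lies between this node and key_id on the ring.
        for f in reversed(node.fingers):
            if f is not node and in_interval(f.id, node.id, key_id):
                return finger_lookup(f, key_id)
        return finger_lookup(node.succ, key_id)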

Joining and managing routing


• Join:
– Choose nodeid
– Lookup (my.id) to find place on ring
– During lookup, discover future successor
– Learn predecessor from successor
– Update succ and pred that you joined
– Find fingers by lookup( (my.id + 2^i) mod 2^160 ) (see the join sketch below)

• Monitor:
– If a successor or predecessor doesn’t respond for some time, find a new one

• Leave: Just go, already!


– (Warn your neighbors if you feel like it)
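A minimal sketch of these join steps, reusing the hypothetical Node helpers and RING from the sketches above; it splices pointers in one shot, whereas real Chord relies on periodic stabilization to repair succ/pred under churn.

    # Join: find our place on the ring via an existing (bootstrap) node.
    def join(new_node, bootstrap):
        # lookup(my.id) returns the node currently responsible for our ID,
        # i.e., our future successor.
        new_node.succ = bootstrap.lookup(new_node.id)
        # Learn our predecessor from that successor, then tell both neighbors.
        new_node.pred = new_node.succ.pred
        new_node.pred.succ = new_node
        new_node.succ.pred = new_node
        # Find fingers by lookup((my.id + 2^i) mod 2^160).
        new_node.fingers = [
            bootstrap.lookup((new_node.id + 2 ** i) % RING) for i in range(160)
        ]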
Performance optimizations

• Routing entries need not be drawn from the strict distribution the finger algorithm uses
– Choose the node with the lowest latency to you
– Will still get you ~½ closer to the destination

• Less flexibility in choice as you get closer to the destination

Consistent hashing vs. DHTs


                             Consistent Hashing   Distributed Hash Tables   Distributed Hash Tables
Routing table size           O(n)                  O(log n)                  O(sqrt(n))
Lookup / Routing             O(1)                  O(log n)                  O( ? )
Join/leave: routing updates  O(n)                  O(log n)                  O(sqrt(n))
Join/leave: key movement     O(1)                  O(1)                      O(1)

(A) sqrt(N)   (B) log N   (C) 1

DHT Design Goals


• An “overlay” network with:
– Flexible mapping of keys to physical nodes
– Small network diameter
– Small degree (fanout)
– Local routing decisions
– Robustness to churn
– Routing flexibility
– Decent locality (low “stretch”)

• Different “storage” mechanisms considered:


– Persistence w/ additional mechanisms for fault recovery
– Best effort caching and maintenance via soft state

Storage models
• Store only on key’s immediate successor
– Churn, routing issues, packet loss make lookup failure more likely
• Store on k successors
– When nodes detect succ/pred failures, re-replicate (see the sketch below)
– Or use erasure coding: the file can be recovered from any j-out-of-k “chunks”, each smaller than a full replica
• Cache along the reverse lookup path
– Provided data is immutable
– …and lookups are answered recursively, so responses flow back along the path
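A minimal sketch of the "store on k successors" idea, reusing the hypothetical Node sketch above (each node has a data dict); k = 3 is arbitrary, and erasure coding and reverse-path caching are not shown.

    # Replicate each key on its owner (the key's successor) and the next K-1 nodes.
    K = 3  # replication factor (illustrative)

    def store(any_node, key_id, value):
        replica = any_node.lookup(key_id)  # primary owner = successor(key_id)
        for _ in range(K):
            replica.data[key_id] = value
            replica = replica.succ

    def fetch(any_node, key_id):
        replica = any_node.lookup(key_id)
        for _ in range(K):                 # try the owner, then its successors
            if key_id in replica.data:
                return replica.data[key_id]
            replica = replica.succ
        return None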

Summary
• Peer-to-peer systems
– Unstructured systems (next Monday)
• Finding hay, performing keyword search
– Structured systems (DHTs)
• Finding needles, exact match
• Distributed hash tables
– Based around consistent hashing with views of O(log n)
– Chord, Pastry, CAN, Koorde, Kademlia, Tapestry, Viceroy, …
• Lots of systems issues
– Heterogeneity, storage models, locality, churn management,
underlay issues, …
– DHTs deployed in the wild: Vuze (Kademlia) has 1M+ active users
