
The Performance of Distributed Data-Structures


Running on a "Cache-Coherent" In-Memory
Data Grid
Monday, August 20, 2012 at 9:15AM

Todd Hoff in Product, datagrid, imdg

This is a guest post by Ron Pressler, the founder and CEO of Parallel Universe, a Y Combinator company building advanced middleware for real-time applications.

A little over a month ago, we open-sourced a new in-memory data grid called Galaxy. An in-memory data grid, or IMDG, is a clustered data
storage and processing middleware that uses RAM as the authoritative
and primary storage, and distributes data over a cluster for purposes of
data and processing scalability and high-availability. A common feature
of IMDGs is co-location of code and data, meaning that application code
runs on all cluster nodes, each instance processing those data items
residing in the local node's RAM.

While quite a few commercial and open-source IMDGs are available (like
Terracotta, Gigaspaces, Oracle Coherence, GemFire, Websphere eXtreme
Scale, Infinispan and Hazelcast), Galaxy has adopted a completely different architecture from all other IMDGs, in order to serve usage scenarios that are a poor fit for those solutions.

All other IMDGs, as well as most distributed NoSQL databases (like Riak and Cassandra), employ what are known as distributed hash tables (DHTs) to partition and locate data items in the cluster. DHTs assign a data item to
one or more cluster nodes based on a static hash value computed for each
item's key (those systems provide access to items by keys). This means
that an item's owning cluster-node(s) can be easily located, requiring
just one network roundtrip per access in the worst case (for a read or a
write). However, that one network roundtrip is also required in the
common case.

Galaxy, on the other hand, dynamically migrates items among cluster nodes, making a different tradeoff: accessing an item might take more than
one network roundtrip in the worst-case scenario, but the common case
requires no hops at all (note that this is not the same as write-through
caches).

This tradeoff is suitable when the data and its access patterns induce a
metric over the data-space, where nearby data-points are more likely to be
accessed together (in the same query/transaction) than faraway points, and
data items dynamically move in that metric space. Two examples are
spatial data (the original motivation for Galaxy) and graphs. When an
application accesses one graph-node, it is likely to access its neighbors in
the same query, and unlikely to access distant nodes.

Galaxy borrows its design from that of CPU L1 caches, and this blog post explains the rationale behind the design, so I won't expand further on this
subject here.

Galaxy is meant to serve as a foundation for building distributed data-structures, and an analysis of one such structure is the main topic of this
post.

Shared-state vs. Message passing

One issue that often comes up when discussing distributed systems is the question of a message-passing architecture versus shared state.

It is important to realize that all shared-state systems (even the shared RAM on your computer) are an abstraction implemented under the hood
with message-passing. It is an abstraction that gives developers a very
useful illusion.

When the shared-state abstraction is used, performance issues normally arise only in the presence of write contention — when two independent
processes attempt to write the same address/item concurrently. It is
contention that requires the underlying implementation to pass messages,
and it is this communication that derails performance, because it is usually
done over a relatively slow channel.

So it is not shared state that performs slower than message-passing. It is contention that causes communication — which is slow. It can be
claimed, though, that a message-passing architecture makes
communication explicit and thus calls attention to it, while shared-state
hides it behind an abstraction, making developers less aware of pitfalls.
Nevertheless, for many applications (or software components) shared state
is a natural and very convenient model — particularly for data stores.

A short overview of Galaxy's implementation and its implications

As the data-structure we're about to discuss is the B+-tree, in order to avoid confusion, we shall henceforth call the Galaxy cluster nodes machines, while the tree nodes will be called, simply, nodes.

But before beginning our analysis we need to describe, very concisely, how Galaxy is implemented; i.e. how Galaxy uses a message-passing foundation — a computer network — to provide the illusion of one large, shared memory space.

A full and detailed description is given in this blog post series, but here
I'll provide a very short summary.

Every data-item in Galaxy (accessed not through a key, but using an ID assigned by the framework) is owned by exactly one machine at any given time, and may be shared by many. The item resides in the local RAM of both the owner and the sharers, in an object called a cache-line (the name has been adopted from that used by L1 caches, the inspiration for Galaxy's architecture). Item reads, therefore — both by the owner and the sharers — are serviced by RAM and require no network hops.

If a machine wishes to write an item, it must first request ownership of the item — if it is not already the owner (requiring 1 network roundtrip) — and then invalidate the item's cache-line in all of its sharers (1 roundtrip * #sharers) to become an exclusive owner; only then can the write proceed in RAM. Quite often, the former sharers will again request to share the item for future reads, requiring another roundtrip per sharer. In total, the worst-case update of an item may require 1 + 2 * #sharers roundtrips per update (Galaxy employs some heuristics to reduce write latency — fully discussed in the linked post series — but they are not relevant to our current discussion).

This may sound bad, but bear in mind that in most well-implemented
distributed data structures, most items are not shared at all — as we shall
see soon — and in the common case an update requires 0 network hops.
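To make that accounting concrete, here is a minimal sketch of the roundtrip cost model just described. It is not Galaxy's API; the class and field names are hypothetical, and the code simply encodes the read and worst-case write costs from the paragraphs above.

```java
// Roundtrip cost model for a single data item under the protocol described
// above. Hypothetical names; this encodes the accounting only, not Galaxy's API.
final class ItemCostModel {
    boolean owner;   // does the local machine own the item?
    boolean cached;  // is the item in local RAM (as owner or as sharer)?
    int sharers;     // number of other machines currently sharing the item

    // Reads are served from local RAM whenever the item is owned or shared here.
    int readRoundtrips() {
        return cached ? 0 : 1; // otherwise: one hop to fetch (and share) the item
    }

    // Worst-case write: acquire ownership (1 hop if not already the owner),
    // invalidate every sharer (1 hop each), and later let former sharers
    // re-share the item (another hop each): 1 + 2 * #sharers in total.
    int worstCaseWriteRoundtrips() {
        return (owner ? 0 : 1) + 2 * sharers;
    }
}
```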

We will now analyze the performance of such a data-structure.

Amortized analysis of inserts to a distributed B+-tree


The data structure I have chosen for this discussion is the ubiquitous B+-tree. The B+-tree performs all search, insert and delete operations in logarithmic time, and has the following very desirable property for our purposes: modifications (i.e. inserts and deletes) are normally done solely at the leaves, and only occasionally require modifications to inner nodes, with modifications being rarer the higher the node is in the tree. We will analyze the amortized cost of the insert operation — the average cost per operation over a worst-case sequence of operations, where individual operations may have very different costs. But unlike traditional algorithm analysis, we will not count the computer instructions required, but rather the number of network roundtrips, as those are the weakest link in the performance of any distributed system.

We will assume that the tree is implemented using Galaxy such that each
node is one Galaxy data-item, and that each node has a capacity of b>2
children (b is the tree's fanout). We will denote the number of machines in
the cluster as M, and the number of elements stored in the tree (the size of
the data set) as n.

Let us also assume that some algorithm has distributed the nodes among the machines such that below a certain level L of the tree, all nodes are exclusively owned by one machine — i.e. they have no sharers — and that the algorithm directs any operation to the machine best suited to execute it. Since L is a level near the top of the tree, the vast majority of nodes lie below it.

The drawing shows how the nodes are distributed: each color represents a
machine, and the nodes' colors denote which machines share the node. If a
node has only one color, it is exclusively owned by one machine, and
updating it entails no network roundtrips.


In addition to all the nodes below level L, each machine shares all of its owned nodes' parents, all the way to the root. This means that the tree's root node is shared by all M machines, the nodes at level 2 are each shared by about M/b machines, those at the level below that by about M/b^2, and so on. Because of this, updating the root requires about M network roundtrips, updating a node at the level below the root requires about M/b roundtrips, etc.

Now, remember that the B+-tree insert operation first inserts the element into one of the leaves. If the leaf overflows, it splits into two — each new leaf containing half of the old leaf's elements — and the new leaf is then inserted as a child into the parent node. When the parent node overflows, it, too, splits, and the new node is inserted into its parent, and so on.

We will begin the analysis when all nodes — leaves and inner nodes — contain b/2 children each (for simplicity's sake we assume b is even) — the minimum allowed by the B+-tree — and the tree is of height h (so the data-structure contains exactly n = (b/2)^h elements, because the B+-tree stores all data elements in the leaves).
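For a concrete (purely illustrative) sense of scale: with a fanout of b = 100 and height h = 4, such a half-full tree holds (b/2)^h = 50^4 = 6,250,000 elements.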

Let's now count the number of network roundtrips required, in the worst case, by a sequence of k consecutive insert operations, with k <= n.

In the worst case, all inserts will hit the same leaf over and over. After the first b/2 inserts, the leaf will split. After (b/2)^2 inserts, its parent node will split as well, and after (b/2)^3 inserts, so will its grandparent. Eventually nodes at level L and above will split, and updating those shared nodes will entail roundtrips. Occasionally, an insert will trigger a cascade of updates going all the way to the root, requiring roundtrips at every shared level from L up to the root. Since a node at level j is updated roughly once every (b/2)^(h-j) inserts, and each such update costs about M/b^(j-1) roundtrips, the total number of roundtrips for k inserts is:

T(k) <= sum[j=1..L] ceil( k / (b/2)^(h-j) ) * M / b^(j-1)
     <= M * ( sum[j=1..L] k * 2^(h-j) / b^(h-1) + L )
     <= M * ( k*b/n + L )

(The last term inside the parentheses on the third line was added to
compensate for getting rid of the ceilings in the other terms).
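As a sanity check on the bound M * (k*b/n + L), here is a small, self-contained simulation of the worst-case insert sequence: every node starts half-full, all k inserts hammer the same leaf, and each update of a node at a shared level j is charged roughly M/b^(j-1) roundtrips. The parameters, the level-L cutoff and the per-update charge are the simplifying assumptions of this analysis, not measurements of Galaxy.

```java
// Toy worst-case simulation of the analysis above. We track only the "hot"
// path from one leaf to the root, since all inserts concentrate on it.
public class BTreeRoundtripSim {
    public static void main(String[] args) {
        int b = 16;        // fanout
        int h = 5;         // tree height (level 1 = root, level h = leaves)
        int M = 64;        // machines
        int L = 2;         // levels 1..L are shared; below L, nodes are exclusive
        long n = pow(b / 2, h);      // elements when every node is half-full
        long k = n;                  // length of the insert sequence (k <= n)

        int[] children = new int[h + 1];          // children[j] = child count of the hot node at level j
        java.util.Arrays.fill(children, b / 2);   // every node starts half-full

        double roundtrips = 0;
        for (long i = 0; i < k; i++) {
            int j = h;                            // the insert lands in the hot leaf
            children[j]++;                        // (leaves are below level L, so this is free)
            while (j >= 1 && children[j] > b) {   // overflow: split and push a new child up
                children[j] = (b + 1) / 2;        // the split node keeps about half its children
                j--;
                if (j >= 1) {
                    children[j]++;                // parent gains the new sibling
                    if (j <= L)                   // shared level: charge ~#sharers roundtrips
                        roundtrips += Math.ceil(M / Math.pow(b, j - 1));
                }
            }
        }
        double bound = (double) M * ((double) k * b / n + L);
        System.out.printf("simulated total roundtrips: %.0f%n", roundtrips);
        System.out.printf("bound M*(k*b/n + L):        %.0f%n", bound);
        System.out.printf("amortized per insert:       %.6f%n", roundtrips / k);
    }

    static long pow(int base, int exp) {
        long r = 1;
        for (int i = 0; i < exp; i++) r *= base;
        return r;
    }
}
```

With the parameters above it should print a total comfortably under the bound, and an amortized cost of a few hundredths of a roundtrip per insert.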

To find the amortized cost for one insert operation, we divide by the

number of operations, k, and get:

T(k)/k <= M * ( b/n + L/k )

Now, if you look at the definition of L and at the drawing, you'll realize that b^L is roughly M (there are about b^L exclusively-owned subtrees hanging just below level L, at least one per machine), so L is about log_b(M), a small number, and for any reasonably long sequence of inserts (k on the order of n) the L/k term is negligible. So, finally, we get:

amortized cost per insert ~= b*M/n network roundtrips

Let's examine this result. First, if M stays constant, the amortized cost per-
operation decreases with n. This actually makes sense. If M is constant
and n increases, then the number of nodes stored on each machine
increases as well, and more nodes are at levels below L. There's less need for inter-machine communication; more of the work is done locally on each machine. In real life, though, this doesn't happen, as the number
of tree nodes that can be stored on each machine is bounded, so when the
data-set grows very large, we must increase the number of machines.

If, on the other hand, we increase the number of machines while keeping n
constant, the amortized cost rises. This too makes sense. If we distribute
the same amount of data to more machines, more of the nodes will be
closer to level L, and more inter-machine communication will be needed,
since each machine is now responsible for a smaller subset of the data.

But I think that in real-life scenarios, the number of machines is more-or-less linearly related to the size of the data set. This does not mean that we cram as much data into each machine as its RAM can store — we also want each machine to have enough CPU resources to process the information it holds. How much processing is required is application-dependent, but for any given application the ratio of processing power to data items stored either changes very gradually or very rarely (say, when new users join, and even then this tends to be accompanied by more data), so it can be treated as constant. If we decide, then, that we want each machine to store and process up to C objects — or tree nodes in our case — M becomes related to n (roughly M = n/C), and now our cost becomes:

amortized cost per insert ~= b*M/n = b/C network roundtrips
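To put purely illustrative numbers on this: with a fanout of b = 100 and C = 1,000,000 tree nodes per machine, the amortized cost is about 100 / 1,000,000 = 0.0001 network roundtrips per insert, i.e. roughly one roundtrip per ten thousand inserts.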

True, this is not the whole story. Once k exceeds n, the height of the tree will grow, requiring some re-balancing, possibly moving some tree nodes from one machine to another. I claim (without rigorous proof) that the number of nodes transferred will be on the order of n, and that this event will only happen once every n or so inserts, so the two will cancel out.

Conclusion

Though we started out by saying that Galaxy sacrifices worst-case performance for common-case performance, when using a data-structure that fits well with Galaxy's distribution model, and assuming the number of machines grows with the size of the data set, not only do we get O(1) (amortized) worst-case performance in terms of network roundtrips, but the actual constant is much less than 1 (because C >> 2b), whereas roughly one roundtrip per operation is what we'd get when using a distributed hash-table. This is perfect scalability, and it is a direct result of the properties of the B+-tree (and other, similar tree data structures); it is not true for all data structures.

Also, we have computed the amortized cost of the insert operation, but, as we've seen, within a sequence of such operations, while most inserts entail no network hops at all, a small portion result in quite a few roundtrips, like the rare insert that causes node splits to propagate all the way to the root. The O(1) amortized behavior means that throughput is unaffected by the number of machines (accepting our assumption that machines are added to accommodate processing more data), but the latency of a single, rare operation might be. Nevertheless, such events are so rare (and Galaxy's heuristics can reduce the write latency in many cases) that even very latency-aware applications — perhaps all but hard real-time systems — are better off using Galaxy's design rather than a DHT, provided that their data, and data operations, are well served by a data-structure that fits this design well.

Related Articles
On Hacker News
YC’s Parallel Universe Developing Spatial Databases For
Matrix-Style Games
Parallel Universe Blog
Parallel Universe open sources a novel in-memory data grid
On Hacker News

Article originally appeared on High Scalability (http://highscalability.com/).

See website for complete article licensing information.
