Cost Based Analysis of Hierarchical p2p
Cost Based Analysis of Hierarchical p2p
3.3. Example: Homogeneous Peers Figure 2: Total network costs against superpeer ratio α
HLF
30 A HLF = 100%
Costs per Superpeer
B
0%
1% 10% Superpeer Ratio α 100%
[Gbit/s]
A
30,000 peers are UMTS-connected PDAs, and 30,000
peers are built of GPRS-connected cell phones. All
peers are modeled according to the parameters given in 0,3
Table 1.
Parameter Value With regard to the HLF diagram, we can draw the
TPING 10s following conclusions: When the number of superpeers
TSTAB 10s is less than 13,775 (number of superpeers in point A),
TFIX 60s we observe HLFs of more then 100%, i.e. overloaded
DSL superpeers. This is also the case for a number of
TREP 600s
superpeers between 30,001 and 45,291 (UMTS super-
Mean message size 1000 Bits
peers overloaded) and whenever GPRS peers act as
superpeers. As stated above these superpeers ratios
When increasing the number of superpeers in the should be avoided to ensure system stability. Interest-
following analysis, we first promote DSL peers to su- ingly, this also implies that neither a centralized system
perpeers. For scenarios with more than 30,000 super- (due to overloaded index server) nor a flat Chord over-
peers we also take UMTS peers. Finally, if we have lay (due to overloaded GPRS peers) is an applicable
more than 60,000 superpeers, even GPRS peers will act solution for the given system, thus only a hierarchical
as superpeers. Figures 5 and 6 show the total network architecture fulfils the system requirements.
costs and the highest load factor for the given overlay The optimal number of superpeers for the given sys-
network against the number of superpeers in the sys- tem lies between 13,775 (point A) and 30,000 (point
tem. B). As we can see from figures 5 and 6, at point A the
As expected, we again notice increasing total net- total network costs are minimized without overloading
work costs for an increasing number of superpeers. any participating peer in the system. At point B the
This corresponds directly to our argumentation in Sec- HLF is minimized. In this case all DSL peers act as
tion 3.3. superpeers with a load factor of 51%, while all UMTS
and GPRS peers are leafnodes and therefore have much
lower load factors.
3
We focus on the upstream bit rate of peers here because
this is mainly the bottleneck in today’s overlay networks,
e.g. due to an asymmetric down- and upstream in DSL.
5. On Distributed Algorithms to Determine minimizing the costs for the highest loaded peer in the
Optimal Superpeer Ratios system.
In our future work we plan to extend our analytical
framework to other hierarchical DHT designs, in order
There are two instances of the problem of distributed to compare them to the system architecture proposed in
determining optimal operating points of a network. this paper. We also intend to evaluate the impact of
How can an optimal point be reached from any operat- churn on our analysis. Further, we are currently work-
ing state and how can it be maintained once reached? ing on distributed algorithms to balance leafnodes and
Answering these questions in detail is an important shared objects uniformly over superpeers, and to main-
item of our future work, here we just hint on a possible tain optimal superpeer ratios.
answer to the second question in order to back our be-
lief that distributed implementations of the presented
ideas are feasible.
References
Under the assumption of sequential peer arrivals an
[1] I. Stoica, R. Morris, D. Karger, M. Kaashoek, and H.
optimal superpeer ratio can be maintained roughly as Balakrishnan, "Chord: A Scalable Peer-to-Peer Lookup
follows. When a new peer joins the network it can Service for Internet Applications", ACM SIGCOMM
communicate its cost limit to the superpeer through Conference, 2001.
which it is about to join. The superpeer is responsible [2] A. Rowstron and P. Druschel, "Pastry: Scalable, Dis-
for deciding which role the arriving peer should take tributed Object Location and Routing for Large-Scale
and whether it should replace one or more superpeers Peer-to-Peer Systems", IFIP/ACM International Con-
that are already in the network. To be able to do this, ference on Distributed Systems Platforms (Middle-
all superpeers need to maintain approximations of the ware), 2001.
[3] P. Maymounkov and D. Mazieres, "Kademlia: A Peer-
current state of the network. In particular, good ap-
to-Peer Information System Based on the XOR Metric",
proximations of the sizes of the superpeer population International Workshop on Peer-to-Peer Systems
and the entire network as well as an estimate of the (IPTPS'02), 2002.
load distribution across peers are necessary. We be- [4] K. Aberer, P. Cudré-Mauroux, A. Datta, Z. Despotovic,
lieve that the DHT size estimation from [11] can be M. Hauswirth, M. Punceva, and R. Schmidt, "P-Grid: A
extended and applied to our problem. On the other Self-organizing Structured P2P System", SIGMOD Re-
hand, it seems to be hard to discover an accurate esti- cord, vol. 32, 2003.
mate of the load distribution. We wonder whether it [5] P. Ganesan, K. Gummadi, and H. Garcia-Molina,
suffices to maintain samples made up of routing "Canon in G Major: Designing DHTs with Hierarchical
Structure", International Conference on Distributed
neighbors only or larger sets are necessary. In case
Computing Systems (ICDCS 2004), 2004.
they are needed, these sets can be spread among [6] L. Garces-Erice, E. W. Biersack, K. W. Ross, P. A.
neighbors, either by broadcasting them periodically or Felber, and G. Urvoy-Keller, "Hierarchical P2P Sys-
piggybacking them to other exchanged messages. tems", ACM/IFIP International Conference on Parallel
and Distributed Computing (Euro-Par), 2003.
6. Conclusion and Future Work [7] S. Zoels, S. Schubert, W. Kellerer, and Z. Despotovic,
"Hybrid DHT Design for Mobile Environments", Inter-
national Workshop on Agents and Peer-to-Peer Com-
We showed in this paper that hierarchical DHT or-
puting (AP2PC 2006), 2006.
ganizations provide a plausible approach to building [8] N. Christin and J. Chuang, "A Cost-Based Analysis of
P2P overlay networks. We presented an analytical Overlay Routing Geometries", IEEE INFOCOM'05,
framework to analyze a specific type of hierarchical 2005.
systems, where superpeers build a conventional Chord [9] S. Rhea, D. Geels, T. Roscoe, and J. Kubiatowcz,
overlay and leafnodes use them as proxies. We evalu- "Handling Churn in a DHT", USENIX 2004 Annual
ated the costs for running the whole network as well as Technical Conference, 2004.
the costs for every single participant, in order to deter- [10] P. K Gummadi,, R. Gummadi, S. D. Gribble, S. Ratna-
mine an optimal superpeer ratio for a given system. We samy, S. Shenker, and I. Stoica. "The Impact of DHT
Routing Geometry on Resilience and Proximity", ACM
found that total network costs decrease with centraliza- SIGCOMM Conference, 2003.
tion; while on the other hand, centralization may over- [11] G. S. Manku, "Routing Networks for Distributed Hash
load peers and therefore endanger system stability. As Tables", 22nd ACM Symposium on Principles of Dis-
our main results, we showed that hierarchical DHT tributed Computing (PODC), 2003.
design is better than flat design and that there is a
trade-off between minimizing total network costs and