Routing Algorithms For DHTS: Some Open Questions
Routing Algorithms For DHTS: Some Open Questions
Proximity Routing: Proximity routing is when the Question 11 Can one choose identifiers in a one-
routing choice is based not just which neighboring node dimensional key space that will adequately capture the
makes the “most” progress towards the key, but is also geographic layout of nodes?
based on which neighboring node is “closest” in the However, this may not matter because the geographic
sense of latency. Various algorithms implement prox- layout may not offer significant advantages over the two
imity routing differently, but they all adopt the same proximity methods.
basic approach of weighing progress in identifier space Question 12 Can the two local techniques of proximity
against cost in latency (or geography). Simulations routing and proximity neighbor selection achieve most
have shown this to be a very effective tool in reducing of the benefit of global geographic layout?
the average path latency.
Moreover, these geographically-informed layout meth-
Question 8 Can one formally characterize the effec- ods may interfere with the robustness, hotspot, and
tiveness of these proximity routing approaches? other properties mentioned in previous sections.
Question 13 Does geographic layout have an impact
Proximity Neighbor Selection: This is a variant of on resilience, hotspots, and other aspects of perfor-
the idea above, but now the proximity criterion is ap- mance?
plied when choosing neighbors, not just when choosing
the next hop.
Question 9 Can one show that proximity neighbor se- 7 Extreme Heterogeneity
lection is always better than proximity routing? Is this
All of the algorithms start by assuming that all nodes
difference significant?
have the same capacity to process messages and then,
As mentioned earlier, if the
node-pair dis- only later, add on techniques for coping with hetero-
tances (as measured by latency) are known, the Plax- geneity.3 However, the heterogeneity observed in cur-
ton/Tapestry algorithm can choose the neighbors so as rent P2P populations [13] is quite extreme, with dif-
to minimize the expected overlay path latency. This is ferences of several orders of magnitude in bandwidth.
an extremely important property, that is (so far) the ex- One can ask whether the routing algorithms, rather than
clusive domain of the Plaxton/Tapestry algorithms. We merely coping with heterogeneity, should instead use
don’t whether other algorithms can adopt similar ap- 2
Note that geographic layout differs from the two above proxim-
proaches. ity methods in that here there is an attempt to affect the global lay-
out of the node identifiers, whereas the proximity methods merely
Question 10 If one had the full
distance matrix, affect the local choices of neighbors and forwarding nodes.
could one do optimal neighbor selection in algorithms 3
The authors of [13] deserve credit for bringing the issue of het-
other than Plaxton/Tapestry? erogeneity to our attention.
it to their advantage. At the extreme, a star topology [5] D RUSCHEL , P., AND ROWSTRON , A. Past: Persistent and
with all queries passing through a single hub node and anonymous storage in a peer-to-peer networking environ-
ment. In Proceedings of the 8th IEEE Workshop on Hot Top-
then routed to their destination would be extremely ef-
ics in Operating Systems (HotOS 2001) (Elmau/Oberbayern,
ficient, but would require a very highly capable nub Germany, May 2001), pp. 65–70.
node (and would have a single point of failure). But [6] D RUSCHEL , P., AND ROWSTRON , A. Pastry: Scalable, dis-
perhaps one could use the very highly capable nodes tributed object location and routing for large-scale peer-to-
as mini-hubs to improve routing. In another position peer systems. In Proceedings of the 18th IFIP/ACM Interna-
paper here, some of us argue that heterogeneity can be tional Conference on Distributed Systems Platforms (Middle-
ware 2001)W (Nov 2001).
used to make Gnutella-like systems more scalable. The
question is whether one could similarly modify the cur- [7] K UBIATOWICZ , J., B INDEL , D., C HEN , Y., C ZERWIN -
SKI , S., E ATON , P., G EELS , D., G UMMADI , R., R HEA ,
rent DHT routing algorithms to exploit heterogeneity: S., W EATHERSPOON , H., W EIMER , W., W ELLS , C., AND
Z HAO , B. OceanStore: An architecture for global-scale per-
Question 14 Can one redesign these routing algo- sistent storage. In Proceeedings of the Ninth international
Conference on Architectural Support for Programming Lan-
rithms to exploit heterogeneity? guages and Operating Systems (ASPLOS 2000) (Boston, MA,
It may be that no sophisticated modifications are November 2000), pp. 190–201.
needed to leverage heterogeneity. Perhaps the sim- [8] N G , E., AND Z HANG , H. Towards global network position-
plest technique to cope with heterogeneity, one that has ing. In Proceedings of ACM SIGCOMM Internet Measure-
already been mentioned in the literature, is to clone ment Workshop 2001 (Nov. 2001).
highly capable nodes so that they could serve as multi- [9] P LAXTON , C., R AJARAMAN , R., AND R ICHA , A. Access-
ing nearby copies of replicated objects in a distributed envi-
ple nodes; i.e., a node that was times more powerful
ronment. In Proceedings of the ACM SPAA (Newport, Rhode
than other nodes could function as virtual nodes. 4 Island, June 1997), pp. 311–320.
When combined with proximity routing and neighbor [10] R ATNASAMY, S., F RANCIS , P., H ANDLEY, M., K ARP, R.,
selection, cloning would allow nodes to route to them- AND S HENKER , S. A scalable content-addressable network.
selves and thereby “jump” in key space without any for- In Proc. ACM SIGCOMM (San Diego, CA, August 2001),
warding hops. pp. 161–172.
[11] R ATNASAMY, S., H ANDLEY, M., K ARP, R., AND
Question 15 Does cloning plus proximity routing and S HENKER , S. Application-level Multicast using Content-
neighbor selection lead to significantly improved per- Addressable Networks. In Proceedings of NGC 2001 (Nov.
formance when the node capabilities are extremely het- 2001).
erogeneous? [12] R ATNASAMY, S., H ANDLEY, M., R ICHARD K ARP, AND
S HENKER , S. Topologically-aware overlay construction and
server selection. In Proceedings of Infocom ’2002 (Mar.
References 2002).
[13] S AROIU , S., G UMMADI , K., AND G RIBBLE , S. A measure-
[1] A. ROWSTRON , A-M. K ERMARREC , M. C., AND D R - ment study of peer-to-peer file sharing systems. In Proceed-
USCHEL , P. Scribe: The design of a large-scale event no- ings of Multimedia Conferencing and Networking (San Jose,
tification infrastructure. In Proceedings of NGC 2001 (Nov. Jan. 2002).
2001).
[14] S TOICA , I., M ORRIS , R., K ARGER , D., K AASHOEK , M. F.,
[2] BASED C HAT, C. http://jxme.jxta.org/demo.html, 2001. AND B ALAKRISHNAN , H. Chord: A scalable peer-to-peer
[3] C ABRERA , L. F., J ONES , M. B., AND T HEIMER , M. Herald: lookup service for internet applications. In Proceedings of
Achieving a global event notification service. In Proceedings the ACM SIGCOMM ’01 Conference (San Diego, California,
of the 8th IEEE Workshop on Hot Topics in Operating Systems August 2001).
(HotOS-VIII) (Elmau/Oberbayern, Germany, May 2001). [15] Z HAO , B. Y., K UBIATOWICZ , J., AND J OSEPH , A. Tapestry:
[4] DABEK , F., K AASHOEK , M. F., K ARGER , D., M ORRIS , R., An infrastructure for fault-tolerant wide-area location and
AND S TOICA , I. Wide-area cooperative storage with CFS. routing. Tech. Rep. UCB/CSD-01-1141, University of Cal-
In Proceedings of the 18th ACM Symposium on Operating ifornia at Berkeley, Computer Science Department, 2001.
Systems Principles (SOSP ’01) (To appear; Banff, Canada, [16] Z HUANG , S., Z HAO , B., J OSEPH , A. D., K ATZ , R. H., AND
Oct. 2001). K UBIATOWICZ , J. Bayeux: An architecture for wide-area,
4
This technique has already been suggested for some of the al- fault-tolerant data dissemination. In Proceedings of NOSS-
gorithms, and could easily be applied to the others. However, in DAV’01 (Port Jefferson, NY, June 2001).
some algorithms it would require alteration in the way the node
identifiers were chosen so that they weren’t tied to the IP address of
the node.