Flexnetwork Architecture Delivers Higher Speed, Lower Downtime With HP Irf Technology
Flexnetwork Architecture Delivers Higher Speed, Lower Downtime With HP Irf Technology
Flexnetwork Architecture Delivers Higher Speed, Lower Downtime With HP Irf Technology
August 2011
This document briefly explains IRF concepts and benefits, and then describes procedures and results for IRF tests involving VMware vMotion; network bandwidth; and resiliency.
Introducing IRF
IRF consolidates multiple physical switches so that they appear to the rest of the network as a single logical entity. Up to nine switches can comprise this virtual fabric, which runs on HPs high-end switch/routers1 and can encompass hundreds or thousands of gigabit and 10-gigabit Ethernet ports. IRF offers advantages in terms of simpler network design; ease of management; disaster recovery; performance; and resiliency. A virtual fabric essentially flattens the data center from three layers into one or two using HP Virtual Connect technology, requiring fewer switches. Device configuration and management also becomes simpler. Within an IRF domain, configuration of a single primary switch is all thats needed; the primary switch then distributes relevant configuration and protocol information to other switches in the IRF domain. IRF also supports an in-service software upgrade (ISSU) capability that allows individual switches to be taken offline for upgrades without affecting the rest of the virtual fabric. For disaster recovery, switches within an IRF domain can be deployed across multiple data centers. According to HP, a single IRF domain can link switches up to 70 kilometers (43.5 miles) apart.
Page
2
1
IRF support is included at no cost on HP 12500, 9500, 7500, 58xx, and 55xx switches.
Page
By virtualizing the core switching infrastructure, IRF increases network capacity (see Figure 2). Here, all inter-switch ports are available, with no loss in redundancy compared with spanning tree. The core switches appear to the rest of the network as a single logical device. That converged device can forward traffic on all links with all attached access switches. Moreover, IRF can be used together with the link aggregation control protocol (LACP) to add capacity to links between switches (or between switches and servers), again with all ports available all the time. This isnt possible with STP or RSTP since at least half (and possibly more) of all inter-switch links must remain in blocking state at all times.
Page
For both RSTP and IRF scenarios, engineers used custom-developed scripts to trigger vMotion migration of 128 virtual machines. In all three trials run, IRF clearly outperformed RSTP in terms of average vMotion migration time (see Figure 3). On average, migrations over RSTP took around 70 seconds to complete, while vMotion over IRF took 43 seconds.
Page
6
2
Network Test did not compare IRF and rapid spanning tree protocol [RSTP] throughput because RSTP results would be identical to those with STP in this particular configuration.
Network Test followed the procedures described in RFCs 2544 and 2889 to determine system throughput. Engineers configured the Spirent TestCenter traffic generator/analyzer to offer bidirectional traffic in an east/west direction, meaning all frames on the west side of the test bed were destined for the east side and vice-versa. The aggregate load offered to each side was equivalent to 20 Gbit/s of traffic, equal to the theoretical maximum capacity of the access-core links. To measure throughput, engineers offer traffic at varying loads, each for a 60-second duration, to determine the highest rate at which the switches would forward all frames with zero frame loss. As defined in RFC 1242, this is the throughput rate.
Page
Note also that bandwidth utilization nearly doubles with IRF in every test case, regardless of frame length. This validates IRFs ability to deliver far more network bandwidth, which in turn speeds performance for all applications, regardless of traffic profile. The throughput figures presented in Figure 5 are given as percentages of theoretical line rate. As a unit of measurement, throughput itself is actually a rate and not a percentage. Figure 6 below presents the same data from the throughput tests, this time with throughput expressed in frames per second for each test case. Regardless of how its expressed, IRF provides nearly double the network bandwidth as other layer-2 and layer-3 resiliency mechanisms.
Page
Page
Link failure: With traffic active, engineers disconnected the link between the east 12500 and east 5820 switches, forcing traffic to be rerouted onto an alternative path
In all cases, Spirent TestCenter offered bidirectional streams of 64-byte frames at exactly 50 percent of line rate throughout the test. This is the most stressful non-overload condition possible; in theory, the system under test is never congested, even during component failure. Thus, any frames dropped during this test were a result of, and only of, path re-computation following a component failure. Figure 7 below compares convergence times for conventional STP with IRF configured for layer-2 operation. For all failure modes, IRF converges vastly faster than STP. Further, IRF converges far faster than HPs 50-ms claim in all cases. In fact, the differences between STP and layer-2 IRF are so large that they cannot be compared on the same scale as with other failover mechanisms. STP convergence times in this test are, if anything, lower than those typically seen in production networks. In this test, with only two sets of interfaces transitioning between forwarding and blocking states, convergence occurred relatively quickly; in production networks with more ports and switches, STP convergence times typically run on the order of 45 to 60 seconds. IRF convergence times in production may be higher as well, although by a far smaller amount than with STP.
Page
10
Figure 7: STP vs. IRF convergence times
Network Test also compared IRF with rapid spanning tree, the newer and faster mechanism described in IEEE specification 802.1w. While RSTP converges much faster than STP, its still no match for IRF in recovering from network outages (see Figure 8).
As with STP, the convergence times measured for RSTP were substantially lower than those typically seen on production networks, perhaps due to the small number of links involved. In production settings, RSTP convergence often takes between 1 and 3 seconds following a link or component failure. The final test compared VRRP and IRF convergence times, with the HP 12500 and HP 5280 both configured in layer-3 mode. In this case, both VRRP and IRF present a single IP address to other devices in the network, and this address migrates to a secondary system when a failure occurs. Here again, IRF easily outpaced VRRP when recomputing paths after a network failure (see Figure 9). VRRP took between 1.9 and 2.2 seconds to converge, compared with times in the single milliseconds or less for IRF.
Page
11
Conclusion
These test results validate IRFs benefits in the areas of network design, performance, and reliability. IRF simplifies network architectures in campus networks and data centers by combining multiple physical switches and presenting them as a single logical fabric to the rest of the network. This approach results in far faster transfer times for virtual machines using VMware vMotion. Performance testing also shows that IRF nearly doubles available bandwidth by virtue of its active/active design, compared with active/passive designs that tie up switch ports for redundancy. And the results also show huge improvements in convergence times following network failures, both in layer-2 and layer-3 modes, enhancing reliability and improving application performance.
Page
12
Appendix C: Disclaimer
Network Test Inc. has made every attempt to ensure that all test procedures were conducted with the utmost precision and accuracy, but acknowledges that errors do occur. Network Test Inc. shall not be held liable for damages that may result from the use of information contained in this document. All trademarks mentioned in this document are property of their respective owners.
Version 2011082200. Copyright 2011 Network Test Inc. All rights reserved. Network Test Inc. 31324 Via Colinas, Suite 113 Westlake Village, CA 91362-6761 USA +1-818-889-0011 http://networktest.com [email protected]
Page
13