Vmware Horizon View Best Practices Performance Study
Vmware Horizon View Best Practices Performance Study
Vmware Horizon View Best Practices Performance Study
Table of Contents
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
Horizon 6 Feature and Performance Enhancements. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
Hardware-Accelerated 3D Graphics. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
Performance Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
RDSH Sizing. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
CPU Usage . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
Bandwidth Usage . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
Light 3D Workload . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
CAD Workload . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
PCoIP Settings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
Conclusion. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
References. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
Authors and Contributors. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
T E C H N I C A L W H I T E PA P E R
/ 2
Introduction
The VMware Horizon with View centralized desktop infrastructure offers advantages for both end users and
IT staff. End users are no longer locked to a particular machine and can access their system and files from
anywhere, anytime. Horizon with View transforms IT by simplifying and automating desktop and applications
management. IT administrators can quickly create virtual desktops on demand based on locations and profiles.
By centrally maintaining desktops, applications, and data, Horizon with View reduces costs, improves security,
and increases availability and flexibility for end users.
Centralized Virtual
De
Desktops
VM
wa
re
vS
ph
ere
(ES
Xi)
Linked
Clones
Microsoft
Active
Directory
View
Connection
Server
Zero Client
OS
Thin Client
Horizon Client
for Android
VMware
vCenter
Horizon Client
for iOS
View Composer
Horizon Client
for Windows
Endpoint
Devices
Master
Image
Horizon Client
for Mac
Horizon Client
for Linux
Horizon Client
for Windows
Store
HTML
T E C H N I C A L W H I T E PA P E R / 3
T E C H N I C A L W H I T E PA P E R / 4
Hardware-Accelerated 3D Graphics
In response to user demand for an ever-richer set of applications to be supported in the virtual environment,
VMware improved 3D graphics support in View 5.x, with additional improvements added in Horizon 6. The
following 3D capabilities expand both the target user base and potential use cases that IT organizations can
deliver with virtual desktops.
Soft 3D Introduced with View 5.0, support for software-accelerated 3D graphics is provided by a Soft 3D
graphics driver without physical GPUs installed in the VMware ESXi (hypervisor) virtual machine host.
vSGA (Virtual Software Graphics Acceleration) Introduced with View 5.2, multiple virtual machines can
leverage physical GPUs installed locally in the ESXi virtual machine hosts to provide hardware-accelerated 3D
graphics to multiple virtual desktops.
vDGA (Virtual Dedicated Graphics Acceleration) Introduced with View 5.3, a single virtual machine is
mapped to one physical GPU installed in the ESXi virtual machine host to provide high-end, hardwareaccelerated workstation graphics where a dedicated GPU is needed.
The 3D graphics acceleration is built into the VMware vSphere platform. Horizon with View fully leverages
vSphere and delivers a robust set of 3D offerings to end users. You can administer support for 3D desktops
from the View Administrator console. You can enable it on a per pool or per virtual machine basis using
the vSphere client. More information and a detailed performance study about 3D graphics acceleration are
available in the VMware community forum paper, VMware Horizon 6 and Hardware Accelerated 3D Graphics
Performance and Best Practices.
In contrast to a physical workstation that has sole use of its GPU, GPUs in a virtualized environment are
a shared resource. Therefore, it is important to ensure that each virtual machine does not waste the GPU
resource. For instance, if View is configured to remote at a lower frame rate (the default is 30 FPS), it usually
does not make sense for a 3D application to render hundreds of frames per second (FPS). For these situations,
you can configure a View registry setting to limit the maximum application frame rate either in the template
virtual machine or on a per-virtual-machine basis. The value is typically set to the maximum frame rate used
by PCoIP. Setting the following registry entry for a 3D workload can significantly improve the performance and
consolidation ratios achievable:
HKLM\SOFTWARE\VMware, Inc.\VMware SVGA DevTap\MaxAppFrameRate
T E C H N I C A L W H I T E PA P E R / 5
VM
VM
VM
VM
VM
VM
VM
VM
VM
VM
VM
vSphere
Virtual SAN
Clustered
Virtual SAN Datastore
T E C H N I C A L W H I T E PA P E R / 6
Performance Results
To test the performance of the new and enhanced features, we used VMware View Planner 3.5, a workload
generator. View Planner simulates typical end-user operations, such as typing in Microsoft Word, playing a
PowerPoint slideshow, reading Outlook emails, viewing PDF files, browsing Web pages, and watching a video.
View Planner also mimics user behavior by allowing for think time during activities. For more information, see
the View Planner Installation and User Guide.
Microsoft Office
Other Applications
Figure 3: Applications Used in View Planner Workload
To simulate a heavy user, all applications were selected, a fast moving video with many screen changes was
played, and a think time of 2 seconds was used. For a medium user, all the applications were selected, but a
slower video with fewer changes and a think time of 5 seconds were used.
View Planner was run for multiple iterations, with each iteration completing all user operations for the specific
group. Each iteration has three phases: ramp-up, steady state, and ramp-down. During each iteration, View
Planner reports the latencies for each operation performed within each virtual machine.
View Planner divides tests into groups. Group A represents interactive operations, and Group B includes CPUand I/O-sensitive operations. Quality of service (QoS) is determined for Group A user operations as 1 second,
and Group B user operations as 6 seconds.
T E C H N I C A L W H I T E PA P E R / 7
RDSH Sizing
With sizing, the goal is to consolidate as many sessions as possible on a particular infrastructure without
sacrificing quality. To assess sizing, three aspects of performance were examined:
How many users or sessions per physical core can a desktop or application session support?
How many vCPUs are used for an RDSH virtual machine?
How many RDSH virtual machines are needed?
Windows 7 32-bit,
1 vCPU, 1 GB RAM
VM
VM
VM
PCoIP
VM
VM
VM
VM
VM
VM
VM
VM
VM
Client VMs
or Users
Figure 4: System Under Test for RDSH Server Virtual Machine Sizing
T E C H N I C A L W H I T E PA P E R / 8
Figure 5: View Planner Group B Response Time with Increasing Users per Core
T E C H N I C A L W H I T E PA P E R / 9
Figure 6: View Planner Group B Response Time with Increasing Application Sessions per Core
T E C H N I C A L W H I T E PA P E R / 1 0
CPU Usage
Figure 7 shows the percentage of the guest CPU used when performing the various workload tasks. PCoIP used
on average 71.6 percent, RDP used 68 percent, and ICA 71.2 percent. All the protocols used about the same
amount of compute power on the guest operating system.
Figure 7: CPU Usage in the RDSH Virtual Machine for Three Competing Remote Display Protocols
Bandwidth Usage
Figure 8 shows the bandwidth usage of the 60 sessions for each display protocol. The average kilobits per
second (Kbps) are calculated for the workload runs. PCoIP averaged 44.7 Kbps per session, RDP used 50.7
Kbps, and ICA 48.4 Kbps. PCoIP bandwidth performance is about 10 percent better than RDP and ICA.
Figure 8: Bandwidth Usage for View Planner Medium Workload per Remote Display Protocols
T E C H N I C A L W H I T E PA P E R / 1 1
Windows 7 32-bit,
1 vCPU, 1 GB RAM
VDI VMs
VM
VM
Windows 7 32-bit,
1 vCPU, 1 GB RAM
VM
VM
VM
VM
Client VMs
or Users
T E C H N I C A L W H I T E PA P E R / 1 2
Figure 10 shows the response time for Groups A and B, comparing Horizon 6 with View and View 5.3.
Horizon 6 with View shows about a 10 percent lower response time in both interactive and I/O-sensitive
operations compared to View 5.3. Bandwidth usage was about 30 percent lower in Horizon 6 with View due to
changes in PCoIP default settings.
6
5
4
View 5.3
Horizon 6 with View
3
2
1
0
Group A 95%
Group B 95%
Figure 10: View Planner Response Times Comparing Horizon 6 with View and View 5.3
T E C H N I C A L W H I T E PA P E R / 1 3
CAD workload 2 A Solid Edge CAD viewer is run in isolation for the duration of the test. During the test, a 3:1
reducer model was used, as illustrated in Figure 11.
Figure 11: 3-to-1 Reducer Model Used in Performance Tests with the Solid Edge Viewer
A think time of 5 seconds was used. All tests used the default settings. The setup is shown in Figure 12. The
workloads were run on a single Dell R720 server with different VM consolidation ratios. The number of VMs
that can be supported per GPU can be dictated by either the GPUs compute resources being exhausted or the
GPUs available memory being exhausted.
VM
VM
VM
VM
VM
Figure 12: Setup for Measuring the Performance of the vSGA Stack and Horizon 6 with View
T E C H N I C A L W H I T E PA P E R / 1 4
Light 3D Workload
For the light 3D workload, the number of VMs was gradually increased until the View Planner response
threshold was exceeded. The results are shown in Figure 13. The corresponding CPU utilization, as measured
using esxtop, is shown by the line graph.
1.20
FAIL
1.00
80
PASS
70
0.80
60
0.60
50
40
0.40
30
20
0.20
10
0.00
16
32
View Planner
Threshold
90
48
64
80
96
Number of VMs
120
128
Based on this data, it is clear that the vSGA stack can support 120 users while each user is executing the light
3D workload. Running on higher-performance processors typically delivers even higher consolidation ratios.
The results shown in Figure 13 were obtained using desktop VMs configured with 128 MB video RAM. Because
the test bed had two NVIDIA GRID K2 GPUs with 8 GB DRAM each, the GPUs can support only about 128
desktop VMs. For this light 3D workload, the maximum consolidation ratio achieved on the dual-socket server
under test was 128. The test was stopped when the VM responsiveness exceeded the upper limit allowed by the
View Planner responsiveness threshold.
CAD Workload
In this workload, a Solid Edge viewer runs a single model: a 3-to-1 reducer, as shown in Figure 11. As with the
light 3D workload, the simulated users interaction with the model mimics a real users pattern. Figure 14
illustrates the scalability of the vSGA as the load on the server with two K2 cards is increased. As the number of
VMs is increased from 1 to 72, the aggregate remotely delivered FPS increased by 35 times for K2, and 30 times
for K1. Peak GPU usage was about 95 percent.
Figure 16 presents the same results as Figure 14, but shows the remotely delivered FPS data on a per-VM
basis. This view of the data highlights that the performance of the individual VMs moderately decreases as the
number of VMs is scaled.
T E C H N I C A L W H I T E PA P E R / 1 5
100
35
90
80
30
70
25
60
20
50
15
40
30
10
20
5
0
10
16
32
48
Number of VMs
64
72
80
Fails FPS
Threshold
Criterion
100
35
90
80
30
70
25
60
20
50
15
40
30
10
20
5
0
10
16
32
48
Number of VMs
56
64
72
Figure 14: Scalability of the vSGA Solution as the Server Load Is Increased with NVIDIA K2 Cards
Figure 15: Scalability of the vSGA Solution as the Server Load Is Increased with NVIDIA K1 Cards
T E C H N I C A L W H I T E PA P E R / 1 6
1.20
1.00
0.80
FPS Threshold
0.60
PASS
0.40
FAIL
0.20
0.00
8
16
32
Number of VMs
48
64
72
80
Figure 16: Remotely Delivered FPS Data on a per-VM Basis with K2 Graphics Cards
1.20
1.00
0.80
FPS Threshold
0.60
PASS
0.40
FAIL
0.20
0.00
8
16 32
Number of VMs
48
56
64
72
Figure 17: Remotely Delivered FPS Data on a per-VM Basis with K1 Graphics Cards
T E C H N I C A L W H I T E PA P E R / 1 7
Desktop VMs
Windows 7 32-bit,
1 vCPU, 1 GB RAM
VM
VM
VM
VM
VM
VM
HP ProLiant BL 460,
8-core Intel Xeon E5540,
2.53 GHz, 96 GB RAM with
hybrid storage array
with FC protocol
Figure 18: Setup for Virtual SAN (Host Configuration for a 3-Node Cluster)
T E C H N I C A L W H I T E PA P E R / 1 8
To highlight the PCoIP default changes, we compared the View Planner score on Virtual SAN 5.5 (vSphere 5.5
U1) on View 5.3.2, and Horizon 6 with View and Virtual SAN public beta (vSphere 5.5). On a three-node cluster,
VDImarkthe maximum number of desktop VMs that can run passing QoS criteriais obtained for both Virtual
SAN 5.5 and Virtual SAN public beta. The results are shown in Figure 19.
400
350
VDImark
300
250
286
305
341
200
150
100
50
0
The results show that with Virtual SAN 5.5, we can scale up to 305 VMs on a three-node cluster, which is about
5 percent more consolidation than Virtual SAN public beta. Compared to prior releases, Virtual SAN 5.5 has
about 10 percent more consolidation with Horizon 6 with View. This improvement is primarily due to PCoIP
default changes, which provide better host consolidation and lower bandwidth usage.
T E C H N I C A L W H I T E PA P E R / 1 9
T E C H N I C A L W H I T E PA P E R / 2 0
vCPU
One for Windows XP and Windows 7 and Windows 8; two for multimediaintensive apps
Memory
Network adapter
vmxnet3, flexible
Storage adapter
VMware Tools
Latest installed (also make sure that the balloon driver is functioning properly)
Visual settings
Disable services*
Disable hibernation
Disable system restore
Set screensaver to None
Other settings
T E C H N I C A L W H I T E PA P E R / 2 1
PCoIP Settings
After optimizing the master VM, you can adjust PCoIP settings to realize the best user experience. Table 2 lists
group policy object (GPO) settings to improve the user experience for WAN environments. But even the default
settings save lots of bandwidth. More PCoIP settings are discussed in the VMware View 5 with PCoIP: Network
Optimization Guide.
S ETTI NG
D E FAULT
R ECOMMEN DATION
DESCR IPTION
Session Audio
BW limit
500 Kbps
50100 Kbps
Maximum
frame rate
30
Maximum session
bandwidth
Client-side
cache size
250 MB
T E C H N I C A L W H I T E PA P E R / 2 2
Conclusion
This paper presented performance data and best practices for VMware Horizon 6 and highlighted some of the
new and enhanced features. It describes some of the sizing practices for RDSH virtual machines to get the best
performance for RDSH desktop sessions and applications. According to the findings, the vCPU for the RDSH
should be configured so that the VM fits within one NUMA node, and the number of RDSH server instances
should allow all hyperthreaded cores to be fully utilized. For RDSH desktop and application sessions, 300 MHz
per session provides a good user experience for a medium workload.
In Horizon 6 with View, the PCoIP defaults have changed, resulting in a 10 percent improvement in the user
experience and 30 percent lower bandwidth usage. Similar results are seen when the desktops are deployed on
Virtual SAN 5.5.
For the vSGA feature, the results illustrated the ability of VMware hardware-backed 3D support to scale
efficiently, demonstrating the benefits of GPU virtualization and the strength of the VMware 3D strategy.
T E C H N I C A L W H I T E PA P E R / 2 3
References
VMware View Planner Installation and User Guide
VMware View Planner download
VMware View Planner: Measuring True Virtual Desktop Experience at Scale
Comprehensive User Experience Monitoring
Optimization Guide for Windows 7 and Windows 8 Virtual Desktops in Horizon with View
Remote Desktop Protocol 8.0 update for Windows 7 and Windows Server 2008 R2
VMware Horizon View 5.2 and Hardware Accelerated 3D Graphics: Performance Study
VMware View 5 with PCoIP: Network Optimization Guide
VDI Benchmarking Using View Planner on VMware Virtual SAN (VSAN), part 1
VDI Benchmarking Using View Planner on VMware Virtual SAN (VSAN), part 2
VDI Benchmarking Using View Planner on VMware Virtual SAN (VSAN), part 3
VDI Performance Benchmarking on VMware Virtual SAN 5.5
Simulating different VDI users with View Planner 3.0
VMware Horizon with View
VMware Virtual SAN
VMware Horizon 6 Reference Architecture
VMware Horizon 6 RDSH Performance and Best Practices Whitepaper
VMware Horizon 6 and Hardware Accelerated 3D Graphics - Performance and Best Practices
T E C H N I C A L W H I T E PA P E R / 2 4
VMware, Inc. 3401 Hillview Avenue Palo Alto CA 94304 USA Tel 877-486-9273 Fax 650-427-5001 www.vmware.com
Copyright 2015 VMware, Inc. All rights reserved. This product is protected by U.S. and international copyright and intellectual property laws. VMware products are covered by one or more patents listed at
http://www.vmware.com/go/patents. VMware is a registered trademark or trademark of VMware, Inc. in the United States and/or other jurisdictions. All other marks and names mentioned herein may be
trademarks of their respective companies. Item No: VMW-TWP-HORZVIEWPERFBESTPRAC-USLET-20150324-WEB