How To Configure A DRS Cluster in VMware
How To Configure A DRS Cluster in VMware
nakivo.com/blog/vmware-cluster-drs-configuration/
In this post, we are going to explore how DRS works and how to configure a DRS cluster with an
explanation of the main settings.
Complete data protection for VMware vSphere VMs and instant recovery options. Secure backup targets
onsite, offsite and in the cloud. Anti-ransomware features.
DISCOVER SOLUTION
If an ESXi host is overloaded and VMs are overloaded, VM performance degrades. With DRS configured,
particular VMs are moved to ESXi hosts with free resources. VMware vMotion provides VM live migration
for running VMs.
1/16
DRS can proactively manage resources in the cluster so that the load between hosts is shared
more evenly.
DRS rules can make sure that certain VMs or groups of VMs are always on different hosts or the
same hosts.
Starting from vSphere 6.5, DRS can take network bandwidth into consideration when scheduling
resources to prevent oversubscription.
When a host is added to a DRS cluster, the host’s resources become part of the cluster’s resources and
are managed accordingly. This allows for the use of cluster-level resource pools and resource allocation
policies.
Resource pools allow you to allocate resources according to your needs, policies, or VM requirements.
This can give some VMs access to more hardware resources and others to fewer resources.
There is a wide range of VMware DRS settings in vSphere Client for managing the resources of vSphere
clusters, and these settings are covered in our configuration guide below.
DRS requirements
DRS requirements are identical to the general vSphere cluster requirements and include the following:
Shared storage. This is storage that is accessible by all cluster nodes and is used to store VM files.
Processor compatibility. Since DRS utilizes vMotion, the vMotion process transfers the running CPU
state between the various ESXi hosts, requiring that destination hosts have at least the equivalent
CPU instruction capability.
Hosts in the cluster must be part of a vMotion network.
2/16
4. Now you are in the Configure tab. In the vSphere vCenter 7 user interface, the HA and DRS
settings are configured using the Configure tab of the cluster. DRS is turned off by default. Select
vSphere DRS in the Services section of the middle pane and click EDIT.
5. We see a simple switcher where DRS can be enabled. Switch on the vSphere DRS toggle to
enable or disable vSphere DRS (see the screenshot below).
3/16
There are four tabs with VMware DRS settings:
Automation
Additional Options
Power Management
Advanced Options
If we drill down into the DRS settings, we can customize the behavior of DRS within our VMware cluster.
VMware vSphere DRS settings are flexible and allow you to configure DRS as you need.
Automation
The Automation tab contains DRS automation settings (see the screenshot above). The automation level
can be selected to decide how much control DRS has over migrating resources.
Automation level. VMware DRS settings allow you to set the level of automation of your DRS cluster.
You have three options:
4/16
Fully automated. VMware vSphere fully automates VM placement and migration within a cluster. If
a DRS detects that the cluster is imbalanced and that the situation can be improved, VM migration
is started automatically. Initial VM placement and VM migration recommendations are applied
automatically.
Partially automated. Initial VM placement recommendations are applied automatically. When DRS
generates VM migration recommendations for load balancing, you have to apply them manually.
Manual. In this mode, DRS generates recommendations for initial VM placement and for VM
migrations. You have to apply them manually when you agree with these recommendations.
Migration Threshold. Use this parameter to set the agression level for DRS behavior when there are
conditions for VM migration and to define how the cluster tolerates the imbalance.
There are 5 slider positions to configure the migration threshold – from most conservative to most
aggressive (left to right slider positions):
1. With the most conservative value, the DRS tolerates the imbalance and initiates VM migrations to
meet the affinity rules when enabling the maintenance mode for an ESXi host.
2. The DRS provides recommendations if a vSphere cluster has imbalanced workloads or resources
are not enough to satisfy VM demands.
3. The middle position of the slider is the default value for DRS. It allows you to have a balance
between a high number of VM migrations and resource distribution in the cluster. This value is the
optimal threshold value for VMware DRS configuration in most cases (see the screenshot above).
4. The DRS provides recommendations when workloads in the cluster are fairly imbalanced. You can
use this DRS threshold position for clusters with workload bursts.
5. The most aggressive mode initiates VM migration even if minimal load balancing benefits and
performance boost can be achieved. The most aggressive mode causes frequent VM migrations in
the DRS cluster. A DRS doesn’t tolerate even the smallest imbalance in this mode.
Predictive DRS. This feature is used only with vRealize Operations Manager. A combination of
Distributed Resource Scheduler and vRealize Operations Manager (with the Dynamic Thresholds
feature) allows you to predict cluster resource demand in the future and migrate VMs before the cluster
becomes imbalanced. Predictive DRS initiates VM migration before having resource contention.
For example, there are spikes in the consumption of CPU and memory resources for a particular VM
every work day from 9 AM to 10 AM and from 6 PM to 7 PM. Knowing this information after collecting
resource consumption data during VM monitoring, vSphere can predict when these spikes will occur the
next time with a high probability. VMware monitoring can significantly help you use your infrastructure
rationally.
Virtual Machine Automation. You can configure custom DRS automation settings for individual VMs and
override global cluster automation settings. A VM can have automation settings that are different from the
DRS cluster automation settings. Select the checkbox to enable this option.
To configure more VM override options, go to the Configure tab for a cluster and in the Configuration
section of the middle pane, hit VM Overrides (Cluster > Manage > Settings > VM Overrides).
5/16
Additional options
Additional options for DRS settings allow you to configure VM distribution, CPU over-commitment, and
scalable shares. There are useful VMware DRS settings in this tab.
VM Distribution. When enabling this option, VMs are distributed across ESXi hosts in the cluster. The
even distribution can improve availability and sometimes reduce the number of VM migrations. You can
use DRS VM Distribution together with Migration Threshold if you are not sure which Migration Threshold
value to choose when you need a lower number of VM migrations.
CPU Over-Commitment. This feature lets you set the number of virtual processors or processor cores
that you can provision based on the existing number of physical processor cores. An ESXi server has
physical processors with the corresponding number of cores.
One virtual processor or processor core of a VM consumes one physical processor core of a physical
server. In this case, the ratio is 1:1, and the CPU performance of VMs is optimal. Hyper-threading is not
considered.
However, sometimes you provision more CPU cores for virtual processors than the number of physical
processor cores on the ESXi server. In this case, the performance of virtual machines that are sensitive to
CPU resources can degrade. For example, if an ESXi server has 48 processor cores (48 pCPU), and you
create 48 VMs each with 2 single-core virtual processors (96 vCPU), you get CPU over-commitment with
a 2:1 ratio.
The CPU Over-Commitment setting allows you to set the maximum over-commitment ratio to tolerate for
VMs on the ESXi host. If the over-commitment ratio exceeds the set value, VM migration is initiated to
other ESXi hosts by DRS. Also, when setting CPU Over-Commitment in VMware DRS configuration, you
ensure that VM migrations to other hosts do not affect CPU performance.
You can follow these recommendations to configure CPU Over-Commitment in your DRS cluster
depending on workloads (vCPU:pCPU ratio):
As you can see, a tier 1 cluster should use the lowest ratio and a tier 4 one the highest ratio.
MaxVcpusPerClusterPct – controls the vCPU:pCPU ratio at the cluster level. The total number of all
VM vCPUs in the cluster is divided by the sum of pCPUs on all ESXi hosts.
MaxVCPUsPerCore – controls the vCPU:pCPU ratio on each ESXi host individually. This value
cannot be exceeded on any ESXi host within a cluster.
6/16
Scalable Shares. This feature improves sharing of computing resources when using multiple resource
pools of different priorities. When enabling Scalable Shares, you can ensure that CPU resources (CPU
shares) are provisioned for high-priority VMs first and only then provisioned for lower-priority VMs.
Scalable shares solve the old issue of when VMs of low and high priorities get the same amount of CPU
resources because their entitlement to get resources is identical.
Power management
Power management options in VMware DRS settings are used to configure Distributed Power
Management (see the screenshot below). Distributed Power Management (DPM) is a feature that initiates
VM migration from a slightly loaded ESXi host to other hosts within the cluster to free up the unloaded
ESXi host and then shut down this host. The idea is to power off the ESXi hosts when there is enough
capacity on other hosts within the cluster to save power.
Note: Shutting down an ESXi host means putting the ESXi host in standby mode. Then the host can be
started with the Wake On LAN feature.
DPM is disabled by default in VMware DRS configuration, and this is the recommended value for optimal
VM performance. Select the Enable checkbox if you need to use the DPM and save power. You need to
enable Wake On LAN in the BIOS of ESXi hosts that are members of the cluster. Network adapters must
support Wake On LAN.
Automation level is the parameter of VMware DRS settings that defines the automation level of DRS.
There are three values for this setting:
7/16
DPM Threshold is a parameter that defines the priority for ESXi shutdown recommendations.
The conservative value (the left position) is used for less frequent VM migrations with vMotion. DPM
initiates VM migrations and server shutdown if there are priority 1 recommendations.
The aggressive value is used to power off ESXi hosts if resource consumption on the hosts is lower than
the target. This causes more frequent VM migration with vMotion. DPM initiates VM migration and server
shutdown even for priority 5 recommendations.
There are 5 values for the DPM threshold. You can set intermediate values between the conservative and
aggressive ones. The median value (3) is set by default.
Note that DPM threshold and DRS threshold are different and independent parameters for VMware DRS
configuration.
Advanced options
Advanced options is the tab that allows you to fine-tune the DRS cluster with advanced configuration of
vSphere DRS settings. Enter a VMware DRS configuration parameter and supported value manually in
the line.
Example 1:
8/16
You can set a period of demand history to evaluate. The collected information is used to evaluate power-
off cost and benefits to make recommendations for DPM. To do this, add the parameter and a value:
HostsMinUptimeSecs 0-max
Example 2:
To set the maximum vCPU:pCPU ratio on the cluster level, enter the following parameter and value:
MaxVcpusPerClusterPct 100
Where 100 is 100% and means a 1:1 ratio, 300% is 3:1, 500% is 5:1, etc.
Any value between 0 and 99 sets the under-committment, where less than 1 vCPU can be used per 1
pCPU.
Affinity rules
Affinity rules define which VMs must be located (or not located) on the same ESXi host or run together (or
separately) on any ESXi host. Affinity rules are also called VM/Host rules. Let’s take a look at how to
configure affinity rules in VMware DRS settings of vSphere Client.
9/16
4. The Create VM/Host Rule window opens.
5. Enter a rule name, for example, Affinity Rule 01.
6. Select the rule type in the drop-down list. There are four affinity rule types:
Keep Virtual Machines Together
Separate Virtual Machines
Virtual Machines to Hosts
Virtual Machines to Virtual Machines
We select the first option (Keep Virtual Machines Together) in this example.
7. Click + Add to select the virtual machines that must run together on any ESXi host of the cluster.
10/16
8. Select the needed VMs in the Add Virtual Machine window.
9. Click OK when done.
11/16
To configure other affinity rule types in VMware DRS settings, you must create VM groups or host groups
first. Let’s find out how to create VM/Host groups in VMware vSphere Client.
12/16
4. The Create VM/Host Group window opens.
5. Enter a group name.
6. Select a type (VM group or host group) in the drop-down list.
7. Click + Add to add the objects (VMs or ESXi hosts) to the group.
8. Click OK to save settings and close the window.
Once you have created VM groups and host groups, you can create the corresponding VM/Host rules
with rule types that require using groups.
13/16
vSphere DRS score
After you enable DRS on the cluster and configure DRS settings, you can check the cluster DRS score:
VMware vSphere DRS measures the happiness for each VM and displays the DRS score for a cluster.
This score is updated every minute. If resources are not sufficient for VMs, the score is less than 100%,
and DRS generates migration recommendations to migrate VMs (manually or automatically, depending
on VMware DRS configuration). The DRS score range of 80-100% is optimal for a cluster.
14/16
The DRS logic and principle of calculating the DRS score were significantly improved in VMware vSphere
7. In addition to VM happiness and ESXi host loads, DRS estimates VM migration costs. DRS estimates
multiple characteristics before making a recommendation for VM migration. The DRS estimates the
workload types, including CPU, memory, and network load.
For example, if there are many memory pages that are continuously updated (dirty memory pages), then
VM migration costs are high because live vMotion takes a long time in this case. The intelligent DRS
algorithms calculate that the potential benefits of such a VM migration are not significant and this affects
the provision of VM migration recommendations.
You can use vSphere DRS and HA together in the same cluster to ensure load balancing and high
availability for your VMs.
Conclusion
VMware DRS technology allows efficient and effective resource management of multiple hosts in the
vSphere cluster. As shown, the feature is easily enabled and is very intuitive to configure. The new
features of vSphere 7.0 also allow the technology to be predictive and proactive. VMware DRS settings
provide an ability to use flexible settings for a cluster.
NAKIVO Backup & Replication is the universal data protection solution that supports VMware backup and
allows you to automate the backup process even for VMs in the cluster and fault-tolerant VMs. Download
Free Edition of the NAKIVO solution and back up VMs in your cluster.
15/16
1 Year of Free Data Protection: NAKIVO Backup & Replication
Deploy in 2 minutes and protect virtual, cloud, physical and SaaS data. Backup, replication, instant
recovery options.
16/16