VxRail Upgrade Guide
VxRail Upgrade Guide
Table of Contents
1. Introduction .............................................................................................................. 5
2. Upgrade Stages........................................................................................................ 6
2.1 LCM Introduction............................................................................................... 6
2.2 Uploading and Unpacking Composite Bundle ................................................... 6
2.2.1 Upload and Unpack common Issues .......................................................... 6
2.3 Upgrade VxRail Manager .................................................................................. 9
2.3.1 Upgrade VxRail Manager Common issues ................................................. 9
2.4 Upgrade Platform Service Controller and vCenter .......................................... 10
2.4.1 Platform Service Controller and vCenter common issues ......................... 10
2.5 Upgrade ESXi ................................................................................................. 13
2.5.1 Service Module (iSM) ............................................................................... 13
2.5.2 DellPTagent .............................................................................................. 14
2.5.3 Platform Service ....................................................................................... 14
2.5.4 Firmware ................................................................................................... 15
2.5.5 ESXi and vibs ........................................................................................... 15
2.5.6 ESXi and vibs Common issues, ................................................................ 15
3. Upgrade pre-checks .............................................................................................. 17
3.1 Upgrade Paths ................................................................................................ 17
3.2 System compatibilities .................................................................................... 21
3.2.1 NSX Compatibilities .................................................................................. 21
3.2.2 VDP .......................................................................................................... 24
3.2.3 RP4VM (Recover Point for VM) ................................................................ 25
3.3 Unsupported CPUs ......................................................................................... 26
3.4 Resources utilization ....................................................................................... 28
3.4.1 Compute and space utilization .................................................................. 28
3.5 Health checks ................................................................................................. 30
3.6 Password verification checks .......................................................................... 33
3.6.1 vCenter / VCSA VM .................................................................................. 33
3.6.2 PSC VM .................................................................................................... 33
3.6.3 VxRail Manager ........................................................................................ 34
3.6.4 Host / Node ............................................................................................... 34
3.7 DNS Check ..................................................................................................... 34
3.8 Temporary IP check ........................................................................................ 35
3.9 Upgrade bundle Checksum ............................................................................ 35
3.10 BOSS firmware check ..................................................................................... 36
3.11 Scripts ............................................................................................................. 36
1. Introduction
Dell EMC- VxRail Upgrade Troubleshooting Guide is a document to assist in
troubleshooting the failures occurs while upgrading the VxRail cluster from a code
version to another. In this document, resolutions and workarounds for the upgrade
failures are proposed based on the official Dell EMC knowledge base articles and
documents, in addition to other resolutions that are used to resolve the similar issues
when happened in the VxRail systems previously.
Upgrade Guide v1.0 5 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
2. Upgrade Stages
VxRail LCM is the life cycle management module that provides the upgrade of VxRail system
components. It exposes a set of CLI commands as Python script for users to directly launch LCM
operations. It is delivered as a ZIP bundle.
In this stage VxRail LCM script checks for free space in VxRail Manager before upload starts, java code
LocalDiskChecker runs to check if there is a free space available . The VxRail Upload Bundle is uploaded
at /data/store2/lcm , VxRail Composite upload Bundle is also uploaded in the vSAN Datastore at
upgradeBundles , if there is not enough free space in the vSAN Datastore , lcm will through an IO Failure
, in addition , communication or permission issues will cause the same IO failure in VxRail GUI
Upgrade Guide v1.0 6 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
Upgrade Guide v1.0 7 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
Upgrade Guide v1.0 8 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
https://support.emc.com/kb/517613
Upgrade Guide v1.0 9 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
Phase 1: The process will deploy a new PSC and VCSA and assign it with a temp IP address
that it is not used in the network click here for Temp IP prechecks
Phase 2 Once deployment completes it will migrate the data from the source system, then shut
down the old appliances, the IP address of the old VM will be configured on the newly deployed
appliance.
Notice: The above process will be observed during the upgrade if the target version of PSC and
VC are different generation.
Example: Upgrading from 6.0 to 6.5
Upgrade Guide v1.0 10 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
Upgrade Guide v1.0 11 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
https://support.emc.com/kb/525023
VxRail: Failed to pre-check virtual machine VMware vCenter Server Platform Services
Controller 6.5 meets upgrade requirements
Article Number 000520723
https://support.emc.com/kb/520723
Upgrade Guide v1.0 12 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
There are three main components that contribute in upgrading ESXi , any communication issues
on those components will cause an upgrade failure.
The iDRAC Service Module (iSM) is a small OS-resident process that expands iDRAC
management into supported host operating systems. Specifically, iSM adds the following
services:
OS information
Lifecycle Controller Log replication into operating system
Automatic system recovery
Windows Management Instrumentation (WMI) providers inclusive of storage data
SupportAssist collection (formerly known as Automatic Technical Support Report Collection)
NVMe PCIe SSD device removal support
iSM is the primary connection between iDRAC and ESXi , failure in communication will affect
the functionality of Dell PTagent
Upgrade Guide v1.0 13 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
2.5.2 DellPTagent
Upgrade Guide v1.0 14 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
2.5.4 Firmware
Upgrading Firmware hardware component will require the host to reboot, it is always a best
practice to monitor firmware upgrade process by logging to the iDRAC virtual console to verify
that the firmware upgrade is going smooth.
Upgrade Guide v1.0 15 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
Upgrade Guide v1.0 16 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
3. Upgrade pre-checks
Upgrade Guide v1.0 17 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
Upgrade Guide v1.0 18 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
Upgrade Guide v1.0 19 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
Upgrade Guide v1.0 20 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
Does the customer have NSX (Network & Security) on the system?
Before you upgrade VxRail to 4.7 please make sure customer has
upgraded the NSX to the compatible version in case it causes
downtime.
Upgrade Guide v1.0 21 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
If the customer is having NSX-T and he wants to upgrade the VxRail cluster, then the
compatibility check is needed.
To check if the customer is having NSX-T, you have to run debug_vxrm-vc.sh script on VxRail
Manager CLI.
Upgrade Guide v1.0 22 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
If the customer is having NSX-T please check the compatibility of vCenter and NSX. Click
here to check the compatibility.
Upgrade Guide v1.0 23 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
3.3.2 VDP
On April 5th, 2017, VMware announced the End of Availability (EOA) of the VMware vSphere
Data Protection (VDP) product. VMware vSphere 6.5 is the last release to include vSphere Data
Protection and future vSphere releases will no longer include this product. Therefore
vSphere/vCenter 6.7 do not include support for VDP .
Please use the debug_vxrm-vc.sh script to determine if the customer is using VDP and wants
to upgrade to VxRail 4.7.x code.
Upgrade Guide v1.0 24 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
Refer to KB https://kb.vmware.com/s/article/2149614
Support Matrix:
https://www.vmware.com/resources/compatibility/sim/interop_matrix.php#interop&68=&2=
Dell EMC KB: https://support.emc.com/kb/536118
Customers should contact their local team and/or Sales representative for the latest information.
Upgrade Guide v1.0 25 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
4. If RP4VM is present and customer wants 4.7.X – RP4VM's cluster(s) must be upgraded
to 5.2 Patch 4 first [5.2.0.4] [Customer Upgradeable not support]
Comparing the processors supported by vSphere 6.5, vSphere 6.7 no longer supports the
following processors:
• AMD Opteron 13xx Series
• AMD Opteron 23xx Series
• AMD Opteron 24xx Series
• AMD Opteron 41xx Series
• AMD Opteron 61xx Series
• AMD Opteron 83xx Series
• AMD Opteron 84xx Series
• Intel Core i7-620LE Processor
Upgrade Guide v1.0 26 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
If you are not able to access a vCenter UI and need to run the check of CPU type via CLI please
use the following command on each host or on all hosts via vxtool
vim-cmd hostsvc/hostsummary | grep cpuModel
Upgrade Guide v1.0 27 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
This shows how much resources will be available when a host is rebooted during the upgrade.
Recommendations:
• Space usage in the time before upgrade with no failures should be 75% or less.
• Space usage after failure/re-protection should be 90% or less.
Exceeding these values increases the chances of a resync occurring during the upgrade and
will cause extended delays while waiting for the resyncs to complete.
Customers can try and free up space by deleting snapshots or moving VM's off the VxRail
before the upgrade otherwise a capacity upgrade [add more disks] would be required.
The utilization checks should be performed for both compute and space resources and this is
illustrated with the steps in the below sections.
Command> rvc
password:
0 /
1 localhost/
> ls
0 /
Upgrade Guide v1.0 28 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
1 localhost/
> cd 1
/localhost> ls
0 MARVIN-Datacenter (datacenter)
/localhost> cd 0
/localhost/MARVIN-Datacenter> ls
0 storage/
1 computers [host]/
2 networks [network]/
3 datastores [datastore]/
4 vms [vm]/
/localhost/MARVIN-Datacenter> cd 1
/localhost/MARVIN-Datacenter/computers> ls
0 Production-Cluster
1 MARVIN-Virtual-SAN-Cluster-0df82f82-4860-4728-9aa0-6257f52a86d5 (cluster):
cpu 82 GHz, memory 307 GB
/localhost/MARVIN-Datacenter/computers> vsan.whatif_host_failures 1 -s
Note: Please use the number shown at the start of the line "*-Virtual-SAN-Cluster-UUID"
in the command.
In this case the value is a '1' but can vary between each VxRail setup and whether
customer has renamed the cluster.
+----------------------+---------+--------------+------+----------+-----------------+--------------+
| | | HDD Capacity | | | Components | SSD Capacity |
| Host | NumHDDs | Total | Used | Reserved | Used | Reserved |
+----------------------+---------+--------------+------+----------+-----------------+--------------+
| g3node01.vspex.local | 2 | 3576.98 GB | 20 % | 15 % | 73/9000 (1 %) | 0 % |
| g3node03.vspex.local | 2 | 3576.98 GB | 20 % | 14 % | 73/9000 (1 %) | 0 % |
| g3node02.vspex.local | 2 | 3576.98 GB | 26 % | 23 % | 73/9000 (1 %) | 0 % |
Upgrade Guide v1.0 29 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
+----------------------+---------+--------------+------+----------+-----------------+--------------+
+-----------------+-----------------------------+-----------------------------------+
| Resource | Usage right now | Usage after failure/re-protection |
+-----------------+-----------------------------+-----------------------------------+
| HDD capacity | 22% used (8370.69 GB free) | 33% used (4793.71 GB free) |
| Components | 1% used (26781 available) | 1% used (17781 available) |
| RC reservations | 0% used (0.00 GB free) | 0% used (0.00 GB free) |
+-----------------+-----------------------------+-----------------------------------+
In order to ensure successful upgrade process, the available compute resources in the cluster
should tolerate placing at least one host in maintenance mode.
As recommended by VMware, there should be 30% or more free resources available for cluster
management, maintenance activities and High Availability fail-over.
The below steps can help in examining the cluster if it will allow the ESXi host to be placed in
maintenance mode or not.
1- Change the ClomDelay value of one host to 360 instead of 60 to avoid triggering a
resync.
3- If the host is placed successfully in maintenance mode, this means that there are
sufficient compute resources to place hosts in maintenance mode.
5- In case of failures while placing the host in maintenance mode because of insufficient
resources, a downtime is needed to power off some VMs to proceed with the upgrade.
The data health should be checked both from vCenter CLI and vSphere web client.
/localhost/MARVIN-Datacenter/computers> ls
Upgrade Guide v1.0 30 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
0 Production-Cluster
1 MARVIN-Virtual-SAN-Cluster-0df82f82-4860-4728-9aa0-6257f52a86d5 (cluster):
cpu 82 GHz, memory 307 GB
/localhost/MARVIN-Datacenter/computers> vsan.health.health_summary 1
Upgrade Guide v1.0 31 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
One more hralth check on vCenter CLI is the oject health check, to check the validation of
objects to proceed with the upgrade.
1- Open “rvc”
2- Follow the below steps
/localhost/MARVIN-Datacenter/computers> ls
0 Production-Cluster
Upgrade Guide v1.0 32 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
Please perform a password check to verify all passwords are working and are not expired
Use the following guide to test all the passwords
Take the IP / FQDN from the URL on the browser page for vSphere Web Client
Please verify that the administrator Single-Sign-On account is logging in to the web UI
successfully as well as the root account from the CLI.
If the root account is not logging in successfully, click here for the article to reset it.
If SSH service is not enabled, click here for the procedure to enable it
3.7.2 PSC VM
Select the PSC from the list of VM’s and navigate to summary, copy the IP from the details pane.
User Account required:
• Root
SSH to the node and try to login using the root account of PSC. If the root account is not logging
in successfully, click here for the article to reset it.
If SSH service is not enabled, click here for the procedure to enable it
Upgrade Guide v1.0 33 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
Take the IP / FQDN from the URL of the VXRM web page or retrieve using same instructions as for PSC
VM.
User Account required:
• mystic AND root
SSH to the VxRail Manager VM and try to login using the mystic account of VxRail Manager.
The default mystic password is VxRailManager@201602!
If the mystic account is locked due to multiple incorrect login attempts, click here for the
unlock steps
After logging in to the CLI with the mystic account, run the below command to test the root
password login.
su root
If the root account is not logging in successfully, click here for the article to reset it.
VxRail Manager v4.5 and higher does not support internal DNS. Please prepare for and supply
your external DNS prior to upgrading to v4.5.x.
• Run Vxtool to get the DNS IP address, compare this to the vCenter Server IP. If they
match then the customer is using the vCenter DNS Server.
• Please inform customer to supply the external DNS before upgrading to v4.5.x code.
Upgrade Guide v1.0 34 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
A temporary IP Address on the same subnet is needed to upgrade if Customer is using internal
vCenter server.
Try to ping the temporary IP provided by the customer to confirm that is not currently used and
please confirm with customer that this IP was not assigned to any decommissioned component.
Checksum is used to verify that the upgrade bundle is not corrupted or wrongly downloaded.
Click here to access the knowledge base article for the procedure to check the bundle
validation.
Upgrade Guide v1.0 35 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential
Dell EMC- VxRail Upgrade Troubleshooting Guide
Use the vxtool [Option 3] and run the following command on all nodes to get the current
firmware level.
This should return the BOSS Firmware level for any 14G nodes. If firmware is below
2.5.13.3011 then it needs to be upgraded BEFORE the VxRail upgrade is completed.
3.12 Scripts
• Checker.py
• debug_vxrm-vc.sh
• fixed_local_cert.sh
• preupgrade_check.sh
• supportool-0.3.2.py
• vsan_issue_check.sh
• VSANDTA525859.py
• vxtool_v3
Upgrade Guide v1.0 36 EMC Confidential Information- for internal use only
Dell Internal Communication - Confidential