OneFS UpgradePlanning8.00
OneFS UpgradePlanning8.00
June 2020
Rev. 14
Notes, cautions, and warnings
NOTE: A NOTE indicates important information that helps you make better use of your product.
CAUTION: A CAUTION indicates either potential damage to hardware or loss of data and tells you how to avoid the
problem.
WARNING: A WARNING indicates a potential for property damage, personal injury, or death.
© 2013 - 2020 Dell Inc. or its subsidiaries. All rights reserved. Dell, EMC, and other trademarks are trademarks of Dell Inc. or its
subsidiaries. Other trademarks may be trademarks of their respective owners.
Contents
2 Planning an upgrade..................................................................................................................... 6
Checklist - Planning an Upgrade..........................................................................................................................................6
Upgrade planning overview .................................................................................................................................................6
Prerequisites for completing an upgrade............................................................................................................................6
Review required documentation.......................................................................................................................................... 7
Types of OneFS upgrades.................................................................................................................................................... 7
Parallel upgrades.............................................................................................................................................................. 7
Rolling upgrades............................................................................................................................................................... 8
Simultaneous upgrades................................................................................................................................................... 8
Verify the upgrade path........................................................................................................................................................8
Upgrade Paths from OneFS 8.x to OneFS 8.x and later............................................................................................ 8
Check supportability and compatibility requirements....................................................................................................... 9
Assess upgrade impact........................................................................................................................................................10
Consider upgrade limitations.........................................................................................................................................10
Review feature changes and known issues................................................................................................................ 10
Plan an upgrade schedule................................................................................................................................................... 10
Contents 3
SyncIQ backup............................................................................................................................................................... 20
NDMP backup................................................................................................................................................................20
Back up custom settings............................................................................................................................................... 21
Complete or stop jobs in progress.....................................................................................................................................22
Complete system jobs...................................................................................................................................................22
Update drive firmware........................................................................................................................................................ 22
Configure IPMI ports...........................................................................................................................................................22
Secure Remote Services (SRS)........................................................................................................................................ 22
4 Contents
1
Introduction to this guide
Topics:
• About this guide
• Provide feedback about this document
• Where to go for support
PowerScale OneFS You can find product documentation, troubleshooting guides, videos, and other resources about
Documentation Info Hubs PowerScale OneFS products and features on the PowerScale OneFS Info Hubs page.
6 Planning an upgrade
• OneFS web administration interface
NOTE: If you are not logged into the OneFS cluster with root privileges, you might not be able to run all of the
commands in this guide.
Required documentation
• OneFS Upgrade Process Flowchart
Review this step-by-step reference guide for OneFS upgrades.
• OneFS Release Notes and Maintenance Release Notes
Read the OneFS release notes for information about new features and changes, resolved issues, known issues, and supported upgrade
paths.
• Current PowerScale Software Releases
Confirm which current OneFS releases have reached Target Code status.
• Current PowerScale OneFS Patches
Review patches that have been released for the version of OneFS to which you are upgrading.
• PowerScale Supportability and Compatibility Guide
Confirm that your PowerScale software and PowerScale hardware is compatible with the version of OneFS to which you are
upgrading.
• PowerScale OneFS Technical Specifications Guide
Confirm the recommended settings and thresholds for the version of OneFS to which you are upgrading.
• OneFS Technical and Security Advisories
Determine whether any PowerScale Technical Advisories or Security Advisories have been issued for the version of OneFS to which
you are upgrading.
Parallel upgrades
A parallel upgrade installs the new operating system on a subset of nodes and restarts that subset of nodes at the same time. Each subset
of nodes attempts to make a reservation for their turn to upgrade until all nodes are upgraded. Node subsets and reservations are based
on diskpool and node availability.
During a parallel upgrade, node subsets that are not being upgraded remain online and can continue serving clients. However, clients that
are connected to a restarting node are disconnected and reconnected. How the client connection behaves when a node is restarted
depends on several factors including client type, client configuration (mount type, timeout settings), IP allocation method, and how the
client connected to the cluster.
Parallel upgrades are recommended whenever possible, as they require a smaller maintenance window than rolling upgrades, and do not
require the interruption of service like simultaneous upgrades.
NOTE: The parallel upgrade feature is available in OneFS version 8.2.2 and newer. Only upgrades from OneFS 8.2.2 to
newer OneFS versions can take advantage of the parallel upgrade feature. Any upgrades where the starting cluster is on
OneFS 8.2.1 or older cannot take advantage of the parallel upgrade feature.
Planning an upgrade 7
Rolling upgrades
A rolling upgrade installs the new operating system and restarts each node individually in the OneFS cluster so that only one node is offline
at a time. A rolling upgrade takes longer to complete than a simultaneous upgrade. During a rolling upgrade, nodes that are not actively
being upgraded remain online and can continue serving clients. However, clients that are connected to a restarting node are disconnected
and reconnected. How the client connection behaves when a node is restarted depends on several factors including client type, client
configuration (mount type, timeout settings), IP allocation method, and how the client connected to the cluster.
You can specify the order in which nodes are upgraded by using the --node parameter of the isi upgrade cluster start
command. By default, nodes are upgraded in ascending order from the node with the lowest Array ID to the node with the highest Array
ID. For more information about Array IDs, see Understanding OneFS Group Changes on the Online Support site.
NOTE: See the Verify the upgrade path topic for information about which types of upgrades are supported between
OneFS versions.
Simultaneous upgrades
A simultaneous upgrade installs the new operating system and restarts all nodes in the OneFS cluster at the same time. Simultaneous
upgrades are faster than rolling upgrades but require a temporary interruption of service during the upgrade process. All client connections
to the cluster must be terminated prior to completing the upgrade and data is inaccessible until the installation of the new OneFS
operating system is complete and the cluster is back online.
NOTE: You cannot designate which nodes to upgrade when performing a simultaneous upgrade. You must upgrade all of
the nodes in the cluster when performing a simultaneous upgrade.
isi_for_array uname -r
Review the supported upgraded paths to confirm which version of OneFS you can upgrade your cluster to.
For additional information about upgrade paths compatible with your version of OneFS, please contact your account team.
8 Planning an upgrade
Table 1. OneFS Upgrade Paths(continued)
Upgrade 8. 8. 8. 8. 8. 8. 8. 8. 8. 8. 8. 8. 8. 8. 8. 8. 8. 8. 8. 8. 8. 8. 8. 9.
from 0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 1. 1. 1. 1. 1. 1. 1. 1. 1. 2. 2. 2. 0.
current 0. 0. 0. 0. 0. 0. 0. 0. 1. 1. 1. 0. 0. 0. 0. 0. 1. 1. 2. 3. 0. 1. 2. 0.
version 0 1 2 3 4 5 6 7 0 1 2 0 1 2 3 4 0 1 0 0 0 0 0 0
8.0.0.0 O O O O O O O O O O O O O O O O O O O O O O
8.0.0.1 O O O O O O O O O O O O O O O O O O O O O
8.0.0.2 O O O O O O O O O O O O O O O O O O O
8.0.0.3 O O O O O O O O O O O O O O O O O O
8.0.0.4 O O O O O O O O O O O O O O O O O
8.0.0.5 O O O O O O O O O O O O O O
8.0.0.6 O O O O O O O O O O O
8.0.0.7 O O O O O
8.0.1.0 O O O O O O O O O O O O O O
8.0.1.1 O O O O O O O O O O O O O
8.0.1.2 O O O O O O O O O O
8.1.0.0 O O O O O O O O O O O O
8.1.0.1 O O O O O O O O O O O
8.1.0.2 O O O O O O O O O O
8.1.0.3 O O O O O O O
8.1.0.4 O O O O O O
8.1.1.0 O O O O O O O
8.1.1.1 O O O O O O
8.1.2.0 O O O O O
8.1.3.0 O O O
8.2.0.0 O O O
8.2.1.0 O O
8.2.2.0 =
9.0.0.0
Planning an upgrade 9
Assess upgrade impact
Consider all the areas of your environment that might be affected by upgrading to a new version and plan a strategy for performing the
tasks that must be completed and addressing issues that might occur. Assessing the impact of the upgrade on your environment, clients,
and performance helps ensure that the upgrade does not disrupt your cluster, workflow, or users.
NOTE: If you have a performance-sensitive workload, PowerScale recommends that you consult your Sales Engineer for
assistance during the pre-upgrade planning process.
Data back-up and Estimate the time that it takes to back up your data, considering cluster size, number of files, types of files, and
information file size. Also include time to collect information about the cluster such as status, logs, and settings.
collection
Cluster Schedule time for node and drive health checks and replacement of bad hardware. Include time to update
preparation and configurations and settings that are not supported in the new version.
readiness
Upgrade Estimate the time that it takes to run the upgrade considering cluster size and upgrade type (parallel, rolling, or
maintenance simultaneous). Schedule time to inform users when the upgrade will take place and that client connections might
window be slow, file access might be affected, and clients might be disconnected. A best practice is to upgrade the
cluster during an off-hours maintenance window.
If performing a parallel or rolling upgrade, consider whether you will configure client connection drain times, which
will extend the required maintenance window, but lower the impact on client connections.
Build in time to let the upgrade jobs run to completion and to reestablish permissions and connections. Schedule
time or extend the maintenance window to accommodate post-upgrade tasks such as reconfiguring custom
settings, updating scripts to reflect command and functionality changes in the upgrade version, and potential
troubleshooting.
(Optional) If available, upgrading a test cluster with the same current version of OneFS before you upgrade your production
Upgrade a test cluster can expose issues that could slow down or prevent the upgrade of your production system.
cluster
10 Planning an upgrade
After you upgrade a test cluster, verify that the cluster is operational and validate key workflows on the test
cluster by simulating how administrators, users, and applications interact with the system.
Planning an upgrade 11
3
Completing pre-upgrade tasks
Topics:
• Checklist - Pre-upgrade
• About pre-upgrade tasks
• Collect cluster information
• Check cluster readiness
• Verify configurations and settings
• Download the OneFS installation file
• Upgrade compatibility check utility
• On-Cluster Analysis tool
• HealthCheck tool
• Backup data
• Complete or stop jobs in progress
• Update drive firmware
• Configure IPMI ports
• Secure Remote Services (SRS)
Checklist - Pre-upgrade
Use this checklist to help you track your progress as you perform the pre-upgrade tasks.
Back up data
isi_gather_info
The files generated during the gathering process are stored on the cluster in the /ifs/data/Isilon_Support/pkg directory.
isi status -v
2. Run the following command to check for drives that do not report a status of HEALTHY, L3, or JOURNAL.
3. Run the following command to check the mirror status of the boot drives on each node.
NOTE: If a drive is degraded, do not continue with the upgrade until the issue is resolved.
4. If the cluster has an InfiniBand network, run the following command to confirm whether a node has been assigned the OpenSM
(subnet manager) master role.
Confirm that the output displays only one node in the cluster with the phrase master (opensm). The output should be similar to
the following:
NOTE:
If the command does not return output that includes master (opensm) for any node, then another device has
assumed the OpenSM master role. In that case, confirm that cables from another cluster are not connected to this
cluster's primary or secondary InfiniBand switch, and then run the command again. For more information, contact
PowerScale Technical Support
5. It is recommended that you enable the Virtual Hot Spare (VHS) feature. VHS ensures that the cluster has enough free space available
to smartfail a drive and reprotect the drive's data. See article 471814, How to enable and configure Virtual Hot Spare (VHS) for
instructions.
6. In the OneFS web administration interface, perform the following steps for each node in the cluster.
a. Click Dashboard > Cluster Overview > Cluster Status.
b. In the Status area, click the ID of a node.
c. In the Chassis and drive status area, click Platform .
d. If any correctable or fatal errors are reported, do not continue with the upgrade. If the errors cannot be resolved, contact
PowerScale Technical Support
NOTE: For more information about these commands and checking hardware health, see the following articles:
Node Each node cannot be more than 92 Make more available space.
percent capacity.
Node pool Each node pool cannot be more than Make more available space.
90 percent capacity.
root partition (/) The root partition cannot be more If this directory is at or near the minimum available-space
than 97 percent capacity. requirement, see the following resources for steps to
address the issue:
• Knowledge base article 464118, Node reached 95%
used capacity on the root file system.
• Event ID 100010003, The /(root) partition is near
capacity, is the OneFS event reference.
/ifs The /ifs directory cannot be more If this directory is at or near the minimum available-space
than 90 percent capacity. requirement, see the following resources for steps to
address the issue:
• Knowledge base article 471816, "There is at least one
SmartPool at or over capacity " or "The SmartPool
'[name]' is near or over capacity".
• Event ID 100010004, The cluster's /ifs partition is
near capacity, is the OneFS event reference.
/var The /var partition cannot be more If this directory is at or near the minimum available-space
than 90 percent capacity. requirement, see the following resources for steps to
address the issue:
• Knowledge base article 471789, The /var partition is
near capacity (95% used).
• Event ID 100010001, The /var partition is near
capacity, is the OneFS event reference.
/var/crash The /var/crash directory cannot If this directory is at or near the minimum available-space
be more than 90 percent capacity. requirement, see the following resources for steps to
address the issue:
• Knowledge base article 458364, The crash partition
of a node in the cluster has reached 90% capacity
alert.
• Event ID 100010002, The /var/crash partition is near
capacity, is the OneFS event reference.
For more information, see the Best Practices Guide for Maintaining Enough Free Space on Clusters and Pools.
isi stat
• To confirm how much space is being used in each node pool, run the following command:
isi stat -p
• To confirm how much space is being used by critical directories on the cluster, run the following command:
The isi_for_array output is similar to the following for each node in the cluster:
virtual-demo-3: Filesystem Size Used Avail Capacity Mounted on
--------------------------------------------------------------------
virtual-demo-3: /dev/imdd0a 495M 418M 38M 92% /
virtual-demo-3: devfs 1.0K 1.0K 0B 100% /dev
virtual-demo-3: /dev/imdd1a 495M 6.7M 449M 1% /var
virtual-demo-3: /dev/imdd2a 496M 5.9M 451M 1% /var/crash
virtual-demo-3: OneFS 40G 283M 20G 1% /ifs
• If the command returns any critical errors, check the log files in the following directories for more information:
○ /var/log
○ /var/log/messages
○ /var/crash
• NOTE: If any log files contain messages about a dynamic sector recovery (DSR) failure or a Data Integrity (IDI)
failure, contact PowerScale before you upgrade.
2. Cancel non-critical events before upgrading to prevent a recurrence of notifications that you know to be harmless.
See the following content for more information:
• Article 317661, How to quiet or cancel an event (alert)
• Article 304312, How to reset the CELOG database and clear all historical alerts
• Article 471816, There is at least one smartpool at or over capacity
• Article 454806, Detected IDI failure, attempting DSR
• Article 373706, DSR - Dynamic Sector Recovery Failure
• Article 373712, IDI error. Shallow verification failure in block
• Article 447864, Detected IDI network checksum error on path
• Article 454399, Cluster needs to be restriped but FlexProtect is not running
• OneFS Event Reference, information about specific events for your current version of OneFS
</files>
</user_preserve>
For more information, see article 304460, How to configure a cluster to use Kerberos with NFS in a non-Active Directory environment.
NOTE: See the InsightIQ 4.1.3 guides for additional information about monitoring OneFS 8.2.0 clusters.
NOTE: If you are upgrading from OneFS 8.1.x or older to OneFS 8.2.x or newer, a maximum of 99 FSA reports will be
retained after the upgrade. For more information, see KB article 540000.
b. Optional: To validate the integrity of the downloaded installation file, compare the MD5 or SHA-256 checksum value that you
recorded from the downloads page on the Online Support site to the checksum value returned from the MD5 or SHA-256
command. If the values do not match, re-download the installation file.
1. Open a secure shell (SSH) connection to any node in the cluster and log in to the cluster with the root account.
2. Start the upgrade compatibility check utility by running the following command, where <install-image-path> is the file path of the
upgrade installation file.
isi upgrade cluster assess <install-image-path>
NOTE: The upgrade compatibility check utility might take several minutes to run. If the utility returns errors, resolve
the errors before continuing with the upgrade. Warnings are informational and do not prevent an upgrade.
Free space Checks cluster free space and Do not continue with the upgrade if the utility reports a capacity-related
returns a warning or an error if error. See the Check the available free space section of the Upgrade
capacity thresholds are exceeded for Planning and Process Guide for more information.
the following partitions and node
pools:
/ Returns a
warning if
the root
partition
exceeds 97
percent of
its capacity.
Returns an
error if the
partition
reaches 100
percent
capacity.
/var Returns a
warning if
the /var
partition is
more than
90 percent
full. Returns
an error if
the partition
reaches 100
percent full.
/ifs Returns a
warning if
the /ifs
partition
exceeds 95
percent of
its capacity.
Returns an
error if the
partition
reaches 100
percent
capacity.
Node pools Returns a
warning if
one or more
node pools
exceed 90
percent of
its capacity.
Returns an
error if all the
node pools
reach 90
percent
capacity.
Drive stalls Checks the health of the drives in If the utility reports a stalled drive, do not continue with the upgrade until you
the cluster and returns a warning if smartfail and replace the stalled drive or resolve the stall. See article article
the cluster contains stalled drives. 466391, Introduction to drive stalls, for information about resolving drive
stalls.
Smartfail Determines whether a smartfail If a smartfail operation is running, the utility returns an error. Wait for the
operation status operation is running on any drives or smartfail operation to complete before continuing with the upgrade.
nodes in the cluster.
IntegrityScan job Determines whether the If the integrity scan job is running, wait for the job to complete before
status integrityScan job is running. continuing with the upgrade.
Unresolved Checks for unresolved, critical If the utility reports that there are unresolved critical events, do not continue
critical events events. with the upgrade until you resolve the issues. See the OneFS Event
Reference for more information.
SMB access Checks if one or more SMB shares If the utility reports an error, the upgrade will succeed. However, you cannot
zone association are associated with multiple access create access zones until the path overlap is fixed.
zones where the share paths overlap.
Kerberos keytab Checks whether Kerberos settings If the utility reports an error, you must configure Kerberos settings before
have been configured to preserve upgrading OneFS. See the Preserve the Kerberos keytab file section for
the keytab file. more information. The keytab file is used to migrate Kerberos settings into
the OneFS web administration interface and command-line interface.
HDFS Kerberos Checks whether HDFS Kerberos If the utility reports an error, you must configure Kerberos settings before
keytab settings have been configured to upgrading OneFS.See the Preserve the Kerberos keytab file section for more
preserve the keytab file. information. The keytab file is used to migrate Kerberos settings into the
OneFS web administration interface and command-line interface.
HealthCheck tool
The HealthCheck tool enables you to evaluate the status of specific software and hardware components of your cluster and your cluster
environment. For more information about the HealthCheck tool, see this site: HealthCheck Info Hub
Backup data
It is recommended to backup your cluster data immediately before you upgrade. Schedule sufficient time for the back up to complete
before the upgrade window.
SyncIQ backup
SyncIQ is one option you can use to backup your OneFS cluster. SyncIQ creates and references snapshots to replicate a consistent point-
in-time image of a root directory.
NOTE: If you are upgrading your cluster from OneFS 8.1.0.x or earlier to OneFS 8.1.1.x, 8.1.2.x, or 8.1.3.x, and your
cluster is in Compliance mode, you must ensure that all SyncIQ partners are on the same code and patch level OneFS
before restarting SyncIQ backups, or the backups fail. This issue is resolved in OneFS 8.2.0 and later.
For more information about backing up your OneFS cluster, see the OneFS CLI Administration Guide or the OneFS Web Administration
Guide for your version of OneFS.
NDMP backup
Other OneFS cluster backup options include using the Network Data Management Protocol (NDMP).
From a backup server, you can perform both NDMP three-way backup and NDMP two-way backup processes between a cluster and
backup devices such as tape devices, media servers, and virtual tape libraries (VTLs).
See the OneFS Web Administration Guide or the OneFS CLI Administration Guide for information about backing up data using NDMP.
Passwords for After you upgrade, you might have to Make a list of the local accounts and their passwords before you upgrade.
local user reset the passwords of the local user
accounts accounts that you configured on the
cluster. Other users should be
prepared to reset the passwords of
their local accounts after the
upgrade.
sysctl If you changed the default value PowerScale does not recommend modifying sysctl parameters unless you
parameters assigned to one or more sysctl are instructed to do so by PowerScale Technical Support. If you must modify
parameters by editing either a sysctl parameter, configure the parameter in the /etc/mcp/
the /etc/mcp/override/ override/sysctl.conf file to ensure that the change is preserved
sysctl.conf file or the /etc/ when you upgrade a node or a cluster.
local/sysctl.conf file, you
Before you upgrade, document your custom sysctl parameters and back
might need to reset the parameter
up the /etc/mcp/override/sysctl.conf and /etc/local/
after you upgrade.
sysctl.conf files.
If you modified asysctl parameter
For more information, see article 462759, Configuring sysctls and making
by editing another file—for example,
sysctl changes persist through node and cluster reboots and upgrades.
the /etc/sysctl.conf file—the
change will not be preserved during
the upgrade.
Aspera You must reinstall and reconfigure Before you upgrade, back up the Aspera configuration files in the following
Aspera after you upgrade. directories:
NOTE: Aspera is no longer
• /ifs/.ifsvar/aspera/etc/
supported In OneFS 8.2.0 and
• /ifs/.ifsvar/aspera/www/
newer.
• /usr/local/aspera/var/aspera-prepost
To determine which version of Aspera is compatible with the version of
OneFS to which you are upgrading, see the Supportability and Compatibility
Guide, and then visit the Aspera website to download the Aspera install files.
For more information, see article 493022, How to download Aspera for
OneFS.
Cron jobs Cron jobs settings that were not Document and back up custom cron job settings or configure them in
configured in the /etc/mcp/ the /etc/mcp/override/crontab.smbtime file before you upgrade.
override/crontab.smbtime
2. To cancel a job, run the following command where <job_id> is the ID of the job you want to cancel:
NOTE: Do not cancel the Upgrade, FlexProtect, FlexProtectLin, or IntegrityScan jobs. If any of these four system
jobs are running, you cannot continue with the upgrade. If an Upgrade, FlexProtect, FlexProtectLin, or IntegrityScan
system job takes longer than expected to complete, contact PowerScale Technical Support.
NOTE: Sync polices and jobs must be canceled or paused in order for the upgrade to complete successfully.
NOTE: Use of IPMI ports is supported in OneFS version 8.2.2 and later.
Checklist - Upgrade
Use this checklist to help you track your progress as you perform the upgrade tasks.
NOTE: You can upgrade OneFS using the command-line interface or the web administration interface.
NOTE: The isi upgrade cluster command runs asynchronously. The command does not run the entire upgrade
process; instead, it sets up the upgrade process, which nodes take turns controlling. For this reason, the command
returns quickly. To view the progress of the upgrade, use the isi upgrade view command or the web
administration interface.
After the upgrade, a number of upgrade-related jobs may continue to run on the cluster for some time. During this time, the cluster is
accessible, but you might experience a decrease in cluster performance. After the jobs complete, performance will return to normal. At this
stage, the upgrade is complete, but is not committed. You can still roll back to the previous version of OneFS. Some new features in the
upgrade might not be available until the upgrade is committed.
NOTE: The isi upgrade cluster command runs asynchronously. The command does not run the entire upgrade
process; instead, it sets up the upgrade process, which nodes take turns controlling. For this reason, the command
returns quickly. To view the progress of the upgrade, use the isi upgrade view command or the web
administration interface.
3. Optional: (Optional) You can specify the following rolling upgrade options:
Options Description
Nodes to Upgrade specific nodes with the --nodes <integer_range_list> option.
select for
Specify the nodes in their upgrade order as a comma-separated list (for example, --nodes 7,3,2,5) or as a dash-
upgrade
separated range (for example, --nodes 1-7) of logical node numbers (LNNs).
NOTE: We recommend that you upgrade all the nodes. If you upgrade some nodes, a weekly alert is
sent to confirm that the upgrade is making progress. Do not leave the cluster in a partially upgraded
state for a prolonged period. Some new features in the upgrade might not be available until all the
nodes in the cluster have been upgraded and the upgrade is committed. Refer to the release notes for
the OneFS version that you are upgrading to for information about features that require all the nodes
to be upgraded.
The following example for OneFS 8.2.2 and later starts a rolling upgrade on nodes 7,3,2,5, in that order:
After the upgrade, a number of upgrade-related jobs may continue to run on the cluster for some time. During this time, the cluster is
accessible, but you might experience a decrease in cluster performance. After the jobs complete, performance will return to normal. At this
stage, the upgrade is complete, but is not committed. You can still roll back to the previous version of OneFS. Some new features in the
upgrade might not be available until the upgrade is committed.
SMB3 Client transitions from the restarted node to a new node without disruption.
NFSv2 and NFSv3 Client transitions from the restarting node to a new node without disruption.
NFSv4 Clients that are connected to the cluster using static IP addresses wait for NFS serviceability to resume on the
nodes that they are connected to.
Clients that are connected to the cluster using dynamic IP addresses use NFSv4 failover support. The clients
transition from the restarting node to a new node without disruption.
NOTE: For more information, see article 457328, Best practices for NFS client settings.
NOTE: The isi upgrade cluster command runs asynchronously. The command does not run the entire upgrade
process; instead, it sets up the upgrade process, which nodes take turns controlling. For this reason, the command
returns quickly. To view the progress of the upgrade, use the isi upgrade view command or the web
administration interface.
After the upgrade, a number of upgrade-related jobs may continue to run on the cluster for some time. During this time, the cluster is
accessible, but you might experience a decrease in cluster performance. After the jobs complete, performance will return to normal. At this
stage, the upgrade is complete, but is not committed. You can still roll back to the previous version of OneFS. Some new features in the
upgrade might not be available until the upgrade is committed.
Commit an upgrade
You can commit an upgrade through the web administration interface or the command-line interface. Once you commit the upgrade, you
cannot roll back to the previous version of OneFS.
Command-Line Interface instructions:
1. Open a secure shell (SSH) connection on the lowest-numbered node in the cluster and log in with the root account.
2. To commit the upgrade, run the following command:
NOTE: Once you commit the upgrade, you cannot roll back to the previous version of OneFS. After the upgrade is
committed, all the new features are available.
Web Administration Interface instructions:
1. Log in to any node in the cluster through the web administration interface with the root account.
2. Click Cluster Management > Upgrade.
3. Click the Commit Upgrade to OneFS <version> button.
The cluster might display several confirmation messages. Confirm each message to continue the commit process.
The rollback process must restart all the upgraded nodes simultaneously. This will temporarily disrupt cluster services and data availability.
After the OneFS image is updated on the final node and the node restarts, the web administration interface login page appears. If the login
page does not appear, clear the web browser cache and reload the page. A number of upgrade-related jobs may continue to run on the
NOTE: The rollback process must restart all the upgraded nodes simultaneously. This will temporarily disrupt cluster
services and data availability.
Web Administration Interface instructions:
1. Log in to any node in the cluster through the web administration interface with the root account.
2. Click Cluster Management > Upgrade.
3. Click the Roll Back to OneFS <version> button.
The cluster might display several confirmation messages. Confirm each message to continue the rollback process.
The cluster displays the rollback progress.
isi stat
2. Remove the installation files from the /ifs/data directory by running the following command where <installation_file_name> is the
name of the installation file:
rm /ifs/data/<installation_file_name>
isi_gather_info
isi_for_array -s uname -a
2. View the status of the cluster and ensure all the nodes are operational:
isi status
3. Check the devices in the nodes to validate the status of the drives:
4. Check the status of jobs and resume the jobs that you paused for the upgrade:
6. Ping all the cluster's internal and external interfaces to verify network connectivity and to help verify that SmartConnect works
correctly.
7. Verify the network interfaces:
10. Review the cluster's other log files to check for stray problems:
cat /var/log/messages
13. Check the status of the firmware to ensure that the firmware is consistent across nodes:
NOTE: Following an upgrade to OneFS 8.1.0.0 or later, you must update your OneFS license by generating a license
activation file and submitting it to Software Licensing Central (SLC). For instructions on how to create a new license
file for your OneFS cluster, see the Licensing section of the OneFS Administration Guide.
15. Check the status of the authentication providers to ensure that they remain active:
isi_phone_home --enable
NOTE: If you are using NDMP backups on your cluster, re-enable the NDMP service and test that it's working correctly.
Reinstall Aspera
If you were running Aspera for PowerScale before you upgraded your cluster, you must reinstall Aspera after you upgrade. Aspera is not
supported in OneFS 8.2.0 and later.
1. Run the following command to verify that the upgrade process enabled the Aspera Central service and the Aspera node daemon:
isi services
2. If the Aspera service is not running properly, you must download and re-install a compatible version of Aspera from the Aspera
Enterprise Server website and verify that the Aspera Central service is enabled.
• For information about downloading and installing Aspera for OneFS, see article 493022, How to download Aspera for OneFS.
• For information about which versions of Aspera are compatible with your version of OneFS, see the PowerScale Supportability and
Compatibility Guide.
Troubleshooting overview
If you experience problems with your upgrade, check the upgrade logs and review common upgrade issues.
You can search for OneFS documentation, troubleshooting guides, and knowledge base articles on the Online Support site.
You can go through the Customer Troubleshooting Guide: Upgrades From OneFS 8.x to a later version for troubleshooting information
that is related to upgrade failures and upgrade error messages.
If you need additional help with troubleshooting, contact PowerScale Technical Support.
Common issues
Some problems with cluster upgrades might be caused by the following common upgrade issues:
Dropped node
If a node does not successfully reboot after a OneFS upgrade, you will not be allowed to commit the upgrade. If the upgrade is not
committed, then you will not be allowed to make changes to certain features, such as SMB shares and NFS exports.
Follow these steps for any node that didn't successfully reboot:
1. Smartfail the node that did not come back after rebooting.
2. Check to ensure that the name of the smartfailed node is removed from the cluster node list.
This operation might take some time depending on the amount of data that is migrated off of the node.
3. Reboot any node in the cluster.
isi_sysctl_cluster vfs.nfsrv.rpc.threads_min=x
isi_sysctl_cluster vfs.nfsrv.rpc.threads_max=x