NetBackup102 Troubleshooting Guide
NetBackup102 Troubleshooting Guide
Troubleshooting Guide
Release 10.2
NetBackup™ Troubleshooting Guide
Last updated: 2023-03-22
Legal Notice
Copyright © 2023 Veritas Technologies LLC. All rights reserved.
Veritas, the Veritas Logo, Veritas Alta, and NetBackup are trademarks or registered trademarks
of Veritas Technologies LLC or its affiliates in the U.S. and other countries. Other names may
be trademarks of their respective owners.
This product may contain third-party software for which Veritas is required to provide attribution
to the third party (“Third-party Programs”). Some of the Third-party Programs are available
under open source or free software licenses. The License Agreement accompanying the
Software does not alter any rights or obligations you may have under those open source or
free software licenses. Refer to the Third-party Legal Notices document accompanying this
Veritas product or available at:
https://www.veritas.com/about/legal/license-agreements
The product described in this document is distributed under licenses restricting its use, copying,
distribution, and decompilation/reverse engineering. No part of this document may be
reproduced in any form by any means without prior written authorization of Veritas Technologies
LLC and its licensors, if any.
The Licensed Software and Documentation are deemed to be commercial computer software
as defined in FAR 12.212 and subject to restricted rights as defined in FAR Section 52.227-19
"Commercial Computer Software - Restricted Rights" and DFARS 227.7202, et seq.
"Commercial Computer Software and Commercial Computer Software Documentation," as
applicable, and any successor regulations, whether delivered by Veritas as on premises or
hosted services. Any use, modification, reproduction release, performance, display or disclosure
of the Licensed Software and Documentation by the U.S. Government shall be solely in
accordance with the terms of this Agreement.
http://www.veritas.com
Technical Support
Technical Support maintains support centers globally. All support services will be delivered
in accordance with your support agreement and the then-current enterprise technical support
policies. For information about our support offerings and how to contact Technical Support,
visit our website:
https://www.veritas.com/support
You can manage your Veritas account information at the following URL:
https://my.veritas.com
If you have questions regarding an existing support agreement, please email the support
agreement administration team for your region as follows:
Japan [email protected]
Documentation
Make sure that you have the current version of the documentation. Each document displays
the date of the last update on page 2. The latest documentation is available on the Veritas
website:
https://sort.veritas.com/documents
Documentation feedback
Your feedback is important to us. Suggest improvements or report errors or omissions to the
documentation. Include the document title, document version, chapter title, and section title
of the text on which you are reporting. Send feedback to:
You can also see documentation information or ask a question on the Veritas community site:
http://www.veritas.com/community/
https://sort.veritas.com/data/support/SORT_Data_Sheet.pdf
Contents
■ Troubleshooting a problem
Troubleshooting a problem
The following steps offer general guidelines to help you resolve any problems you
may encounter while you use NetBackup. The steps provide links to more specific
troubleshooting information.
Introduction 10
Troubleshooting a problem
Step 1 Remember the error message Error messages are usually the vehicle for telling you something went wrong.
If you don’t see an error message in an interface, but still suspect a problem,
check the reports and logs. NetBackup provides extensive reporting and
logging facilities. These can provide an error message that points you directly
to a solution.
The logs also show you what went right and the NetBackup operation that
was ongoing when the problem occurred. For example, a restore operation
needs media to be mounted, but the required media is currently in use for
another backup. Logs and reports are essential troubleshooting tools.
Step 2 Identify what you were doing Ask the following questions:
when the problem occurred
■ What operation was tried?
■ What method did you use?
For example, more than one way exists to install software on a client.
Also more than one possible interface exists to use for many operations.
Some operations can be performed with a script.
■ What type of server platform and operating system was involved?
■ If your site uses both the primary server and the media server, was it a
primary server or a media server?
■ If a client was involved, what type of client was it?
■ Have you performed the operation successfully in the past? If so, what
is different now?
■ What is the service pack level?
■ Do you use operating system software with the latest fixes supplied,
especially those required for use with NetBackup?
■ Is your device firmware at a level, or higher than the level, at which it has
been tested according to the posted device compatibility lists?
Introduction 11
Troubleshooting a problem
Record this information for each try. Compare the results of multiple tries. A
record of tries is also useful for others at your site and for Veritas Technical
Support in the event that you cannot solve the problem. You can get more
information about logs and reports.
Step 4 Correct the problem After you define the problem, use the following information to correct it:
■ Take the corrective action that the status code or message recommends.
See the Status Codes Reference Guide.
■ If no status code or message exists, or the actions for the status code
do not solve the problem, try these additional troubleshooting procedures:
See “Troubleshooting NetBackup problems” on page 19.
Step 5 Complete a problem report If your troubleshooting is unsuccessful, prepare to contact Veritas Technical
for Veritas Technical Support Support by filling out a problem report.
Step 6 Contact Veritas Technical The Veritas Technical Support website has a wealth of information that can
Support help you solve NetBackup problems.
https://www.veritas.com/support/en_US.html
Introduction 12
Problem report for Technical Support
Note: The term media server may not apply to the NetBackup server product. It
depends on the context. When you troubleshoot a server installation, be aware that
only one host exists: The primary and the media server are one and the same.
Ignore references to a media server on a different host.
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
Did this problem occur during or shortly after any of the following:
_____ Initial installation
_____ Configuration change (explain)
_____ System change or problem (explain)
_____ Have you observed the problem before? (If so, what did you do that time?)
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
Logs or other failure data you have saved:
_____ All log entries report
_____ Media and Device Management debug logs
_____ NetBackup debug logs
_____ System logs (UNIX)
_____ Event Viewer Application and System logs (Windows)
Ways that you can communicate with us:
_____ MyVeritas.com - case management portal
_____ mft.veritas.com - File transfer portal for https uploads
_____ sftp.veritas.com - File transfer server for sftp transfers
For more information, see the following:
http://www.veritas.com/docs/000097935
_____ email
_____ WebEx
NetBackup-Java administration application on If NetBackup is installed on the computer where the application was
Windows started, the script logs the data in a log file at
install_path\NetBackup\logs\user_ops\nbjlogs.
If NetBackup was not installed on this computer, the script logs the
data in a log file at install_path\Veritas\Java\logs.
Note: When NetBackup is installed where the application is started,
and when install_path is not set in the setconf.bat file, the script
logs the data here: install_path\Veritas\Java\logs.
UNIX/Linux: Queries the host and gathers appropriate diagnostic information about
NetBackup and the operating system.
/usr/openv/netbackup/bin/support/nbsu
See “About the NetBackup support utility (nbsu)” on page 190.
Windows:
install_path\NetBackup\bin\support\
nbsu.exe
The following example describes how you can gather troubleshooting data for
Veritas Technical Support to analyze.
An application does not Wait for several minutes before you assume that the operation
respond. is hung. Some operations can take quite a while to complete,
especially operations in the Activity Monitor and Reports
applications.
Introduction 15
About gathering information for NetBackup-Java applications
Get data about your Run the nbsu command that is listed in this topic. Run this
configuration. command after you complete the NetBackup installation and
every time you change the NetBackup configuration.
Contact Veritas Technical Provide the log file and the output of the nbsu command for
Support analysis.
Chapter 2
Troubleshooting
procedures
This chapter includes the following topics:
■ Extra disk space required for logs and temporary files for the NetBackup
Administration Console
■ Issues with initiating the NetBackup CA migration because of large key size
■ Issues with NetBackup jobs that are enabled for data-in-transit encryption
General test and troubleshooting These procedures define general methods for finding
server and client problems and should be used last.
Other troubleshooting procedures See “Resolving full disk problems” on page 87.
A set of examples is also available that shows host name and service entries for
UNIX systems.
■ See “Example of host name and service entries on UNIX primary server and
client” on page 77.
■ See “Example of host name and service entries on UNIX primary server and
media server” on page 79.
■ See “Example of host name and service entries on UNIX PC clients” on page 81.
■ See “Example of host name and service entries on UNIX server that connects
to multiple networks” on page 82.
cover every problem that can occur. However, they do recommend the methods
that usually result in successful problem resolution.
When you perform these procedures, try each step in sequence. If you already
performed the action or it does not apply, skip to the next step. If you branch to
another topic, use the solutions that are suggested there. If you still have a problem,
go to the next step in the procedure. Also, alter your approach according to your
configuration and what you have already tried.
Step 1 Verify operating systems and Ensure that your servers and clients are running supported operating system
peripherals. versions and that any peripherals you use are supported.
http://www.veritas.com/docs/DOC5332
Step 2 Use reports to check for Use the All Log Entries report and check for NetBackup errors for the
errors. appropriate time period. This report can show the context in which the error
occurred. Often it provides specific information, which is useful when the
status code can result from a variety of problems.
If you find a status code or message in either of these reports, perform the
recommended corrective actions.
Step 3 Check the operating system Check the system log (UNIX) or the Event Viewer Application and System
logs. log (Windows) if the problem pertains to media or device management and
one of the following is true:
■ NetBackup does not provide a status code.
■ You cannot correct the problem by following the instructions in NetBackup
status codes and messages.
■ You cannot correct the problem by following the instructions in media
and device management status codes and messages.
These logs can show the context in which the error occurred. The error
messages are usually descriptive enough to point you to a problem area.
Troubleshooting procedures 21
Troubleshooting NetBackup problems
Step 4 Review the debug logs. Read the applicable enabled debug logs and correct any problems you
detect. If these logs are not enabled, enable them before you retry the failed
operation.
Step 5 Retry the operation. If you performed corrective actions, retry the operation. If you did not perform
corrective actions or if the problem persists, continue with the next step.
Step 6 Get more information for If you see the problem during a new installation or upgrade installation, or
installation problems. after you make changes to an existing configuration, see the following
procedures:
Step 7 Ensure that the servers and If you experienced a server or a client disk crash, procedures are available
clients are operational. on how to recover the files that are critical to NetBackup operation.
See “About disk recovery procedures for UNIX and Linux” on page 225.
Step 8 Ensure that the partitions Verify that you have enough space available in the disk partitions that
have enough disk space. NetBackup uses. If one or more of these partitions is full, NetBackup
processes that access the full partition fail. The resulting error message
depends on the process. Possible error messages: "unable to access" or
"unable to create or open a file."
Step 9 Increase the logging level. Enable verbose logging either for everything or only for the areas that you
think are related to the problem.
Step 10 Determine which daemons or Follow the procedures for UNIX or Windows NetBackup servers.
processes are running.
See “Verifying that all processes are running on UNIX servers” on page 22.
/usr/openv/netbackup/bin/bpps -x
Troubleshooting procedures 24
Troubleshooting NetBackup problems
2 Ensure that the following processes are running on the NetBackup servers:
Primary server
Media server
/usr/openv/netbackup/bin/initbprd
/usr/openv/netbackup/bin/nbwmc
5 If any of the media server processes are not running, stop the device process
ltid by running the following command:
/usr/openv/volmgr/bin/stopltid
6 To verify that the ltid, avrd, and robotic control processes are stopped, run
the following command:
/usr/openv/volmgr/bin/vmps
7 If you use ACS robotic control, the acsssi and the acssel processes may
continue to run when ltid is terminated. Use the UNIX kill command to
individually stop those robotic control processes.
8 Then, start all device processes by running the following command:
/usr/openv/volmgr/bin/ltid
/usr/openv/netbackup/bin/bp.kill_all
/usr/openv/netbackup/bin/bp.start_all
Table 2-2 Steps to ensure that all necessary processes are running on
Windows servers
Step 1 Start all services on the The following services must be running for typical backup and restore operations
primary servers. (steps 1, 2, and 3 in this table). If these services are not running, start them by
using the NetBackup Activity Monitor or the Services application in the Windows
Control Panel.
■ NetBackup Authentication
■ NetBackup Client Service
■ NetBackup Compatibility Service
■ NetBackup Database Manager
■ NetBackup Discovery Framework
■ NetBackup Enterprise Media Manager
■ NetBackup Event Manager
■ NetBackup Indexing Manager
■ NetBackup Job Manager
■ NetBackup Policy Execution Manager
■ NetBackup Scale-Out Relational Database Connection Pool Service
■ NetBackup Scale-Out Relational Database Manager
■ NetBackup Remote Manager and Monitor Service
■ NetBackup Request Daemon
■ NetBackup Resource Broker
■ NetBackup Service Layer
■ NetBackup Service Monitor
■ NetBackup Storage Lifecycle Manager
■ NetBackup Vault Manager
■ NetBackup Volume Manager
■ NetBackup Web Management Console
■ Veritas Private Branch Exchange
Note: Additional processes may also need to be running if other add-on products,
database agents, and so forth are installed. For additional assistance, see
https://www.veritas.com/support/en_US/article.100002166
Troubleshooting procedures 27
Troubleshooting NetBackup problems
Table 2-2 Steps to ensure that all necessary processes are running on
Windows servers (continued)
Step 4 Start avrd and Use the NetBackup Activity Monitor to see if the following processes are running:
processes for robots.
■ avrd (automatic media recognition), only if drives are configured on the server
■ Processes for all configured robots.
See the NetBackup Administrator’s Guide, Volume I.
If these processes are not running, stop and restart the NetBackup Device Manager
service. Use the NetBackup Activity Monitor or the Services application in the
Windows Control Panel.
Troubleshooting procedures 28
Troubleshooting installation problems
Table 2-2 Steps to ensure that all necessary processes are running on
Windows servers (continued)
Step 5 Restart the operation If you had to start any of the processes or services in the previous steps, retry the
or do additional operation.
troubleshooting.
If the processes and services are running or the problem persists, you can try to
test the servers and clients.
If you cannot start any of these processes or services, check the appropriate debug
logs for NetBackup problems.
When these processes and services start, they continue to run unless you stop
them manually or a problem occurs on the system. On Windows systems, it is
recommended that you add commands for starting them to your startup scripts,
so they restart in case you have to restart.
Step 1 Determine if you can Some reasons for failure are as follows:
install the software on
■ Not logged on as an administrator on a Windows system (you must have
the primary server and
permission to install services on the system)
the media servers by
■ Permission denied (ensure that you have permission to use the device and to
using the release
write the directories and files being installed)
media.
■ Bad media (contact Technical Support)
■ Defective drive (replace the drive or refer to vendor’s hardware documentation)
■ Improperly configured drive (refer to the system and the vendor documentation)
Troubleshooting procedures 29
Troubleshooting configuration problems
Note: NetBackup UNIX or Linux servers can push client software to UNIX/Linux
clients, and Windows servers can push to Windows clients. You can also download
the client software from the NetBackup appliance, and then run the install on the
client.
Do the following:
Step 3 Resolve network Determine if the problem is related to general network communications.
problems.
See “Resolving network communication problems with UNIX clients” on page 41.
See “Resolving network communication problems with Windows clients” on page 46.
Step 1 Check for device Check for the following device configuration problems:
configuration problems.
■ Configuration for robotic drive does not specify the robot.
■ Drive is configured as wrong type or density.
■ Incorrect Robotic Drive Number.
■ SCSI ID for the robotic control is specified instead of the logical Robot Number
that is assigned to the robot.
■ The same robot number is used for different robots.
■ SCSI ID for the drive is specified instead of a unique Drive Index number.
■ A platform does not support a device or was not configured to recognize it.
■ Robotic device is not configured to use LUN 1, which some robot hardware
requires.
■ On UNIX, drive no-rewind device path is specified as a rewind path.
■ On UNIX, tape devices are not configured with "Berkeley style close." NetBackup
requires this feature which is configurable on some platforms. Further
explanation is available.
■ On UNIX, tape devices (other than QIC) are not configured as "variable mode."
NetBackup requires this feature which is configurable on some platforms. When
this condition exists, you can frequently perform backups but not restores.
For more information, see the Status Codes Reference Guide.
■ On UNIX, pass-through paths to the tape drives have not been established.
Step 2 Check the daemons or Check for the following problems with the daemons or services:
services.
■ The daemons or services do not start during restart (configure system so they
start).
■ Wrong daemons or services are started (problems with media server startup
scripts).
■ Configuration was changed while daemons or services were running.
■ On Windows, the %SystemRoot%\System32\drivers\etc\services file
does not have an entry for vmd, bprd, bpdbm, and bpcd. Also, ensure that the
processes have entries for configured robots. A list of these processes is
available.
See the NetBackup Administrator’s Guide, Volume I.
■ On UNIX, the /etc/services file (or NIS or DNS) does not have an entry
for vmd, bprd, bpdbm, or robotic daemons.
Troubleshooting procedures 31
Device configuration problem resolution
Step 3 Retry the operation and If you found and corrected any configuration problems, retry the operation and
check for status codes check for NetBackup status codes or messages in the following:
and messages.
■ Check the All Log Entries report for NetBackup errors for the appropriate time
period. This report can show the context in which the error occurred. Often it
provides specific information, which is useful when the error can result from a
variety of problems.
If the problem involved a backup or archive, check the job's Detailed Status in
the Activity Monitor. Also check the Status of Backups report.
If you find a status code or message in either of these reports, perform the
recommended corrective actions.
See the Status Codes Reference Guide.
■ Check the system logs on UNIX or the Event Viewer Application and System
log on Windows if the following is true: The problem pertains to media or device
management, and NetBackup does not provide a status code. Or you cannot
correct the problem by following the instructions in the status codes.
■ Check the appropriate enabled debug logs. Correct any problems you detect.
If these logs are not enabled, enable them before your next try.
See the NetBackup Logging Reference Guide.
Step 4 Retry the operation and If you performed corrective actions, retry the operation. If you did not perform
do additional corrective actions or the problem persists, go to one of the following procedures.
troubleshooting.
See “Resolving full disk problems” on page 87.
See “About the conditions that cause media to freeze” on page 90.
Drive does not support The drive does not return its serial number. Ask the manufacturer for a newer firmware
serialization Note that some manufacturers do not support version that returns serial numbers (if
serial numbers. Although automatic device available), or manually configure and operate
configuration does not function optimally, the the drive without a serial number.
drive can be manually configured and
operated without its serial number.
Robot does not support The robot does not return its serial number or Ask the manufacturer for a newer firmware
serialization the serial numbers of the drives that are version that returns serial numbers (if
contained within it. Note that some available). Or manually configure and operate
manufacturers do not support serial numbers. the robot and drives without serial numbers.
Although automatic device configuration does
not function optimally, the robot and drives
can be manually configured and operated
without serial numbers.
No license for this robot NetBackup server does not support the robotic Define a different robot. Use only the robotic
type type that is defined for this robot. libraries that NetBackup server supports.
No license for this drive The drive type that is defined for this drive that Define a different drive. Use only the drives
type the NetBackup server does not support. that NetBackup supports
Unable to determine NetBackup does not recognize the robotic Do the following:
robot type library. The robotic library cannot be
■ Download a new device_mapping file from
auto-configured.
the Veritas Support website, and try again.
■ Configure the robotic library manually.
■ Use only the robotic libraries that
NetBackup supports.
Drive is standalone or Either the drive is standalone, or the drive or Ask the manufacturer for a newer firmware
in unknown robot robot does not return a serial number. Note version that returns serial numbers (if
that some manufacturers do not support serial available), or manually configure and operate
numbers. Although automatic device the drive robot without serial numbers.
configuration does not function optimally, the
drive or robot can be manually configured and
operated without a serial number.
Robot drive number is Either the drive or robot does not return a Ask the manufacturer for a newer firmware
unknown serial number. Note that some manufacturers version that returns serial numbers (if
do not support serial numbers. Although available). Or manually configure and operate
automatic device configuration does not the drive and robot without serial numbers.
function optimally, the drive or robot can be
manually configured and operated without a
serial number.
Troubleshooting procedures 33
Device configuration problem resolution
Drive is in an The drive is in a robotic library that cannot be Configure a drive that does not reside in the
unlicensed robot licensed for NetBackup server. Since the robot unlicensed robot.
cannot be licensed for NetBackup server, any
drives that were configured in that robot are
unusable.
Drive’s SCSI adapter A drive was found that does not have a SCSI Change the drive’s adapter or define a
does not support pass-through path configured. The possible pass-through path for the drive. For
pass-thru (or pass-thru causes are: information about the SCSI adapter
path does not exist) pass-through, see the NetBackup Device
■ The drive is connected to an adapter that
Configuration Guide.
does not support SCSI pass-through.
■ The pass-through path for this drive has
not been defined.
No configuration device A device has been detected without the For directions about how to create device files,
file exists corresponding device file necessary to see the NetBackup Device Configuration
configure that device. Guide.
Unable to determine The NetBackup server does not recognize the Do the following:
drive type drive. The drive cannot be auto-configured.
■ Download a new device_mapping file from
the Veritas Support website, and try again.
■ Configure the drive manually.
■ Use only the drives that NetBackup
supports.
Unable to determine A drive was detected without the expected If you do not need hardware data
compression device compression device file that is used to compression, no action is necessary. The
configure that device. Automatic device drive can be operated without hardware data
configuration tries to use a device file that compression. Hardware data compression
supports hardware data compression. When and tape drive configuration help are
multiple compression device files exist for a available.
drive, automatic device configuration cannot
For directions about how to create device files,
determine which compression device file is
see the NetBackup Device Configuration
best. It uses a non-compression device file
Guide.
instead.
Troubleshooting procedures 34
Testing the primary server and clients
Table 2-6 Steps for testing the primary server and clients
Step 1 Enable debug logs. Enable the appropriate debug logs on the primary server.
If you do not know which logs apply, enable them all until you solve the problem.
Delete the debug log directories when you have resolved the problem.
Step 2 Configure a test policy. Configure a test policy to use a basic disk storage unit.
Or, configure a test policy and set the backup window to be open while you test.
Name the primary server as the client and a storage unit that is on the primary
server (preferably a nonrobotic drive). Also, configure a volume in the NetBackup
volume pool and insert the volume in the drive. If you don’t label the volume by
using the bplabel command, NetBackup automatically assigns a previously
unused media ID.
Troubleshooting procedures 35
Testing the primary server and clients
Table 2-6 Steps for testing the primary server and clients (continued)
Step 3 Verify the daemons To verify that the NetBackup daemons or services are running on the primary
and services. server, do the following:
/usr/openv/netbackup/bin/bpps -x
■ To check the services on a Windows system, use the NetBackup Activity Monitor
or the Services application of the Windows Control Panel.
Step 4 Backup and restore a Start a manual backup of a policy. Then, restore the backup.
policy. These actions verify the following:
Step 5 Check for failure. If a failure occurs, check the job's Detailed Status in the Activity Monitor.
You can also try the NetBackup All Log Entries report. For the failures that relate
to drives or media, verify that the drive is in an UP state and that the hardware
functions.
Step 6 Consult information If the debug logs do not reveal the problem, check the following:
besides the debug
■ Systems Logs on UNIX systems
logs.
■ Event Viewer and System logs on Windows systems
■ Media Manager debug logs on the media server that performed the backup,
restore, or duplication
■ The bpdm and bptm debug logs on the media server that performed the backup,
restore, or duplication
Table 2-6 Steps for testing the primary server and clients (continued)
Step 7 Verify robotic drives. If you use a robot and the configuration is an initial configuration, verify that the
robotic drive is configured correctly.
In particular, verify the following:
■ The same robot number is used both in the Media and Device Management
and storage unit configurations.
■ Each robot has a unique robot number.
On a UNIX NetBackup server, you can verify only the Media and Device
Management part of the configuration. To verify, use the tpreq command to
request a media mount. Verify that the mount completes and check the drive on
which the media was mounted. Repeat the process until the media is mounted
and unmounted on each drive from the host where the problem occurred. If this
works, the problem is probably with the policy or the storage unit configuration.
When you are done, tpunmount the media.
Step 8 Include a robot in the If you previously configured a nonrobotic drive and your system includes a robot,
test policy. change your test policy now to specify a robot. Add a volume to the robot. The
volume must be in the NetBackup volume pool on the EMM database host for the
robot.
Return to step 3 and repeat this procedure for the robot. This procedure verifies
that NetBackup can find the volume, mount it, and use the robotic drive.
Step 9 Use the robotic test If you have difficulties with the robot, try the test utilities.
utilities.
See “About the robotic test utilities” on page 206.
Do not use the Robotic Test Utilities when backups or restores are active. These
utilities prevent the corresponding robotic processes from performing robotic actions,
such as loading and unloading media. The result is that it can cause media mount
timeouts and prevent other robotic operations like robotic inventory and inject or
eject from working.
Step 10 Enhance the test Add a user schedule to your test policy (the backup window must be open while
policy. you test). Use a storage unit and media that was verified in previous steps.
Troubleshooting procedures 37
Testing the primary server and clients
Table 2-6 Steps for testing the primary server and clients (continued)
Step 11 Backup and restore a Start a user backup and restore of a file by using the client-user interface on the
file. primary server. Monitor the status and the progress log for the operation. If
successful, this operation verifies that the client software is functional on the primary
server.
If a failure occurs, check the NetBackup All Log Entries report. To isolate the
problem further, check the appropriate debug logs from the following list.
Explanations about which logs apply to specific client types are available.
Step 12 Reconfigure the test Reconfigure your test policy to name a client that is located elsewhere in the
policy. network. Use a storage unit and media that has been verified in previous steps. If
necessary, install the NetBackup client software.
Step 13 Create debug log Create debug log directories for the following processes:
directories.
■ bprd on the server
■ bpcd on the client
■ bpbkar on the client
■ nbwin on the client (Windows only)
■ bpbackup on the client (except Windows clients)
■ bpinetd (Windows only)
■ tar
■ On the media server: bpbrm, bpdm, and bptm
Explanations about which logs apply to specific client types are available.
Table 2-6 Steps for testing the primary server and clients (continued)
Step 14 Verify communication Perform a user backup and then a restore from the client that is specified in step
between the client and 8. These actions verify communications between the client and the primary server,
the primary server. and NetBackup software on the client.
If an error occurs, check the job's Detailed Status in the Activity Monitor.
check the All Log Entries report and the debug logs that you created in the
previous step. A likely cause for errors is a communications problem between the
server and the client.
Step 15 Test other clients or When the test policy operates satisfactorily, repeat specific steps as necessary to
storage units. verify other clients and storage units.
Step 16 Test the remaining When all clients and storage units are functional, test the remaining policies and
policies and schedules. schedules that use storage units on the primary server. If a scheduled backup fails,
check the All Log Entries report for errors. Then follow the recommended actions
as is part of the error status code.
Table 2-7 Steps for testing the media server and clients
Step 1 Enable legacy Enable appropriate legacy debug logs on the servers, by entering the following:
debug logs.
UNIX/Linux: /usr/openv/netbackup/logs/mklogdir
Windows: install_path\NetBackup\logs\mklogdir.bat
If you are uncertain which logs apply, enable them all until you solve the problem.
Delete the legacy debug log directories when you have resolved the problem.
Troubleshooting procedures 39
Testing the media server and clients
Table 2-7 Steps for testing the media server and clients (continued)
Step 2 Configure a test Configure a test policy with a user schedule (set the backup window to be open while
policy. you test) by doing the following:
■ Name the media server as the client and a storage unit that is on the media server
(preferably a nonrobotic drive).
■ Add a volume on the EMM database host for the devices in the storage unit. Ensure
that the volume is in the NetBackup volume pool.
■ Insert the volume in the drive. If you do not pre-label the volume by using the
bplabel command, NetBackup automatically assigns a previously unused media
ID.
Step 3 Verify the daemons Verify that all NetBackup daemons or services are running on the primary server. Also,
and services. verify that all Media and Device Management daemons or services are running on the
media server.
To perform this check, do one of the following:
/usr/openv/netbackup/bin/bpps -x
■ On a Windows system, use the Services application in the Windows Control Panel.
Step 4 Backup and Perform a user backup and then a restore of a file from a client that has been verified
restore a file. to work with the primary server.
This test verifies the following:
■ NetBackup media server software.
■ NetBackup on the media server can mount the media and use the drive that you
configured.
■ Communications between the primary server processes nbpem, nbjm, nbrb, EMM
server process nbemm, and media server processes bpcd, bpbrm, bpdm, and bptm.
■ Communications between media server process bpbrm, bpdm, bptm, and client
processes bpcd and bpbkar.
For the failures that relate to drives or media, ensure that the drive is in an UP state
and that the hardware functions.
Step 5 Verify If you suspect a communications problem between the primary server and the media
communication servers, check the debug logs for the pertinent processes.
between the If the debug logs don’t help you, check the following:
primary server and
the media servers. ■ On a UNIX server, the System log
■ On a Windows server, the Event Viewer Application and System log
■ vmd debug logs
Troubleshooting procedures 40
Testing the media server and clients
Table 2-7 Steps for testing the media server and clients (continued)
Step 6 Ensure that the For the failures that relate to drives or media, ensure that the drive is running and that
hardware runs the hardware functions correctly.
correctly.
See the vendor manuals for information on hardware failures.
If you use a robot in an initial configuration condition, verify that the robotic drive is
configured correctly.
In particular, verify the following:
■ The same robot number is used both in the Media and Device Management and
storage unit configurations.
■ Each robot has a unique robot number.
On a UNIX server, you can verify only the Media and Device Management part of the
configuration. To verify, use the tpreq command to request a media mount. Verify
that the mount completes and check the drive on which the media was mounted. Repeat
the process until the media is mounted and unmounted on each drive from the host
where the problem occurred. Perform these steps from the media server. If this works,
the problem is probably with the policy or the storage unit configuration on the media
server. When you are done, use tpunmount to unmount the media.
Troubleshooting procedures 41
Resolving network communication problems with UNIX clients
Table 2-7 Steps for testing the media server and clients (continued)
Step 7 Include a robotic If you previously configured a non-robotic drive and a robot was attached to your media
device in the test server, change the test policy to name the robot. Also, add a volume for the robot to
policy. the EMM server. Verify that the volume is in the NetBackup volume pool and in the
robot.
Start with step 3 to repeat this procedure for a robot. This procedure verifies that
NetBackup can find the volume, mount it, and use the robotic drive.
If a failure occurs, check the NetBackup All Log Entries report. Look for any errors
that relate to devices or media.
In an initial configuration, verify that the robotic drive is configured correctly. Do not
use a robot number that is already configured on another server.
Do not use the Robotic Test Utilities when backups or restores are active. These utilities
prevent the corresponding robotic processes from performing robotic actions, such as
loading and unloading media. The result is that it can cause media mount timeouts
and prevent other robotic operations like robotic inventory and inject or eject from
working.
Step 8 Test other clients When the test policy operates satisfactorily, repeat specific steps as necessary to verify
or storage units. other clients and storage units.
Step 9 Test the remaining When all clients and storage units are in operation, test the remaining policies and
policies and schedules that use storage units on the media server. If a scheduled backup fails,
schedules. check the All Log Entries report for errors. Then follow the suggested actions for the
appropriate status code.
procedure consists of two variations: one for UNIX clients and another for Windows
clients.
Note: In all cases, ensure that your network configuration works correctly outside
of NetBackup before trying to resolve NetBackup problems.
For UNIX clients, perform the following steps. Before you start this procedure, add
the VERBOSE=5 option to the /usr/openv/netbackup/bp.conf file.
Table 2-8 Steps for resolving network communication problems with UNIX
clients
Step 1 Create debug log During communication retries, the debug logs provide detailed debug information, which
directories. can help you analyze the problem.
Create the following directories:
Use the bprd log directory to debug client to primary server communication, not client
to media server communication problems.
Step 2 Test a new If this configuration is a new or a modified configuration, do the following:
configuration or
■ Check any recent modifications to ensure that they did not introduce the problem.
modified
■ Ensure that the client software was installed and that it supports the client operating
configuration.
system.
■ Check the client names, server names, and service entries in your NetBackup
configuration as explained in the following topic:
See “Verifying host name and service entries in NetBackup” on page 73.
You can also use the hostname command on the client to determine the host name
that the client sends with requests to the primary server. Check the bprd debug log
on the primary server to determine what occurred when the server received the
request.
Troubleshooting procedures 43
Resolving network communication problems with UNIX clients
Table 2-8 Steps for resolving network communication problems with UNIX
clients (continued)
Step 3 Verify name To verify name resolution, run the following command on the primary server and the
resolution. media servers:
If the results are unexpected, review the configuration of these name resolution services:
nsswitch.conf file, hosts file, ipnodes file, and resolv.conf file.
Also run the following on the client to check forward and reverse name lookup of the
primary server and media server that perform the backup:
Step 4 Verify network Verify network connectivity between client and server by pinging the client from the server.
connectivity.
# ping clientname
Where clientname is the name of the client as configured in the NetBackup policy
configuration.
# ping ant
ant.nul.nul.com: 64 byte packets
64 bytes from 199.199.199.24: icmp_seq=0. time=1. ms
----ant.nul.nul.com PING Statistics----
2 packets transmitted, 2 packets received, 0% packet
loss round-trip (ms) min/avg/max = 1/1/1
A successful ping verifies connectivity between the server and client. If the ping fails
and ICMP is not blocked between the hosts, resolve the network problem outside of
NetBackup before you proceed.
Some forms of the ping command let you ping the bpcd port on the client as in the
following command:
Ping 1556 (PBX) and 13724 (vnetd) in sequence, the same sequence that NetBackup
tries by default. You then know which ports are closed so that you can open them for
more efficient connection tries.
Troubleshooting procedures 44
Resolving network communication problems with UNIX clients
Table 2-8 Steps for resolving network communication problems with UNIX
clients (continued)
Step 5 Ensure that the On the client, run one of the following commands (depending on platform and operating
client listens on the system):
correct port for the
bpcd connections. netstat -a | grep bpcd
netstat -a | grep 13782
rpcinfo -p | grep 13782
Repeat for 1556 (PBX) and 13724 (vnetd). If no problems occur with the ports, the
expected output is as follows:
LISTEN indicates that the client listens for connections on the port.
If the NetBackup processes are running correctly, the expected output is as follows:
Repeat the procedure on the primary server(s) and media server(s), to test communication
to the client.
Troubleshooting procedures 45
Resolving network communication problems with UNIX clients
Table 2-8 Steps for resolving network communication problems with UNIX
clients (continued)
Step 6 Connect to the On the client, telnet to 1556 (PBX) and 13724 (vnetd). Check both ports to make sure
client through that a connection is made on at least one of them. If the telnet connection succeeds,
telnet. keep the connection until after you perform step 8, then terminate it with Ctrl-c.
Where clientname is the name of the client as configured in the NetBackup policy
configuration.
For example,
Repeat the procedure on the primary server(s) and media server(s), to test communication
to the client.
Step 7 Identify the On the primary server(s) and media server(s): Use the following command to identify the
outbound socket outbound socket that is used for the telnet command from step 6. Specify the appropriate
on the server host. IP address to which the server resolves the policy client. Note the source IP
(10.82.105.11), the source port (45856) and the destination port (1556).
If telnet is still connected and a socket is not displayed: Remove the port number
filtering and observe the port number to which the site has mapped the service name.
Check that process listens on the port number in step 5.
If the socket is in a SYN_SENT state instead of an ESTABLISHED state, the server host
is trying to make the connection. However, a firewall blocks the outbound TCP SYN from
reaching the client host or blocks the return bound TCP SYN+ACK from reaching the
server host.
Troubleshooting procedures 46
Resolving network communication problems with Windows clients
Table 2-8 Steps for resolving network communication problems with UNIX
clients (continued)
Step 8 Confirm that the On the primary server(s) and media server(s), to confirm that the telnet connection
telnet connection reaches this client host, run the following command:
reaches this client
host. $ netstat -na | grep ‘<source_port>’
10.82.104.99.1556 10.82.105.11.45856 49152 0 49152 0 ESTABLISHED
■ If telnet is connected but the socket is not present: The telnet reached some other
host that incorrectly shares the same IP address as the client host.
■ If the socket is in a SYN_RCVD state instead of an ESTABLISHED state, then the
connection reached this client host. However, a firewall blocks the return of the TCP
SYN+ACK to the server host.
Step 9 Verify To verify client to primary server communications, use the bpclntcmd utility. When -pn
communication and -sv run on a NetBackup client, they initiate inquiries to the NetBackup primary server
between the client (as configured in the client bp.conf file). The primary server then returns information
and the primary to the requesting client. More information is available about bpclntcmd.
server.
See “About the bpclntcmd utility” on page 84.
The PBX, vnetd, and bprd debug logs should provide details on the nature of any
remaining failure.
Note: In all cases, ensure that your network configuration works correctly outside
of NetBackup before trying to resolve NetBackup problems.
This procedure helps you resolve network communication problems with PC clients.
To resolve network communication problems
1 Before you retry the failed operation, do the following:
Troubleshooting procedures 47
Resolving network communication problems with Windows clients
■ Increase the logging level on the client (see the NetBackup Administrator's
Guide, Volume I, under "Client Settings properties").
■ On the NetBackup primary server, create a bprd debug log directory and
on the clients create a bpcd debug log.
■ On the NetBackup server, set the Verbose level to 1.
See the NetBackup Logging Reference Guide for help changing the logging
level.
2 If this client is new, verify the client and the server names in your NetBackup
configuration.
See “Verifying host name and service entries in NetBackup” on page 73.
3 Verify network connectivity between client and server by pinging from the server
to the client and vice versa. Use the following command:
# ping hostname
%SystemRoot%\system32\drivers\etc\services
(Windows)
Correct the port number if necessary. Then, on Windows clients and servers,
stop and restart the NetBackup Client service.
Do not change NetBackup port assignments unless it is necessary to resolve
conflicts with other applications. If you do change them, do so on all
NetBackup clients and servers. These numbers must be the same
throughout your NetBackup configuration.
Troubleshooting procedures 49
Resolving network communication problems with Windows clients
5 Verify that the NetBackup Request Service (bprd) port number on Microsoft
Windows is the same as on the server (by default, 13720). Do one of the
following:
Verify that the setting on the Network tab matches the one in
the services file. The services file is located in:
%SystemRoot%\system32\drivers\etc\services
(Windows)
Windows NetBackup Set these numbers in the Client Properties dialog box in the
servers Host Properties window.
6 Verify that the hosts file or its equivalent contains the NetBackup server name.
The hosts files are the following:
Windows %SystemRoot%\system32\drivers\etc\hosts
UNIX /etc/hosts
■ Host IDs must be mapped for host names on all NetBackup 8.1 and later hosts.
The NetBackup global security settings configure how NetBackup maps host
IDs to name.
Verify the global settings under Security Management in the NetBackup
Administration Console. Alternatively, you can use the following command
and option:
Windows:
install_path\Veritas\NetBackup\bin\admincmd\nbseccmd
-getsecurityconfig -autoaddhostmapping
UNIX:
/usr/openv/netbackup/bin/admincmd/nbseccmd -getsecurityconfig
-autoaddhostmapping
■ For NetBackup hosts earlier than 8.1, you must allow insecure communication.
The NetBackup global security settings configure if NetBackup can communicate
with hosts earlier than 8.1.
Verify the global settings under Security Management in the NetBackup
Administration Console. Alternatively, you can use the following command
and option:
Windows:
install_path\Veritas\NetBackup\bin\admincmd\nbseccmd
-getsecurityconfig -insecurecommunication
UNIX:
/usr/openv/netbackup/bin/admincmd/nbseccmd -getsecurityconfig
-insecurecommunication
■ The NetBackup web services on the primary server must be active. To confirm
that they are active, use the following NetBackup command and option:
Windows: install_path\Veritas\NetBackup\bin\nbcertcmd -ping
UNIX: /usr/openv/netbackup/bin/nbcertcmd -ping
■ If the primary server is configured to use external CA-signed certificates, the
hosts should enroll their external CA-signed certificates with the appropriate
primary server domain.
For more information on external CA support and certificate enrollment, refer
to the NetBackup Security and Encryption Guide.
Troubleshooting procedures 52
Troubleshooting vnetd proxy connections
For Auto Image Replication, host ID-based certificates from the source primary
server are required on all of the trusted primary servers in the destination domains.
If the primary server is configured to use external CA-signed certificates, ensure
that trust is established between the source and target primary servers using external
CA-signed certificates.
For more information, see the NetBackup Deduplication Guide.
If the failure is not during a NetBackup job, examine the exit status of the operation
for the status codes of interest. Also examine the debug logs for the processes that
are involved in the operation. Look first at the command that initiated the operation
or the service that performed the request.
You can find the status codes described in the following:
■ The NetBackup Status Codes Reference Guide.
■ The NetBackup Administration Console help.
■ The Troubleshooter in the NetBackup Administration Console.
If a job did not run, verify that the vnetd process and its proxies are active.
$ bpps
…output shortened…
root 13577 1 0 Jun27 ? 00:00:04 /usr/openv/netbackup/bin/vnetd -standalone
root 13606 1 0 Jun27 ? 00:01:55 /usr/openv/netbackup/bin/vnetd -proxy inbound_proxy
-number 0
root 13608 1 0 Jun27 ? 00:00:06 /usr/openv/netbackup/bin/vnetd -proxy outbound_proxy
Troubleshooting procedures 53
Troubleshooting vnetd proxy connections
-number 0
root 13610 1 0 Jun27 ? 00:00:06 /usr/openv/netbackup/bin/vnetd -proxy http_tunnel
Depending on which vnetd process or proxy is or is not running, try the following:
■ If the vnetd process (-standalone) is not running, start it.
■ If the vnetd process is running, examine the vnetd debug log to confirm that it
tries to start the proxies.
■ If the vnetd process tries to start the inbound and the outbound proxies: Examine
the proxy log file to determine why the proxy does not listen for connections.
Use the nbpxyhelper short component name or its originator ID 486 with the
vxlogview command.
■ If the vnetd process tries to start the HTTP tunnel proxy, examine the HTTP
tunnel proxy log. Use the nbpxytnl short component name or its originator ID
490 with the vxlogview command.
If the vnetd process and its proxies are active, determine if the connections are
proxied.
1 1 0
127.0.0.1:42553 -> 127.0.0.1:52236 PROXY 10.81.41.245:895 -> 10.81.40.148:1556
127.0.0.1:35386 -> 127.0.0.1:49429 PROXY 10.81.41.245:51325 -> 10.81.40.148:1556
Conversely, the following example shows a connection test to the same media
server that fails after its security certificate was revoked:
NetBackup hosts must have a valid host ID-based security certificate and a valid
certificate revocation list so they can communicate with other NetBackup hosts.
Troubleshooting procedures 55
Troubleshooting vnetd proxy connections
The lack of either prevents communication. In this case, you can look up status
code 7653 to find the explanation for and recommended action to recover from the
error.
UNIX:
/usr/openv/netbackup/bin/bpclntcmd -pn -verbose
If the vnetd proxy connections are active, examine the log files of the connecting
and accepting processes
Troubleshooting procedures 56
Troubleshooting vnetd proxy connections
A NetBackup host’s names must be mapped to its host ID. If a host name is not
mapped properly in NetBackup, communication fails. In this case, you can look up
status code 7648 to find the explanation for and recommended action to recover
from the error.
If you do not find an indication of a problem by examining the connecting process
and accepting process log files, examine the vnetd proxy log files. You can use
the connection IDs to find relevant information.
The following is the NetBackup vxlogview command syntax to view the inbound
and the outbound proxy log file using the short component name:
Troubleshooting procedures 57
Troubleshooting security certificate revocation
Windows: install_path\Veritas\NetBackup\bin\vxlogview –p NB –i
nbpxyhelper
Note: On Windows, omit the single quote marks from the connection ID string.
Symptoms:
■ Cloud Storage creation fails.
■ Backup job fails because the cloud storage server is down.
Causes:
■ The certificate is revoked, NetBackup does not connect to the cloud provider.
■ The CRL file failed to download.
Resolution:
■ If the problem is a CRL verification failure, contact your security administrator
■ If the problem is a download failure, verify the firewall settings. Refer to the
NetBackup Cloud Administrator’s Guide and ensure that you have met all the
requirements for CRL.
Troubleshooting procedures 59
Troubleshooting security certificate revocation
If the NetBackup host is configured to download CRLs from CDPs, CRLs are
refreshed as per ECA_CRL_REFRESH_HOURS.
For more information about external certificate configuration options for CRLs and
the global security settings, see the NetBackup Security and Encryption Guide.
Cause
The cause may be one of the following reasons:
■ The security certificate of the client is revoked.
■ The security certificate of the media server that backs up the client is revoked.
■ The security certificate of the primary server is revoked.
■ The CRL on the client, media server, or the primary server is corrupted or
missing.
Resolution
1. Examine the job details for the following message strings and adjacent status
codes:
■ For certificate revocation, look for the message strings that contain
certificate and revoked.
■ For the CRL, look for the message strings that contain certificate
revocation list or CRL and missing , corrupted, or unavailable.
2. If necessary, determine if the client or the media server certificate was revoked.
See “Determining a NetBackup host's certificate state” on page 64.
3. If an external CA-signed certificate is used, refer to the external certificate
section:
See “Troubleshooting issues with external CA-signed certificate revocation”
on page 67.
Troubleshooting procedures 61
Troubleshooting security certificate revocation
4. Refer to the NetBackup documentation for the explanations for the status codes
and recommended actions for recovery. If possible, resolve the issue.
5. If you cannot resolve the issue in a timely fashion, remove the revoked host
from the backup policy or deactivate the policy. If the revoked host is the media
server, deactivate it. (You can ignore “NetBackup version” errors when you
deactivate the host.)
6. In case of NetBackup CA-signed certificate, after you resolve the security issue,
reissue the certificate for the revoked host. Certificate reissue is documented
in the NetBackup Security and Encryption Guide.
7. If necessary, add the client back to the backup policy, activate the backup
policy, or activate the media server.
Cause
The host certificate of a NetBackup client or the media server that backs it up may
be revoked. Also, the CRL on the client or the media server may be out-of-date,
missing, or corrupt. Therefore, the client or the media server cannot determine that
a host certificate is revoked. The job runs but communication fails and appears as
a network error.
Resolution
1. Determine if the client or the media server certificate was revoked.
See “Determining a NetBackup host's certificate state” on page 64.
2. Optionally, verify the cause by doing one of the following:
■ Log onto the revoked host and examine the vnetd proxy log file. Look for
the message strings that contain the following:
■ PEER_HOST_PROTOCOL_ERROR
Cause
The security certificate of the media server that backs up or restores the client is
revoked. Or for disk-based storage, the certificate of the storage server may be
revoked.
Resolution
1. Determine the state of the security certificate on the client and the media server
or the storage server.
See “Determining a NetBackup host's certificate state” on page 64.
2. Depending on which host has the revoked certificate, do one of the following:
■ If the revoked host is a client, remove it from the backup policy or deactivate
the policy.
■ If the revoked host is the media server or a storage server, deactivate it.
(You can ignore “NetBackup version” errors when you deactivate the host.)
If possible, change the storage unit to use a different media server or storage
server.
3. Investigate the revoked host to determine the security issue and then resolve
the issue.
Troubleshooting procedures 63
Troubleshooting security certificate revocation
Verify a host certificate from The method uses the NetBackup nbcertcmd command.
the host itself
See “To verify the host's certificate state from the host”
on page 65.
Verify a host certificate from The method uses the NetBackup bptestbpcd command.
a NetBackup server
See “To verify from a NetBackup server if a different host’s
certificate is revoked” on page 65.
Verify a host certificate from See “To verify a host’s certificate” on page 66.
the NetBackup
Administration Console
Troubleshooting procedures 65
Troubleshooting security certificate revocation
To get a CRL from a NetBackup domain other than the default, specify the
-server primary_server_name option and argument.
-cluster Use this option on the active node of a NetBackup primary server cluster
to verify the certificate of the virtual host.
3 Examine the command output. The output indicates that either the certificate
is or is not revoked.
To verify from a NetBackup server if a different host’s certificate is revoked
1 As an administrator on the NetBackup primary server or a NetBackup media
server, run the following command:
UNIX: /usr/openv/netbackup/bin/admincmd/bptestbpcd –host hostname
-verbose
For –host hostname, specify the host for which you want to verify the certificate.
2 Examine the command output. If the certificate on the specified host is revoked,
the command output includes the string The Peer Certificate is revoked.
If the command output does not include that string, the certificate is valid.
Troubleshooting procedures 66
Troubleshooting security certificate revocation
Verify a host See “To verify a host certificate from the host itself” on page 66.
certificate from
the host itself
Verify a host See “To verify from a NetBackup server if a different host’s certificate is
certificate from revoked” on page 67.
a NetBackup
server
Use the -cluster option on the active node of a clustered primary server to
verify the certificate of the virtual name.
3 Examine the command output. The output indicates whether the certificate is
revoked or not.
Troubleshooting procedures 67
Troubleshooting security certificate revocation
For -host hostname, specify the host for which you want to verify the certificate.
2 Examine the command output. If the certificate on the specified host is revoked,
the command output includes the string 'The Peer Certificate is revoked'. If the
command output does not include that string, the certificate is valid.
For more details, refer to the About certificate revocation lists for external CA chapter
from the NetBackup Security and Encryption Guide.
Symptom
The certificate revocation list is unavailable (NetBackup status code - 5982)
Cause
■ The NetBackup is not configured with correct CRL path or the certificate does
not contain valid CDP.
■ The host does not have a CRL cached in the NetBackup CRL cache.
Resolution
1 If the ECA_CRL_PATH setting is specified in the NetBackup configuration file,
ensure the following:
■ ECA_CRL_PATH has the correct CRL directory path
■ CRL directory contains CRLs for all required certificate issuers (based on
the ECA_CRL_CHECK setting)
If the CDP is used (ECA_CRL_PATH is not specified)
■ Ensure that the certificate has at least one CDP (with HTTP/HTTPS protocol)
that points to a CRL that includes revocation information for all reasons.
Troubleshooting procedures 68
Troubleshooting security certificate revocation
2 Ensure that the CRL is valid in the directory specified for ECA_CRL_PATH or at
CDP location.
■ CRL is in PEM or DER format.
■ CRL is not expired.
■ CRL is not a delta CRL.
■ CRL's last update date is not in future.
4 Examine the required CRLs are available in the NetBackup CRL cache at the
following location:
UNIX:/usr/openv/var/vxss/crl
Windows: install_path\NetBackup\var\vxss\crl
5 If the issue persists, examine bpclntcmd logs at the following location:
UNIX: /usr/openv/netbackup/logs/bpclntcmd
Windows: install_path\NetBackup\logs\bpclntcmd
Symptom
The NetBackup is functioning correctly even if the certificate is revoked or the
NetBackup operations are failing with the error ‘certificate is revoked’ even if the
certificate is not revoked.
Cause
The NetBackup host’s CRL cache is not updated.
Troubleshooting procedures 69
About troubleshooting networks and host names
Resolution
1 Verify if the CRLs at the following location are updated:
UNIX: /usr/openv/var/vxss/crl
Windows: install_path\NetBackup\var\vxss\crl
If not, cleanup the cached CRLs for issuers in the certificate chain as per the
ECA_CRL_CHECK setting.
The peer name is derived from the IP address of the connection. This means that
the address must translate into a host name (using the getnameinfo() network
routine). This name is visible in the bpcd or bprd debug log when a connection is
made as in the line:
On a client, the peer host name for the connecting server must match a server or
a media server entry in the local NetBackup configuration: Either as a string match
or by comparison with the getaddrinfo() information for each server entry.
On the primary server, the comparison is more complex.
The client’s configured name is then derived from the peer name by querying the
bpdbm process on UNIX/Linux hosts, or the NetBackup Database Manager service
on Windows hosts.
The bpdbm process compares the peer name to a list of client names that are
generated from the following:
■ All clients for which a backup was run
■ All clients in all policies
The comparison is first a string comparison. The comparison is verified by comparing
the peer name to the list of client names.
If none of the comparisons succeed, a more brute force method is used, which
compares all names and aliases that are found using getaddrinfo() for each client
name in the list.
The configured name is the first comparison that succeeds.
If the comparison fails, in most cases bprd replaces the requesting client (as follows)
with the peer name because the host names in the request are not under
administrative control like the network and NetBackup configurations.
An example of a failed comparison:
The client has a new network interface and has changed the first server entry to
take advantage of the new network. The name services on the primary server
resolve the new source IP of the client to a peer name that is not a network alias
for any client in any policies.
These comparisons are recorded in the bpdbm debug log if VERBOSE is set. You can
determine a client’s configured name by using the bpclntcmd command on the
client. For example:
# /usr/openv/netbackup/bin/bpclntcmd -pn (UNIX)
Troubleshooting procedures 71
About troubleshooting networks and host names
Where the first output line identifies the server to which the request is directed. The
second output line is the server’s response in the following order:
■ Peer name of the connection to the server
■ Configured name of the client
■ IP address of the connection to the server
■ Source IP address of the connection to the server
When the client connects to the server, it sends the following three names to the
server:
■ Browse client
■ Requesting client
■ Destination client
The browse client name is used to identify the client files to list or restore from. The
user on the client can modify this name to restore files from another client. For
example, on a Windows client, the user can change the client name by using the
Backup, Archive, and Restore interface. (See the NetBackup online Help for
instructions). For this change to work, however, the administrator must also have
made a corresponding change on the server.
See the NetBackup Administrator’s Guide, Volume I.
The requesting client is the value from either CLIENT_NAME or the gethostname()
function on the client.
The destination client name is a factor only if an administrator pushes a restore to
a client from a server. For a user restore, the destination client and the requesting
client are the same. For an administrator restore, the administrator can specify a
different name for the destination client.
By the time these names appear in the bprd debug log, the requesting client name
has been translated into the client’s configured name.
The name that is used to connect back to the client to complete the restore is either
the client’s peer name or its configured name. The type of restore request (for
example, from root on a server, from a client, to a different client, and so on)
influences this action.
When you modify client names in NetBackup policies to accommodate specific
network paths, the administrator needs to consider:
Troubleshooting procedures 72
About troubleshooting networks and host names
■ The client name as configured on the client. For example, on UNIX the client
name is CLIENT_NAME in the client’s bp.conf file. On a Windows client, it is on
the General tab of the NetBackup Client Properties dialog box. To open this
dialog box, select NetBackup Client Properties from the File menu in the
Backup, Archive, and Restore interface.
■ The client as currently named in the policy configuration.
■ The client backup and archive images that already exist as recorded in the
images directory on the primary server. On a UNIX server, the images directory
is /usr/openv/netbackup/db/images. On a Windows NetBackup server, the
images directory is install_path\NetBackup\db\images.
Any of these client names can require manual modification by the administrator if
the following is true: a client has multiple network connections to the server and list
or restore requests from the client fail because of a connection-related problem.
The traceroute (UNIX) and tracert (Windows) programs can often provide
valuable information about the configuration of the network.
The primary server may be unable to reply to client requests, if the Domain Name
Services (DNS) are used and the following is true: The name that the client obtains
through its gethostname() library is unknown to the DNS on the primary server.
The client and the server configurations can determine if this situation exists. The
gethostname() function on the client may return an unqualified host name that the
DNS on the primary server cannot resolve.
Although you can reconfigure name services, including the hosts file, this solution
is not always desirable. For this reason, NetBackup provides a special file on the
primary server. This file is as follows:
/usr/openv/netbackup/db/altnames/host.xlate (UNIX)
install_path\NetBackup\db\altnames\host.xlate (Windows)
You can create and edit this file to force the desired translation of NetBackup client
host names.
Each line in the host.xlate file has three elements: a numeric key and two host
names. Each line is left justified, and a space character separates each element
of the line.
0 danr danr.eng.aaa.com
When the primary server receives a request for a configured client name (numeric
key 0), the name always replaces the peer name.
You can also make these changes on the appropriate tabs in the properties dialog boxes on
a Windows NetBackup server
See “Using the Host properties to access configuration settings” on page 87.
On UNIX NetBackup Check the server and the client name entries in the bp.conf file by doing the following:
servers and clients
■ Ensure that a SERVER entry exists for the primary server and each media server in the
configuration. The primary server must be the first name in the list.
If you add or modify SERVER entries on the primary server, stop and restart bprd and
bpdbm before the changes take effect.
■ The bp.conf of the primary server does not require the addition of other clients, other than
the primary server as CLIENT_NAME = primary server name. The name is added
by default.
UNIX client users can also have a personal bp.conf file in their home directory. A
CLIENT_NAME option in $HOME/bp.conf overrides the option in
/usr/openv/netbackup/bp.conf.
Troubleshooting procedures 75
Verifying host name and service entries in NetBackup
On the primary server Verify that you have created any of the following required files:
2 Verify that each server and client have the required entries for NetBackup
reserved port numbers.
The following examples show the default port numbers.
See “Example of host name and service entries on UNIX primary server and
client” on page 77.
See “Example of host name and service entries on UNIX primary server and
media server” on page 79.
See “Example of host name and service entries on UNIX PC clients”
on page 81.
See “Example of host name and service entries on UNIX server that connects
to multiple networks” on page 82.
Do not change NetBackup port assignments unless it is necessary to resolve
conflicts with other applications. If you do change them, do so on all NetBackup
clients and servers. These numbers must be the same throughout your
NetBackup configuration.
3 On NetBackup servers, check the services files to ensure that they have entries
for the following:
■ bpcd and bprd
■ vmd
■ bpdbm
On UNIX clients Check the bprd and the bpcd entries in the /etc/services
file.
Troubleshooting procedures 76
Verifying host name and service entries in NetBackup
On Microsoft Verify that the NetBackup Client Service Port number and
Windows clients NetBackup Request Service Port number match settings in the
services file by doing the following:
The values on the Network tab are written to the services file
when the NetBackup Client service starts.
%SystemRoot%\system32\drivers\etc\services
4 On UNIX servers and clients, ensure that the bpcd -standalone process is
running.
5 On Windows servers and clients, verify that the NetBackup Client service is
running.
6 If you use NIS in your network, update those services to include the NetBackup
information that is added to the /etc/services file.
7 NIS, WINS, or DNS host name information must correspond to what is in the
policy configuration and the name entries. On Windows NetBackup servers
and Microsoft Windows clients, do the following:
■ Check the General tab:
Start the Backup, Archive, and Restore interface on the client. On the
File menu, click NetBackup Client Properties. In the NetBackup Client
Properties dialog box, click the General tab.
■ Check the Server to use for backups and restores drop-down list:
Start the Backup, Archive, and Restore interface on the client. On the
File menu, click Specify NetBackup Machines and Policy Type. In the
Specify NetBackup Machines and Policy Type dialog box, click the
Server to use for backups and restores drop-down list.
■ Check the bp.conf file on UNIX servers and clients.
Troubleshooting procedures 77
Verifying host name and service entries in NetBackup
8 Use the bpclntcmd utility to confirm the setup of the IP addresses and host
names in DNS, NIS, and local hosts files on each NetBackup node.
Note: FT (Fibre Transport) target devices are named based on the host name
or domain name response from the device. If any alternate computer names
for different VLAN network interface names appear in the
SERVER/MEDIA_SERVER entries of the DNS (Domain Name System) or the
host files, the primary name must appear first.
UNIX
master jupiter
server
Ethernet
usr/openv/netbackup/bp.conf
SERVER=jupiter
CLIENT_NAME=jupiter
usr/openv/netbackup/bp.conf
SERVER=jupiter
bpcd -standalone CLIENT_NAME=mars
/etc/services
#NetBackup services bpcd -standalone
bpcd 13782/tcp bpcd
bprd 13720/tcp bprd
bpdbm 13721/tcp bpdbm
/etc/services
#NetBackup services
#Volume Manager services
bpcd 13782/tcp bpcd
vmd 13701/tcp vmd
bprd 13720/tcp bprd
UNIX UNIX
master jupiter media saturn
server server
Ethernet
UNIX
Policy Client List mars
client
jupiter
mars
saturn
UNIX
master jupiter
server
Ethernet
Windows
Policy Client List mars saturn
Client
jupiter
mars
saturn
UNIX
UNIX
mars media saturn
client
server
Ethernet
jupiter UNIX
usr/openv/netbackup/bp.conf master
meteor server
SERVER=jupiter
SERVER=meteor
SERVER=saturn Ethernet
CLIENT_NAME=mars
SERVER=jupiter SERVER=jupiter
SERVER=meteor SERVER=meteor
SERVER=saturn SERVER=saturn
CLIENT_NAME=jupiter CLIENT_NAME=pluto
/etc/services /etc/services
#NetBackup services #NetBackup services
bpcd 13782/tcp bpcd bpcd 13782/tcp bpcd
bprd 13720/tcp bprd bprd 13720/tcp bprd
bpdbm 13721/tcp bpdbm
Windows install_path\NetBackup\bin
UNIX /usr/openv/netbackup/bin
The bpclntcmd options that are useful for testing the functionality of the host name
and IP address resolution are -ip, -hn, -sv, and -pn. The following topics explain
each of these options:
The -sv option displays the NetBackup version number on the primary
server.
-pn When the -pn option is run on a NetBackup client, it initiates an inquiry to
the NetBackup primary server. The server then returns information to the
requesting client. First, the server is the first server in the server list. Then
it displays the information that the server returns. The information the server
returns is from the perspective of the primary server and describes how the
primary server sees the connecting client. For example:
bpclntcmd -pn
expecting response from server rabbit.friendlyanimals.com
dove.friendlyanimals.com dove 123.145.167.3 57141
-verbose Use with the -pn option to display more details about the connection and
the host certificates used. The following is an example of the output:
Use -ip and -hn to verify the ability of a NetBackup node to resolve the IP addresses
and host names of other NetBackup nodes.
For example, to verify that a NetBackup server can connect to a client, do the
following:
■ On the NetBackup server, use bpclntcmd -hn to verify the following: The
operating system can resolve the host name of the NetBackup client (as
configured in the client list for the policy) to an IP address. The IP address is
then used in the node’s routing tables to route a network message from the
NetBackup server.
■ On the NetBackup client, use bpclntcmd -ip to verify that the operating system
can resolve the IP address of the NetBackup server. (The IP address is in the
message that arrives at the client’s network interface.)
5 Select the property that you want to edit and make the changes.
2 Use the Activity Monitor to verify that the NetBackup Scale-Out Relational
Database Manager service is running.
This service is the vrtsdbsvc_psql daemon on UNIX.
3 If the NetBackup Scale-Out Relational Database Manager is stopped, note the
following:
■ Do not stop the nbrb service. If you stop the nbrb service while the
NetBackup relational database service is down, it can result in errors.
■ Restart the NetBackup Scale-Out Relational Database Manager service.
UNIX ■ The bptm log from the media servers that froze the media:
/usr/openv/netbackup/logs/bptm
Windows ■ The bptm log from the media servers that froze the media:
install_dir\VERITAS\NetBackup\logs\bptm
Set the verbosity of the bptm process log to 5 to troubleshoot any media and
drive-related issues. This log does not use excessive drive space or resources even
at an elevated verbosity. When media is frozen, the bptm logs may contain more
detailed information that the Activity Monitor or Problems Report. Set the verbosity
for bptm on individual media servers by changing their logging levels under Host
Properties on the NetBackup Administration Console.
See “Frozen media troubleshooting considerations” on page 89.
See “About the conditions that cause media to freeze” on page 90.
Dirty drives Clean the drives that are freezing the media according to the
manufacturer's suggestions. Frozen media is one of the first
symptoms of a dirty drive.
The drive itself Check for the tape device errors that the operating system logs
or the device driver reports. If any are found, follow the hardware
manufacturer's recommendations for this type of error.
Communication issues Check for SCSI or HBA device errors the operating system logs
at the SCSI or host bus or the device driver reports. If any are found, follow the hardware
adapter (HBA) level manufacturer's recommendations for this type of error.
Drive not supported Ensure that the tape drives appear on the hardware compatibility
list as supported for NetBackup. This list is located on the
following Veritas Support website:
www.veritas.com/docs/TECH59978
Troubleshooting procedures 91
Frozen media troubleshooting considerations
Media not supported Ensure that the media is supported for use with the tape drive
by the tape drive vendor.
These library tapes may have been written outside of NetBackup. By default,
NetBackup only writes to a blank media or other NetBackup media. Other media
types (DBR, TAR, CPIO, ANSI, MTF1, and recycled Backup Exec BE-MTF1
media) are frozen as a safety measure. Change this behavior by using the
following procedure:
Troubleshooting procedures 92
Frozen media troubleshooting considerations
ALLOW_MEDIA_OVERWRITE = DBR
ALLOW_MEDIA_OVERWRITE = TAR
ALLOW_MEDIA_OVERWRITE = CPIO
ALLOW_MEDIA_OVERWRITE = ANSI
ALLOW_MEDIA_OVERWRITE = MTF1
ALLOW_MEDIA_OVERWRITE = BE-MTF1
Stop and restart the NetBackup daemons for the changes to take
effect.
Do not select a foreign media type for overwriting unless you are
sure that you want to overwrite this media type.
For more details about each media type, see the NetBackup Device
Configuration Guide.
■ The media is a tape formerly used for the NetBackup catalog backup. For
example, the log entry may be the following:
The media is frozen because it is an old catalog backup tape which NetBackup
does not overwrite by default. The bplabel command must label the media to
reset the media header.
■ The media is intentionally frozen. You can use the bpmedia command to manually
freeze media for a variety of administrative reasons. If no record exists of a
specific job freezing the media, the media may have been frozen manually.
■ The media is physically write protected. If the media has a write-protect notch
that is set for write protection, NetBackup freezes the media.
To unfreeze frozen media, enter the following bpmedia command:
Troubleshooting procedures 93
Troubleshooting problems with the NetBackup web services
The media_server variable is the one that froze the media. If this item is unknown,
run the bpmedialist command and note the "Server Host:" listed in the output.
The following example shows that media server denton froze media div008:
# bpmedialist -m div008
/usr/openv/netbackup/bin/bpps -x
install_path/netbackup/bin/nbwmc -terminate
install_path/netbackup/bin/nbwmc
3 Review the NetBackup web server logs and web application logs.
See “Viewing NetBackup web services logs” on page 94.
Troubleshooting procedures 94
Troubleshooting problems with the NetBackup web services
See the following tech note for the web server tasks you must perform before
installing the primary server:
https://www.veritas.com/support/en_US/article.000081350
usr/openv/wmc/webserver/logs
install_path\NetBackup\wmc\webserver\logs
■ The NetBackup web application logs use unified logging. These logs are written
to the following location.
usr/openv/logs/nbwebservice
install_path\NetBackup\logs\nbwebservice
Cause
Check the web server logs at the following location:
install_path/wmc/webserver/logs/catalina.log
The root cause can be: The keystore of the external CA used by the NetBackup
web service is tampered or deleted.
Solution
■ Verify that NetBackup Web Management Console service is running.
Run the following command:
On UNIX: /usr/openv/netbackup/bin/bpps -x
On Windows: Use the NetBackup Activity Monitor or the services application of
the Windows Control Panel.
■ If the status is FAIL, reconfigure the external certificate by executing the following
command:
On Windows:Install path\netbackup\wmc\bin\configureWebServerCerts
-addExternalCert -nbHost -certPath file_path -privateKeyPath
file_path -trustStorePath file_path
On Unix:/usr/openv/netbackup/bin/configureWebServerCerts
-addExternalCert -nbHost -certPath file_path -privateKeyPath
file_path -trustStorePath file_path
Problem
External certificate is not configured.
Cause
The issue can occur because of the following:
■ Invalid certificate, private key, or trust store.
Error message : The certificate could not be added. Please check the
configureWebServerCerts logs.
■ Certificate does not contain server name in the subject alternative name (SAN)
of the certificate.
Location: <install
dir>/NetBackup/wmc/webserver/logs/configureWebServerCerts.log
■ Review the log messages:
■ If the logs have the following message:
unable to load private key 22308:error:0906D06C:PEM
routines:PEM_read_bio:no start
line:.\crypto\pem\pem_lib.c:697:Expecting: ANY PRIVATE KEY Could
not export certificates in PKCS#12 format, 1.
The private key does not t match the private key of the certificate that is
provided.
Provide the appropriate private key.
■ If the logs have following message:
Error occurred while adding certificate to keystore. Exception:
java.security.cert.CertificateParsingException: signed overrun,
bytes = 918 Exiting.. Could not import CA certificates in JAVA
keystore, -1.
The file path that is provided for the -trustStorePath option is not a valid
file path or a valid trust store CA certificate is not present at the given file
path.
Provide the trust store bundle path for the -trustStorePath option.
Note: The host name is the name provided for the primary server at the time of
installation. Host name can be found in the setenv file with the NB_HOSTNAME
property.
Location of the file:
On UNIX : /usr/openv/wmc/bin/setenv
On Windows: install_path\Veritas\NetBackup\wmc\bin\setenv
/usr/openv/logs/nbcert
/usr/openv/wmc/webserver/logs/configureCerts.log
/usr/openv/logs/nbatd
install_path\NetBackup\logs\nbcert
C:\ProgramData\Veritas\NetBackup\InstallLogs\WMC_configureCerts_yyyymmdd_timestamp.txt
install_path\NetBackup\logs\nbatd
Troubleshooting procedures 98
Resolving PBX problems
/usr/openv/logs/nbwebservice
/usr/openv/wmc/webserver/logs/configureCerts.log
/usr/openv/logs/nbatd
install_path\NetBackup\logs\nbwebservice
C:\ProgramData\Veritas\NetBackup\InstallLogs\WMC_configureCerts_yyyymmdd_timestamp.txt
install_path\NetBackup\logs\nbatd
4 Access and check the PBX logs by using the following procedure:
See “Accessing the PBX logs” on page 101.
5 Check the PBX security and correct any problem by using the following
procedure:
See “Troubleshooting PBX security” on page 102.
6 Check that the required NetBackup daemon or service is running. If necessary,
start the needed daemon or service by using the following procedure:
See “Determining if the PBX daemon or service is available” on page 104.
ps | grep pbx_exchange
/opt/VRTSpbx/bin/vxpbx_exchanged start
On Windows, make sure that the Private Branch Exchange service is started.
(Go to Start > Run and enter services.msc.)
install_path\VxPBX\bin\pbxcfg -p
Example output:
/opt/VRTSpbx/bin/pbxcfg -p
Example output:
/opt/VRTSpbx/bin/pbxcfg -a -u root
/opt/VRTSpbx/bin/pbxcfg -d -m
For more information on the pbxcfg command, refer to the pbxcfg man
page.
■ install_path\VxPBX\log (Windows)
The unified logging originator number for PBX is 103. See the NetBackup Logging
Reference Guide for more information on unified logging.
Error messages regarding PBX may appear in the PBX log or in the unified logging
logs for nbemm, nbpem, nbrb, or nbjm. The following is an example of an error that
is related to PBX:
pbxcfg -s -l debug_level
You do not have to restart PBX for this setting to take effect.
■ On UNIX:
/opt/VRTSpbx/bin/pbxcfg -p
install_path\VxPBX\bin\pbxcfg -d -m
■ On UNIX:
/opt/VRTSpbx/bin/pbxcfg -d -m
3 Stop NetBackup:
■ On Windows:
install_path\NetBackup\bin\bpdown
■ On UNIX:
/usr/openv/netbackup/bin/bp.kill_all
4 Stop PBX:
■ On Windows: Go to Start > Run, enter services.msc, and stop the Veritas
Private Branch Exchange service.
■ On UNIX:
/opt/VRTSpbx/bin/vxpbx_exchanged stop
5 Start PBX:
■ On UNIX:
/opt/VRTSpbx/bin/vxpbx_exchanged start
■ On Windows: Go to Start > Run, enter services.msc, and start the Veritas
Private Branch Exchange service.
6 Start NetBackup:
■ On Windows:
install_path\NetBackup\bin\bpup
Troubleshooting procedures 104
Resolving PBX problems
■ On UNIX:
/usr/openv/netbackup/bin/bp.start_all
install_path\NetBackup\bin\bpdown
install_path\NetBackup\bin\bpup
■ On UNIX:
/usr/openv/netbackup/bin/bp.kill_all
/usr/openv/netbackup/bin/bp.start_all
Troubleshooting procedures 105
Troubleshooting problems with validation of the remote host
Note: The host validation failures are shown as connection failure errors on
NetBackup 8.0 and earlier hosts.
■ Windows: Install_Path\NetBackup\bin\admincmd\nbseccmd
-getsecurityconfig -insecurecommunication
Note: Although Auto Image Replication supports replication across different master
server domains, the Replication Director does not.
Auto Image Replication operates like any duplication job except that its job contains
no write side. The job must consume a read resource from the disk volume on which
the source images reside. If no media server is available, the job fails with status
800.
The Auto Image Replication job operates at a disk volume level. Within the storage
unit that is specified in the storage lifecycle policy for the source copy, some disk
volumes may not support replication. Use the Disk Pools interface of the NetBackup
Administration Console to verify that the image is on a disk volume that supports
replication. If the interface shows that the disk volume is not a replication source,
click Update Disk Volume or Refresh to update the disk volume(s) in the disk
pool. If the problem persists, check your disk device configuration.
Note: The Update Disk Volume option is also available in the NetBackup web UI.
Click Storage > Disk Pool.
The action to take on the automatic replication job depends on several conditions
as shown in the following table.
Action Condition
AIR replication jobs are queued but have No media server or I/O stream is available.
not started
Troubleshooting procedures 110
Troubleshooting Auto Image Replication
Action Condition
AIR replication jobs fail, for example with Check the job details for more information about
status 191 the failure.
Example output:
LSU Info:
Server Name: PureDisk:ss1.acme.com
LSU Name: PureDiskVolume
Allocation : STS_LSU_AT_STATIC
Storage: STS_LSU_ST_NONE
Description: PureDisk storage unit (/ss1.acme.com#1/2)
Configuration:
Media: (STS_LSUF_DISK | STS_LSUF_ACTIVE | STS_LSUF_STORAGE_NOT_FREED
| STS_LSUF_REP_ENABLED | STS_LSUF_REP_SOURCE)
Save As : (STS_SA_CLEARF | STS_SA_OPAQUEF | STS_SA_IMAGE)
Replication Sources: 0 ( )
Replication Targets: 1 ( PureDisk:bayside:PureDiskVolume )
...
3 To display the replication targets by using the raw output, run the following
command:
The display shows that the replication target is a storage server called bayside
and the LSU (volume) name is PureDiskVolume.
4 To ensure that NetBackup captured this configuration correctly, run the following
command:
This listing shows that disk volume PureDiskVolume is configured in disk pool
PDPool, and that NetBackup recognizes the replication capability on the source
side. A similar nbdevquery command on the target side should display
ReplicationTarget for its disk volume.
5 If NetBackup does not recognize the replication capability, run the following
command:
6 To ensure that you have a storage unit that uses this disk pool, run the following
command:
# bpstulist
PDstu 0 _STU_NO_DEV_HOST_ 0 -1 -1 1 0 "*NULL*"
1 1 51200 *NULL* 2 6 0 0 0 0 PDpool *NULL*
The output shows that storage unit PDstu uses disk pool PDpool.
Troubleshooting procedures 113
Troubleshooting Auto Image Replication
7 Check the settings on the disk pool by running the following command:
Max IO Streams is set to -1, which means the disk pool has unlimited
input-output streams.
8 To check the list of media servers that are credentialed to access the storage
servers and their disk pools, run the following command:
This disk pool only has one media server, ss1.acme.com. You have completed
the storage configuration validation.
Troubleshooting procedures 114
Troubleshooting Auto Image Replication
9 The last phase of validation is the storage lifecycle policy configuration. To run
Auto Image Replication, the source copy must be on storage unit PDstu. Run
the following command (for example):
nbstl woodridge2bayside -L
Name: woodridge2bayside
Data Classification: (none specified)
Duplication job priority: 0
State: active
Version: 0
Destination 1 Use for: backup
Storage: PDstu
Volume Pool: (none specified)
Server Group: (none specified)
Retention Type: Fixed
Retention Level: 1 (2 weeks)
Alternate Read Server: (none specified)
Preserve Multiplexing: false
Enable Automatic Remote Import: true
State: active
Source: (client)
Destination ID: 0
Destination 2 Use for: 3 (replication to remote master)
Storage: Remote Master
Volume Pool: (none specified)
Server Group: (none specified)
...
Preserve Multiplexing: false
Enable Automatic Remote Import: false
State: active
Source: Destination 1 (backup:PDstu)
Destination ID: 0
To troubleshoot the Auto Image Replication job flow, use the same command
lines as you use for other storage lifecycle policy managed jobs. For example,
to list the images that have been duplicated to remote master, run the following:
To list the images that have not been duplicated to remote master (either
pending or failed), run the following:
10 To show the status for completed replication copies, run the following command:
nbstlutil repllist -U
Image:
Master Server : ss1.acme.com
Backup ID : woodridge_1287610477
Client : woodridge
Backup Time : 1287610477 (Wed Oct 20 16:34:37 2010)
Policy : two-hop-with-dup
Client Type : 0
Schedule Type : 0
Storage Lifecycle Policy : woodridge2bayside2pearl_withdup
Storage Lifecycle State : 3 (COMPLETE)
Time In Process : 1287610545 (Wed Oct 20 16:35:45 2010)
Data Classification ID : (none specified)
Version Number : 0
OriginMasterServer : (none specified)
OriginMasterServerID : 00000000-0000-0000-0000-000000000000
Import From Replica Time : 0 (Wed Dec 31 18:00:00 1969)
Required Expiration Date : 0 (Wed Dec 31 18:00:00 1969)
Created Date Time : 1287610496 (Wed Oct 20 16:34:56 2010)
Copy:
Master Server : ss1.acme.com
Backup ID : woodridge_1287610477
Copy Number : 102
Copy Type : 3
Expire Time : 1290288877 (Sat Nov 20 15:34:37 2010)
Expire LC Time : 1290288877 (Sat Nov 20 15:34:37 2010)
Try To Keep Time : 1290288877 (Sat Nov 20 15:34:37 2010)
Residence : Remote Master
Copy State : 3 (COMPLETE)
Job ID : 25
Retention Type : 0 (FIXED)
MPX State : 0 (FALSE)
Source : 1
Destination ID :
Last Retry Time : 1287610614
Replication Destination:
Source Master Server: ss1.acme.com
Backup ID : woodridge_1287610477
Copy Number : 102
Troubleshooting procedures 116
Troubleshooting Auto Image Replication
Rules for master servers used with Auto Image Replication and SLPs
Auto Image Replication operations use storage lifecycle policies (SLP) in at least
two NetBackup master server domains. Verify that the two master servers follow
these rules:
■ If replicating to specific targets (targeted AIR), you must create the Import SLP
before creating the Auto Image Replication SLP in the originating domain. You
may then choose the appropriate import SLP.
Note: Ensure that the Import SLP name is less than 113 characters.
■ The storage lifecycle policy's data classification in the source master server
domain must match the SLP policy's data classification in the target master
server domain.
Cause
The issue can occur because of the following reasons:
■ Cause 1 - Enrollment of source master server to target master server failed.
■ Cause 2 - Failed to add the target master server in the trusted master server
database and in the configuration file as TRUSTED_MASTER.
2 Review the error message: (EXIT STATUS 5616: The local master server is
not reachable. The trust is unidirectional right now, the remote master server
trusts the local master server, but the local master server doesn't trust the
remote master. Please remove the trust)
If the bprd service is down on the source master server, check the logs in the
following order:
■ Check the bprd logs.
Windows: C:\Program Files\Veritas\NetBackup\logs\bprd\log_file
UNIX: /usr/openv/netbackup/logs/bprd/log_file
■ Check the proxy logs.
Windows: C:\Program
Files\Veritas\NetBackup\logs\nbpxyhelper\log_file
Troubleshooting procedures 118
Troubleshooting Auto Image Replication
UNIX: /usr/openv/logs/nbpxyhelper/log_file
■ Check the EMM database logs.
Windows: C:\Program Files\Veritas\NetBackup\logs\nbemm\log_file
UNIX: /usr/openv/logs/nbemm/log_file
Remove trust
Problem
Remove trust operation failed
Cause
Failed to remove target master server from trusted master server database and
from configuration file as TRUSTED_MASTER.
Solution
■ Review the error message: (EXIT STATUS 5616: The local master server is not
reachable. The trust is unidirectional right now, the remote master server trusts
the local master server, but the local master server doesn't trust the remote
master. Please remove the trust) .
The bprd service is down on the source master server.
Check the logs in the following order:
■ Check bprd logs.
Windows: C:\Program Files\Veritas\NetBackup\logs\bprd\log_file
UNIX: /usr/openv/netbackup/logs/bprd/log_file
read the entire image. An automatic import job reads the catalog record off the
storage device and adds it into its own catalog. This process is so fast that
NetBackup batches images for import for efficiency. A pending import is the state
where NetBackup has been notified, but the import has not yet occurred.
More information is available about the import operation in an SLP and how to tune
the batch interval of the import manager process.
See the NetBackup Administrator's Guide, Volume I.
The notify event from the storage server provides the following: the image name,
the storage server location to read the catalog for this image, and the name of the
SLP that processes the image. Images for automatic import jobs are batched by
storage lifecycle policy name and disk volume. The import job consumes an
input-output stream on the disk volume.
To view the images that are pending import, run the following command:
# nbstlutil pendimplist -U
Image:
Master Server : bayside.example.com
Backup ID : gdwinlin04_1280299412
Client : gdwinlin04
Backup Time : 1280299412 (Wed Jul 28 01:43:32 2010)
Policy : (none specified)
Client Type : 0
Schedule Type : 0
Storage Lifecycle Policy : (none specified)
Storage Lifecycle State : 1 (NOT_STARTED)
Time In Process : 0 (Wed Dec 31 18:00:00 1969)
Data Classification ID : (none specified)
Version Number : 0
OriginMasterServer : master_tlk
OriginMasterServerID : 00000000-0000-0000-0000-000000000000
Import From Replica Time : 0 (Wed Dec 31 18:00:00 1969)
Required Expiration Date : 0 (Wed Dec 31 18:00:00 1969)
Created Date Time : 1287678771 (Thu Oct 21 11:32:51 2010)
Copy:
Master Server : bayside.example.com
Backup ID : gdwinlin04_1280299412
Copy Number : 1
Copy Type : 4
Expire Time : 0 (Wed Dec 31 18:00:00 1969)
Expire LC Time : 0 (Wed Dec 31 18:00:00 1969)
Try To Keep Time : 0 (Wed Dec 31 18:00:00 1969)
Troubleshooting procedures 120
Troubleshooting Auto Image Replication
Fragment:
Master Server : bayside.example.com
Backup ID : gdwinlin04_1280299412
Copy Number : 1
Fragment Number : -2147482648
Resume Count : 0
Media ID : @aaaab
Media Server : bayside.example.com
Storage Server : bayside.example.com
Media Type : 0 (DISK)
Media Sub-Type : 0 (DEFAULT)
Fragment State : 1 (ACTIVE)
Fragment Size : 0
Delete Header : 1
Fragment ID : gdwinlin04_1280299412_C1_IM
The action to take on the automatic import job and the automatic import event
depends on several conditions as shown in the following table.
Action Condition
Automatic import jobs queue No media server or I/O stream is available for this
disk volume.
Automatic import jobs never start (copy ■ The storage lifecycle policy is inactive.
stays at storage lifecycle state 1) ■ The storage lifecycle policy import destination
is inactive.
■ The storage lifecycle policy is between
sessions.
■ The image has exceeded the extended retry
count and the extended retry time has not
passed.
Troubleshooting procedures 121
Troubleshooting Auto Image Replication
Action Condition
Automatic import event is discarded and ■ The event specifies a backup ID that already
the image is ignored exists in this master server catalog.
■ The event specifies a disk volume that is not
configured in NetBackup for this storage server.
Automatic import job is started but the ■ The storage lifecycle policy that is specified in
image is expired and deleted to clean the event does not contain an import
up disk space in some cases. The event destination.
logs an error in the Problems Report or ■ The storage lifecycle policy that is specified in
bperror output. An import job runs, but the event has an import destination with a
the import for this image fails showing residence that does not include the disk volume
a status code in the range 1532–1535. that is specified by the event.
■ The storage lifecycle policy that is specified
does not exist. By default, the Storage
Lifecycle Policies utility automatically creates
a storage lifecycle policy with the correct name.
Ensure that a storage lifecycle policy with the
same case-sensitive name exists in the target
master server.
More information is available for the storage
lifecycle policy configuration options.
See the NetBackup Administrator's Guide,
Volume I.
Look at the Problems Report or the bperror list for these cases.
To troubleshoot the job flow for automatic import jobs, use the same commands
as you would for other storage lifecycle policy managed jobs. To list images for
which NetBackup has received notification from storage but not yet initiated import
(either pending or failed): use the commands that were previously noted or run the
following command:
To list the images that have been automatically imported, run the following command:
Image:
Master Server : bayside.example.com
Backup ID : gdwinlin04_1280299412
Client : gdwinlin04
Backup Time : 1280299412 (Wed Jul 28 01:43:32 2010)
Policy : (none specified)
Client Type : 0
Schedule Type : 0
Storage Lifecycle Policy : (none specified)
Storage Lifecycle State : 1 (NOT_STARTED)
Time In Process : 0 (Wed Dec 31 18:00:00 1969)
Data Classification ID : (none specified)
Version Number : 0
OriginMasterServer : master_tlk
OriginMasterServerID : 00000000-0000-0000-0000-000000000000
Import From Replica Time : 0 (Wed Dec 31 18:00:00 1969)
Required Expiration Date : 0 (Wed Dec 31 18:00:00 1969)
Created Date Time : 1287680533 (Thu Oct 21 12:02:13 2010)
Note: If the NIC in a NetBackup master or media server is changed, or if the server
IP address changes, CORBA communications may be interrupted. To address this
situation, stop and restart NetBackup.
For help on how to view and reset duplex mode for a particular host or device,
consult the manufacturer’s documentation. If the documentation is not helpful,
perform the following procedure.
To troubleshoot network interface card performance
1 Log onto the host that contains the network interface card whose duplex mode
you want to check.
2 Enter the following command to view the current duplex setting.
ifconfig -a
4 For most hosts, you can set full-duplex mode permanently, such as in the host’s
/etc/rc files. Refer to the host’s documentation for more information.
Troubleshooting procedures 124
About SERVER entries in the bp.conf file
Remove or correct the SERVER entry in the bp.conf file, restart nbftclnt on the
client, and retry the operation.
Note: The nbftclnt process on the client must be running before you start a SAN
client backup or restore over Fibre Channel.
NetBackup status The operations that are performed in the NetBackup Administration Console can result
codes and messages in the errors that are recognized in other parts of NetBackup. These errors usually appear
exactly as documented in the NetBackup status codes and messages.
Note: A status code does not always accompany the error message.
Java exceptions Either the Java APIs or NetBackup Administration APIs generate these exceptions.
Java exceptions usually appear in one of the following places:
Scenario
If the vnetd service is down on the host to which the NetBackup Administration
Console is connecting
Recommended action
Check if the services are up on the host and try logging in again.
Scenario
If external certificate's private key is not available or is in an incorrect format, error
VRTS-28678 is displayed.
Recommended action
■ Check if the path provided for the ECA_PRIVATE_KEY_PATH configuration option
is valid (it should not be empty).
■ Check if the path provided for ECA_PRIVATE_KEY_PATH is accessible and also
if the private key file has required access permissions.
■ Provide a valid private key and try logging in again.
In case of Windows certificate store, do the following:
■ Run the certlm.msc command.
In case certlm.msc is not working, you can access the Windows certificate
store by running the mmc.exe command. Go to File > Add Remove Snap in.
■ Open the certificate by double clicking it.
Troubleshooting procedures 128
Unable to logon to the NetBackup Administration Console after external CA configuration
The certificate with private key should have a message stating that you have a
private key corresponding to this certificate.
Scenario
If the external certificate is not present while you establish the trust with the
NetBackup Administration Console.
Recommended action
■ Check if the path provided for the ECA_TRUST_STORE_PATH configuration option
is not empty.
■ Check if the path provided for ECA_TRUST_STORE_PATH is accessible and also
if the CA certificate file has required access permissions.
■ Provide a valid external certificate and try logging in.
In case of Windows certificate store, do the following:
■ Check if the root CA certificate is added in the Windows Cert Store's Trusted
Root Certificate Authorities.
■ Run certlm.msc command. In the certificate management window, open the
store named Trusted Root Certificate Authorities. The Trusted Root Certificate
Authorities store contains all the self-signed certificates that are trusted by that
machine.
In case certlm.msc is not working, you can access the Windows certificate
store by running mmc.exe. Go to File > Add Remove Snap in.
■ Select certificates from left hand side.
■ Click Add.
■ Select computer account. Click Next.
■ Click Finish and then OK.
■ Click Trusted Root Certification Authorities > Certificates.
■ Check if the root CA certificate in the certificate chain is present in the Trusted
Root Certificate Authorities store.
Note: All the certificates should be imported in the local machine store and not
in the current user store. You can verify the current store in the certificate
management window.
Scenario
If an external CA-signed certificate is not present or not accessible, the following
error is displayed:
The host does not have external CA-signed certificate. The certificate
is mandatory to establish a secure connection.
Recommended action
■ Check if the path provided for ECA_CERT_PATH in NetBackup configuration
file is not empty.
■ Check if the path provided for ECA_CERT_PATH points to the entire certificate
chain.
■ Check if the path provided for ECA_CERT_PATH is accessible and also if it
has required access permissions.
■ Provide a valid external CA-signed certificate and try logging in.
In case of Windows certificate store, do the following:
■ Check if ECA_CERT_PATH contains the appropriate value: Windows Certificate
Store Name\Issuer Name\Subject Name. Verify if the certificate exists in the
Windows certificate store.
■ Run the certlm.msc command.
In case certlm.msc is not working, you can access the Windows certificate
store by running the mmc.exe.
File > Add Remove Snap in.
■ Navigate to your certificate as per your input Windows Certificate Store
Name\Issuer Name\Subject Name.
■ Open your certificate by double-clicking it.
■ Ensure that it is valid, has a private key, a correct issuer name, and a correct
subject name.
If you are using $hostname in Subject name, check that certificate subject
has fully qualified domain name of the host.
If this is not the case, either change the ECA_CERT_PATH or put the right
certificate in Windows certificate store and then try logging in.
Troubleshooting procedures 130
Unable to logon to the NetBackup Administration Console after external CA configuration
Scenario
Certificate revocation list (CRL) is not signed by a trusted authority.
Recommended action
This may occur at the time of login if the master server was configured to use
NetBackup certificates and later it was enabled to use external certificates and vice
versa. So the NetBackup Administration Console starts using the new CRL if you
click Activity Monitor, locks the screen, tries to login again or in the periodic checks
after every 1 hour, the certificate revocation status verification fails.
To fix this issue, you need to close the console and login again so that the peer
host's certificate and the CRL are in sync.
If logging in again does not fix the issue then the reason can be the new CRL was
not downloaded.
Run following command after correcting the CRL format:
UNIX: /usr/openv/netbackup/bin/nbcertcmd -updateCRLCache
Windows: install_path\Veritas\Netbackup\bin\nbcertcmd -updateCRLCache
Scenario
The revocation status of the host certificate cannot be verified using the CRL,
because the CRL format is not valid.
Recommended action
This error can occur if a delta CRL is used.
NetBackup does not support delta CRLs, so you need to use non-delta CRLs.
Run following command after correcting the CRL format:
UNIX: /usr/openv/netbackup/bin/nbcertcmd -updateCRLCache
Windows: install_path\Veritas\Netbackup\bin\nbcertcmd -updateCRLCache
Scenario
The certificate of the host name is revoked.
Recommended action
If the certificate was revoked in error, reissue a certificate for the host.
If the certificate was revoked intentionally, a security breach may have occurred.
Contact your security administrator.
Troubleshooting procedures 131
Unable to logon to the NetBackup Administration Console after external CA configuration
Scenario
The Certificate Revocation List could not be downloaded. Therefore the certificate
revocation status could not be verified.
Recommended action
The possible causes include the following:
■ ECA_CRL_PATH is missing or has incorrect path.
Scenario
The Certificate Revocation List is not updated. Therefore the certificate revocation
status could not be verified.
Recommended action
The possible causes include the following:
■ The next update date / time of the CRL is older than the current system date /
time.
■ The CRL was valid at the time of login. The console was open and now the CRL
has become invalid.
Ensure that the system time is correct.
In case the new CRL was not downloaded, run the following command
UNIX: /usr/openv/netbackup/bin/nbcertcmd -updateCRLCache
Windows: install_path\Veritas\Netbackup\bin\nbcertcmd -updateCRLCache
Scenario
Unable to connect to the NetBackup Web Management Console service.
Recommended action
The possible causes include the following:
■ The NetBackup Web management Console service is down.
■ ECA_CERT_PATH does not point to the entire certificate chain.
Troubleshooting procedures 132
Troubleshooting file-based external certificate issues
■ Web service certificate's issuer and the issuer of the host certificate may not
match.
If both the certificates are not issued by the same external CA, certificate trust
verification fails.
Review the following:
■ It is mandatory to provide the path to the certificate file that contains the entire
chain of certificates (except the root certificate).
■ If chain is not specified, the certificate trust verification fails and the console is
not able to connect to the web service.
■ Ensure that the web server's certificate and the host certificate are issued by
same external CA.
NBCA:OFF ECA:ON
■ The web service certificate that is used for communication is not trusted by a
certificate authority.
■ Check the certificate path (the configureWebServerCert -certPath option)
must have a leaf certificate with the entire chain of CA certificates except
the trust anchor (root CA).
■ Run the following command to list the certificates that are configured for the
web server.
nbcertcmd -listallcertificates -jks
On Windows: C:\Program Files\ VERITAS\NetBackup\bin\nbcertcmd
-listallcertificates -jks
On UNIX: /usr/openv/netbackup/bin/netbackup/bin/nbcertcmd
-listallcertificates -jks
■ Run the following command to list the host certificate details of the NetBackup
primary server.
install_path/goodies/nbsslcmd x509 -in certificate_path -noout
-text -purpose
On Windows: install_path\goodies\nbsslcmd x509 -in
certificate_path -noout -text -purpose
On UNIX: /usr/openv/netbackup/bin/netbackup/bin/goodies/nbsslcmd
x509 -in certificate_path -noout -text -purpose
Troubleshooting procedures 134
Troubleshooting file-based external certificate issues
Validate whether the host certificate of the primary server is issued by the
same root CA as of the web server certificate.
If host certificate is not issued by the same root CA as of web server
certificate then issue new certificate with that CA for NetBackup primary
server and enroll certificate again.
■ The specified server name was not found in the web service certificate.
The server name does not match any of the host names listed in the server's
certificate.
Names listed in the server's certificate are:
DNS: nb-primary _ext
DNS: nb-primary .some.domain.com
DNS: nb-primary _web_svr EXIT STATUS 8509:
Either update the configuration on the NetBackup host so that it uses one of the
names that are present in the web server certificate to refer to the primary server
or include all names of the primary server that are known to the NetBackup
domain in the certificate.
For more information, refer to the following article:
https://www.veritas.com/support/en_US/article.000126751
Cause 2
Some of the NetBackup core services have not started.
For more details on the NetBackup commands, refer to the NetBackup Commands
Reference Guide.
Use the following procedure to resolve the issue:
■ Check the status of the following services by running the bpps command from
the NetBackup bin directory:
■ nbsl
■ vnetd -standalone
■ Restart the nbsl and the vnetd services, if they are not running.
■ Restart the NetBackup Scale-Out Relational Database, if it is not running.
On Windows:
Restart the nbsl, vnetd, and NetBackup Scale-Out Relational Database Manager
services as follows:
Troubleshooting procedures 135
Troubleshooting file-based external certificate issues
On UNIX:
Restart the nbsl service as follows.
/usr/openv/netbackup/bin/nbsl -terminate
/usr/openv/netbackup/bin/nbsl
# kill 16018
/usr/openv/netbackup/bin/vnetd -standalone
/usr/openv/netbackup/bin/nbdbms_start_server
Cause 3
The required prerequisites for external certificate are not met.
Review the following prerequisites:
■ Subject DN should be unique and stable for each host. It should have less than
255 characters and should not be empty.
Troubleshooting procedures 136
Troubleshooting file-based external certificate issues
■ Only ASCII 7 characters are supported in the certificate subject DN and X509v3
Subject Alternative Name.
■ Server and client authentication attributes (SSL server and SSL client) should
be set (or should be true) in the certificate.
■ Certificate is in PEM format.
■ CRL distribution points (CDPs) are supported only for HTTP/HTTPS.
Run the following command to verify if the prerequisites are met.
install_path/goodies/nbsslcmd x509 -in certificate_path -noout -text
-purpose
Note: The certificate paths that are provided for the configureWebServerCert
-certPath option and the ECA_CERT_PATH option must have a leaf certificate with
the entire chain of the CA certificates except the trust anchor (root CA).
Desirable conditions:
■ Host name (CLIENT_NAME) that is used for certificate enrollment should be part
of X509v3 Subject Alternative Name under DNS type.
■ Common name (CN) of the subject name should not be empty.
Note: The following warning is generated when the nbsslcmd command is run and
can be safely ignored:
WARNING: can't open config file: /usr/local/ssl/openssl.cnf
Cause 4
External certificate configuration path is not configured properly.
Ensure the following external certificate configuration options are configured properly:
■ ECA_CERT_PATH
■ ECA_TRUST_STORE_PATH
■ ECA_PRIVATE_KEY_PATH
■ ECA_CRL_PATH
■ ECA_CRL_CHECK
If you have not specified ECA_CRL_PATH, NetBackup uses the CRLs on the URLs
that are specified in the peer host certificate's CDP.
■ ECA_CRL_PATH is not a volumeID path on Windows.
Run the following command and validate the external certificate configuration
parameters.
On UNIX: install_path/bin/nbgetconfig | grep ECA
Windows: install_path/bin/nbgetconfig | findstr ECA
.
For more information about the configuration options, refer to the NetBackup Security
and Encryption Guide.
Cause 5
The requirements that are mentioned in Cause 3 are not met.
■ Host name (CLIENT_NAME) used for the certificate enrollment is not part of X509v3
Subject Alternative Name under the DNS type.
If enrollment fails with this error, do one of the following:
■ Generate new certificate having host name in subject alternative name of
the certificate.
■ Add or update (first delete and then add) the subject name of the certificate
(RFC 2253 compliant) in the external certificate database on the primary
server.
Run the following command to add an entry for the host and the associated
subject name in the NetBackup certificate database (only administrator can
perform this operation):
install_path/bin/nbcertcmd -createECACertEntry -host host_name
| -hostId host_id -subject subject name of external cert
[-server primary_server_name]
Alternatively, run the following command to delete an entry for the host and
the associated subject name from the NetBackup certificate database and
then add an entry using the -createECACertEntry command (only
administrator can perform this operation):
install_path/bin/nbcertcmd -deleteECACertEntry -subject subject
name of external cert [-server primary_server_name]
■ Common name (CN) of the subject name is not present in the certificate.
If certificate enrollment fails with this error, do one of the following:
■ Generate a new certificate with the common name in the certificate.
Troubleshooting procedures 138
Troubleshooting Windows certificate store issues
■ Generate a new certificate with the host name in the subject alternative name
of the certificate.
■ Add host in the NetBackup host database and add an entry for the host and
the associated subject name in the NetBackup certificate database.
Run the following command to add a host in the NetBackup host database
(only administrator can perform this operation):
install_path/bin/admincmd/nbhostmgmt -addhost -host host_name
| -hostId host_id [-server primary_server_name]
Run the following command to add an entry for the host and the associated
subject name in the NetBackup certificate database.
install_path/bin/nbcertcmd -createECACertEntry -host host_name
| -hostId host_id -subject subject name of external cert
[-server primary_server_name]
Subject name of the external certificate should be RFC 2253 compliant.
Cause 6
Certificate revocation check failed.
External certificate enrollment can fail with the certificate revocation error for the
following reasons:
■ The external certificate is revoked.
■ The web server certificate is revoked.
■ CRL is unavailable on either the host or the primary server.
See “Troubleshooting issues with external CA-signed certificate revocation”
on page 67.
For more details on enrollment of external certificates in NetBackup, refer to the
NetBackup Security and Encryption Guide.
Problem
The web service certificate cannot be trusted while enrolling the host certificate.
Cause
This issue is caused by one of the following:
Troubleshooting procedures 139
Troubleshooting Windows certificate store issues
■ The web service certificate that is used for communication is not configured
properly.
■ The root certificate in the certificate chain of web service certificate is not present
in the Trusted Root Certification Authorities of the Windows certificate store.
Solution
To resolve the issue, review the following causes and run the following command
to determine the current state of the problem.
Install_Path/bin/ nbcertcmd -enrollCertificate -preCheck -server
server_name
■ Ensure that all the certificates in the chain (except the root CA certificate) are
present in the jks.
Check the following parameters in the nbcertcmd -listallcertificates
-jks output.
■ If the root CA certificate is not present, click All Actions > Import, select .PEM
/ .CRT / .CER file of the certificate and click Import.
All the certificates should be imported in the local machine store and not in the
current user store.
You can verify the current store in the certificate management window.
Problem
Certificate's public key algorithm is not supported.
The public key algorithm is not supported by NetBackup. Currently only the RSA
algorithm is supported.
Cause
The certificate with given path exists in windows cert store but its signature algorithm
is not supported.
Solution
You need to use the certificate with public key algorithm that is supported by
NetBackup.
Troubleshooting procedures 141
Troubleshooting Windows certificate store issues
Problem
Private key for the given certificate is not available.
The certificate in specified by the path does not have a corresponding private key
imported in Windows certificate store.
Cause
This is typically caused by importing a .crt, .cer, or .pem certificate manually in
the Windows certificate store instead of .pfx.
Solution
Ensure that the certificate has its private key imported.
■ Run the certlm.msc command.
In case certlm.msc does not work, you can access the Windows certificate
store by running the mmc.exe command.
File > Add Remove Snap in
■ Navigate to your certificate.
■ Open your certificate by double-clicking it.
The certificate with the private key should have a message stating that you have
a private key corresponding to this certificate.
■ If certificate is to be manually enrolled, import a .pfx file and not just the .cer
or .crt file.
For more details on enrollment of external certificates in NetBackup, refer to the
NetBackup Security and Encryption Guide.
Problem
Certificate with the given subject name is not found
Could not find the certificate when a special keyword $hostname is used in
ECA_CERT_PATH
Cause
The certificate does not exist in the local machine store for the given ECA_CERT_PATH.
One of the attributes from store name, issuer name, or subject name does not
match the one in the local machine store.
Troubleshooting procedures 142
Troubleshooting backup failures
Solution
■ Check if the certificate exists in the local machines store. Do the following:
■ Run the certlm.msc command.
In case certlm.msc does not work, you can access the Windows certificate
store by running the mmc.exe command.
File > Add Remove Snap in.
■ Check if the certificate exist
Cause
Possible reasons for the failure are:
Troubleshooting procedures 143
Troubleshooting backup failure issues with NAT clients or NAT servers
■ The master server (web server) is configured to use only external CA-signed
certificates, but the media server or the clients are not configured to use external
certificates. Their external certificates are not enrolled with the master server
domain.
■ The master server (web server) is configured to use only external CA-signed
certificates, but the media server or the clients are still not upgraded to 8.2 or
later.
Solution
■ Check the master server certificate authority (CA) configuration using the
nbcertcmd -getsecconfig -caUsage command, the NetBackup Web UI.
If the web server is configured to use only external certificates, do the following:
■ Identify the two hosts for which the communication fails.
■ Check if any of the two hosts is 8.2 or later, but is not configured to use external
certificates.
If it is true, enroll an external certificate for the host with the master server
domain.
■ Check if any of the two hosts is 8.1.x.
If it is true, upgrade the host to 8.2 or later and enroll an external certificate for
the host with the master server domain or configure the web server to use both
external and NetBackup certificates.
■ Clear the cache memory on the hosts using the following command:
bpclntcmd -clear_host_cache
■ The nbmqbroker service may not be up and running on the master server.
Troubleshooting procedures 144
Troubleshooting backup failure issues with NAT clients or NAT servers
Cause 1
Media server cannot connect to the nbmqbroker service.
Cause 2
The nbmqbroker service may not be up and running on the master server.
Cause 1 and Cause 2 have the same solution as follows:
■ Check the bpbrm logs on the media server at Install_Path/logs/bpbrm.
■ Check the nbmqbroker log file at:
UNIX: /usr/openv/mqbroker/logs
Windows: Install_Path/mqbroker/logs
■ Ensure that the nbmqbroker service is running on the master server. Use the
following commands:
■ Run the bpps command.
■ Run the bptestbpcd -host hostname command from the master or media
server and check the admin logs at Install_Path/logs/admin.
■ Check the subscriber service is running on the NAT client by running the bpps
command.
■ Check the existing host ID-to-host name mapping of the client by running the
Install_Path/bin/admincmd/nbhostmgmt -li -json command on the master
or media server.
■ If the client name is not mapped to the host ID, add a new name for the client
and map it to existing host ID using the
Install_Path/bin/admincmd/nbhostmgmt -add -hostid hostid
-mappingname hostname command.
■ Add the media server name in the /etc/hosts file on the client.
■ Add the media server name in the configuration file on the client using the
nbsetconfig command.
Do the following:
■ Check the client logs to see if it contains the following message:
4 If the master server service is restarted, restart the media server and wait for
the media server to be online.
5 Check if the subscriber logs of the media server are ready to receive connection
messages if the log level is set to a value greater than 1. For example:
Log message for the disconnected state: Retrying connection stopped
for n seconds with attempt:m
Example: The ‘Program Files’ directory has the 8dot3 file name setting enabled,
therefore the short name ‘PROGRA~1’ is generated.
But it differs for the ‘not8 Dot3’ directory.
C:\>dir /x
-5.6.3
2 Check the certificates that the nbmqbroker service uses. Run the following
command:
On Unix: cat /usr/openv/var/global/mqbroker/mqbroker.config | grep
ssl_options
On windows: type
"NetBackup_Install_path\var\global\mqbroker\mqbroker.config" |
findstr "ssl_options"
You need to remove the NetBackup certificates so that the nbmqbroker service
works appropriately.
3 Run the following command to remove the NetBackup certificates:
configureWebServerCerts -removeNBCert
Troubleshooting procedures 152
Troubleshooting issues with the NetBackup Messaging Broker (or nbmqbroker) service
4 Restart the NetBackup Web Management Console (nbwmc) service and the
nbqmbroker service to reflect the changes.
5 Check the certificates that the nbmqbroker service uses. Run the following
command:
On Unix: cat /usr/openv/var/global/mqbroker/mqbroker.config | grep
ssl_options
On windows: type
"NetBackup_Install_path\var\global\mqbroker\mqbroker.config" |
findstr "ssl_options"
Root cause:
Certain configuration changes on the master server may result into inconsistency
in nbmqbroker service configuration. To resolve the issue, you need to reconfigure
the nbmqbroker service.
Troubleshooting procedures 153
Troubleshooting issues with the NetBackup Messaging Broker (or nbmqbroker) service
■ bp.start_all command
Sample output:
# nslookup primary-server.com
Server: 2600:100:f0a1:9000::a
Address: 2600:100:f0a1:9000::a#53
Non-authoritative answer:
Name: primary-server.com
Address: 10.200.100.60
Name: primary-server.com
Address: 2600:100:f0a1:9014::335
Expected output:
Troubleshooting procedures 154
Troubleshooting issues with the NetBackup Messaging Broker (or nbmqbroker) service
# nslookup primary-server.com
Server: 2600:100:f0a1:9000::a
Address: 2600:100:f0a1:9000::a#53
Non-authoritative answer:
Name: primary-server.com
Address: 2600:100:f0a1:9014::335
Do the following:
■ Fix all the configurations to create an appropriate IPv6-only setup.
■ If the issue still persists, do the following configuration changes to start the
nbmqbroker service.
With this configuration, the nbmqbroker service always attempts to first use the
IPv6 address for name resolution.
To change the configurations
1 Do the following to create the required file.
Use an appropriate text editor (vi on Linux or Notepad on Windows) and create
a file called erl_inetrc in the given directory:
On Linux, create the erl_inetrc file in the following directory:
/usr/openv/var/global/mqbroker/erl_inetrc
ls -l /usr/openv/mqbroker/bin/setmqenv
4 Do the following:
On Linux:
Add the following lines in the
/usr/openv/var/global/mqbroker/advanced_setmqenv file:
RABBITMQ_SERVER_ADDITIONAL_ERL_ARGS="-kernel inetrc
'/usr/openv/var/global/mqbroker/erl_inetrc' -proto_dist inet6_tcp"
RABBITMQ_CTL_ERL_ARGS="-proto_dist inet6_tcp"
On Windows:
Add the following lines in the
NetBackup_Install_path\var\global\mqbroker\advanced_setmqenv file:
RABBITMQ_SERVER_ADDITIONAL_ERL_ARGS=-kernel inetrc
'E:/NetBackup/var/global/mqbroker/erl_inetrc' -proto_dist
inet6_tcp RABBITMQ_CTL_ERL_ARGS=-proto_dist inet6_tcp
5 Ensure that the file permissions are not changed after the update.
6 Start the nbmqbroker service.
Mar 27, 2020 5:20:40 PM - Error bptm (pid=11143) KMS failed with error status: Error details :
Error Code : 1298, Error Message : Cannot communicate with one or more key management servers.,
Server - example.master.com:0, Error code - 25, .
Mar 27, 2020 5:20:40 PM - Info bptm (pid=11143) EXITING with status 83 <----------
Mar 27, 2020 5:20:43 PM - Info bpbkar (pid=11132) done. status: 83: media open error
2 Run the following command on the master server to verify whether NetBackup
KMS is configured or not:
Install_Path/bin/nbkmscmd -listKMSConfig -name nbkms
■ If nbkms service is not running check nbkms logs at the following location:
On UNIX - /usr/openv/logs/nbkms
On Windows - Install_Path\NetBackup\logs\nbkms
Check if a key is created on the KMS server with the required key group.
Troubleshooting procedures 157
Issues with KMS configuration
4 Check if at least one active key is listed using the following command:
Install_Path/bin/nbkmscmd -listKeys -name KMS_configuration_name
-keyGroupName key_group_name
5 If key is not listed, create a key with the required key group and clear the cache
on the media server. Run the following command:
Install_Path/bin/bpclntcmd -clear_host_cache
4 Run the following command if certificate files exist on the master server.
Install_Path/netbackup/bin/goodies/nbkmiputil -validate -kmsServer
kms_server_name -port 5696 -certPath certificate_file_path
-privateKeyPath private_key__file_path -trustStorePath
ca_file_path
If key is not listed, create a key with the required key group and clear the cache
on the media server. Run the following command:
Install_Path/bin/bpclntcmd -clear_host_cache
[Debug] NB 51216 nbwebapi 495 PID:10984 TID:149 File ID:495 [No context] 5
[com.netbackup.config.PeerInfoPopulatorFilter]
Request URL : https://<Master-Server>:1556/netbackup/security/key-management-services/keys
Connection Info :ConnectionInfo
[Debug] NB 51216 nbwebapi 495 PID:10984 TID:149 File ID:495 [No context] 5
[com.netbackup.security.kms.resource.KMSConfigResource]
HTTP GET filter query string is : KeyId eq 'bdc3492b015d4a9ab25426465b12adac6a834dfc6b4449c49092
and kadlen eq 32
[Debug] NB 51216 nbwebapi 495 PID:10984 TID:149 File ID:495 [No context] 5
[com.netbackup.security.kms.resource.KMSConfigResource]
com.netbackup.security.kms.resource.KMSConfigResource getKeys() - NBKMSRecordNotFoundException
occured due to missing KMS record.com.netbackup.nbkms.exception.NBKMSRecordNotFoundException:
security.error.kms.KeyRecordNotFound
In case of such an error, it is possible that the CA migration was successfully initiated
but the request is timed out because of the large key size. However, in the
background the CA migration initiation may be complete and the certificates may
be renewed with the new CA.
To verify if the initiation of NetBackup CA migration was successful
1 Run the following command:
nbseccmd -nbcaMigrate -summary
3 Once you have ensured that the migration status is INITIATED, run the following
command to verify if the new CA is displayed in the list:
nbseccmd -nbcalist
■ If the new CA is present in the list, it implies that the migration is successfully
initiated.
■ If the new CA is not present in the list, run the following command:
nbseccmd -nbcaMigrate -syncMigrationDB
4 If the certificates are still not updated, contact Veritas Technical Support.
1 During NetBackup installation or Possible reasons are as follows: Resolutions are as follows:
upgrade on UNIX platform, unable to
■ Reason 1 - The service user ■ Resolution 1 - Run the following
specify the service user even after
does not exist locally, in command:
three prompts.
LDAP, or in NIS. id service_user
■ Reason 2 - nbwebsvc is used The ID command must be
as a service user. successful.
■ Reason 3 - nbwebgrp is not ■ Resolution 2 - Run the
a secondary group of the nbgetconfig command to
service user. check the NetBackup
configuration file (bp.conf) for
the WEBSVC_USER entry.
The service user cannot be
same as the value that is set
for the WEBSVC_USER
configuration option.
■ Resolution 3 - Run the
nbgetconfig command to
check the NetBackup
configuration file (bp.conf) for
the WEBSVC_USER entry.
Run the following command:
id service_user
In the command output, ensure
that gid is not equal to the gid
of the WEBSVC_GROUP
option value and groups have
the value WEBSVC_GROUP.
2 During NetBackup installation on an The service user name and the Ensure that the service user name
inactive cluster node on UNIX platform, user ID do not match. and the user ID match on all cluster
one of the following errors occurs: nodes and the same is provided
during NetBackup installation on
■ Service user name on
active and inactive nodes.
active node does not match
with service user name
entered on inactive node.
■ SERVICE_USER_ID '10021'
retrieved from active node
does not match with the
user ID '1002' of local
user 'nonroot'.
Troubleshooting procedures 163
Issues with the non-privileged user (service user) account
3 During NetBackup upgrade of an The bpgetconfig command Provide the service user as that of
inactive cluster node on UNIX platform, could not retrieve the service user the active node and ensure that the
the following error occurs: and the ID from active node. service user has the same user ID
on all cluster nodes.
Failed to retrieve the
'SERVICE_USER' or
'SERVICE_USER_ID' entries
from the configuration file
on the server
'cluster_virtual_name'. You
must provide the same
'SERVICE_USER' (daemon user
name) that is configured on
the active node.
4 During NetBackup installation or This may be because of the Fix the errors specified in
upgrade on UNIX platform, the issues while changing the installation trace under the
following error occurs: ownership of the installation following heading:
directory.
The user serviceuser cannot be set Fix below errors and then
as the owner of files in /usr/openv. retry
5 NetBackup host communication does NetBackup services do not have Check private key permissions as
not work when external CA is access to the private key. follows:
configured with Windows Certificate Usually, the error in this case can
Right-click the certificate. Go to All
Store and services run in a Local be seen in the nbpxyhelper
Tasks > Manage Private Keys.
Service account context. logs:
All NetBackup services should
The Windows API
have permissions to read the
CryptAcquireCertificatePrivateKey
private key.
fails with error 0x80090016:
Keyset does not exist. Run the following command to set
permissions:
nbcertcmd
-setWinCertPrivKeyPermissions
nbcertcmd -ecaHealthCheck
Troubleshooting procedures 164
Issues with the non-privileged user (service user) account
6 The setconfig command fails with Ownership of Run the following command to fix
the following error: /usr/openv/netbackup is the ownership issues:
changed to the root user.
Failed to open /usr/openv/netbackup/bin/goodies/
/usr/openv/netbackup/bp.conf.d53: Other possible reason may be update_install_folder_perms
Permission denied (13) that the language pack is installed
using rpm.
7 ■ Create or update operation fails for Service user account may not Review status code 9201 and
catalog backup policy. have access to the disaster 9202.
■ Catalog backup fails. recovery (DR) path specified in
Refer to the NetBackup Status
■ Catalog recovery fails. policy.
Codes Guide.
9 Any of the following commands fail Service user account may not Refer to the NetBackup Security
with error: Ensure that the service user have access permissions on and Encryption Guide for giving
account [service_user_name] has specified paths and their access permissions to the service
access permissions on the specified contents. user account.
paths and their contents.
■ nbdb_admin
■ nbdb_move
■ nbdb_backup
■ nbdb_restore
■ nbdb_unload
■ create_nbdb
■ cat_export
■ cat_import
Path:
For Windows -
Install_Path\netbackup\bin
10 Adding VMware server operation fails 500 system error Ensure that the temp directory
(/tmp) is accessible to the service
user account
11 Issue in bpjava-test-login File ownership is shown as 'root' Change the ownership of the file
workflow to the service user account.
13 nbcertcmd or bpnbaz fails with error The private key Ensure that NetBackup SIDs are
code 123. file (PrivKeyFile-2048.pem), configured and both public and
public key file private keys are present in
(PubKeyFile-2048.pem), AT_DATA_DIR.
or access control list (ACL)
update failed.
Troubleshooting procedures 166
Issues with the non-privileged user (service user) account
14 nbserviceusercmd -changeUser The new service user is not part Add the new service user in the
operation failed with authorization of the NBAC security admin NBAC security admin group. Run
failure, when NBAC is configured. group. the following command:
vssaz addazgrpmember
--azgrpname \"Security
Administrators\"
--prplinfo prplinfo
15 After NetBackup 9.1 installation and The user certificate directory is If NBAC or EA is enabled in your
upgrade, NetBackup Administration changed. environment, you must run the
Console login fails for root user, if bpnbat -login command after
NetBackup access control (NBAC) or NetBackup upgrade.
Enhanced Auditing (EA) is enabled.
17 Before upgrade or change user, the The service user may be deleted Do the following:
service user is deleted. because of certain user actions.
Reconfigure the user to restore the
service user. Refer to the article.
■ useradd -c 'NetBackup
Services account' -d
/usr/openv/ nbsvcusr -u
old uid
■ usermod -a -G nbwebgrp
nbsvcusr
18 During backup or restore, operation The media server version is Upgrade the media server or use
error is encountered. earlier than the client version. an alternate media server with the
version that is later or same as the
client version.
Troubleshooting procedures 167
Issues with group name format in the auth.conf file
On Windows:
install_path\NetBackup\sec\at\bin\vssat validateprpl -p user name
-d nt:domain name -b broker-host:1556:nbatd
The output of the command provides names of the groups that are associated
with the user who cannot access certain nodes or operations in the NetBackup
Administration Console.
Troubleshooting procedures 169
Troubleshooting the VxUpdate add package process
2 To access the nodes as expected, copy the group names that appear in the
command output and paste them in the auth.conf file.
Consider the following eExample:
vssat validateprpl -p [email protected] -d unixpwd -b
localhost:1556:nbatd
ValidatePrincipal :
ID : <UID>
Name : [email protected]
Domain :
Description : User
Group(s) Details :
Count : 2
GID of group1 :
GID of group2
Add the group name in the auth.conf file as per the following format:
<GRP> [email protected] ADMIN=SUM+AM JBP=ALL
■ Linux:
/usr/openv/netbackup/logs/bprd
/usr/openv/logs/vxul/nbwebservice
3 Review the log files around the approximate time of the package add attempt.
Search through both the nbwebservice and the bprd log files for the requested
VxUpdate SJA file name.
4 From the log files, determine the status code or status codes that the add
attempt received.
5 Review the recommended action for the status code in the NetBackup Status
Code Guide.
Example
The log section that is shown is from the nbwebservice log. It shows an example
of one error that can occur during the VxUpdate package add (emphasis is added
for clarity):
0,51216,495,495,10738,1633618954821,12920,229,16:5F6DBAD64588994B,393:PackagesServiceImpl.
validateCreatePackagesBulkInputs - The Package file for file path [\\nbserver_store\
vxupdate\NetBackup_9.1.2_VU_2of4\vxupdate_nb_9.1.2_windows_x64.sja] was not found, or is
not accessible to NetBackup processes on the primary server. If the file exists, it must
be in a location that is accessible to NetBackup, such as a local directory on the primary
server.,61:com.netbackup.deployment.packages.service.PackagesServiceImpl,50,51216,495,495,
10739,1633618954822,12920,229,16:5F6DBAD64588994B,11659:Raised exception The Package file
for file path [\\nbserver_store\vxupdate\NetBackup_9.1.2_VU_2of4\
vxupdate_nb_9.1.2_windows_x64.sja] was not found, or is not accessible to NetBackup
processes on the primary server. If the file exists, it must be in a location that is
accessible to NetBackup, such as a local directory on the primary server. - errorCode: 7284
Troubleshooting procedures 171
Issues with FIPS mode
The add attempt shown failed with a NetBackup Status Code 7284. The file in this
example exists, but is on a network share that is not accessible to the primary
server. NetBackup services such as bprd might not be active with an account that
has adequate permissions to read file on UNC paths or network shares.
If you place the .sja file into a Windows profile directory, such as a user's desktop,
NetBackup generates a similar error. The error is because the NetBackup services
and processes do not have adequate permissions to that location.
Review the NetBackup Status Code Guide for recommended actions.
Solution:
Use the private key with the PKCS8 format.
To check the status of entropy on the host where the NetBackup Administration
Console runs, execute the following command. If the command returns the value
less than 200, there is an entropy issue on that host.
cat /proc/sys/kernel/random/entropy_avail
Solution:
Set the USE_URANDOM option to 1 in the nbj.conf configuration file. The Java
processes start using the /dev/urandom device.
Solution:
Set the USE_URANDOM option to 1 in the configuration file. The nbwmc service starts
using the /dev/urandom device.
Solution:
Reconfigure the NetBackup CA hierarchy and use a key size that is supported for
FIPS mode - either 2048 bits or 3072 bits.
Troubleshooting procedures 173
Issues with malware scanning
Failed to connect to the SSH connection to scan host Verify the following scan host
scan host. from media server failed. credentials:
Failed to determine the Error can be due to For a complete list of supported
scan host OS. unsupported scan host. platforms for the scan host, refer
to the Software Compatibility list
document.
Failed to get the scan Media server is not able to Check that credentials for scan
host credentials. fetch the credentials to access host are specified.
scan host from the Primary.
Time-out has occurred Default scan operation time Scan time-out is configurable and
during the scan. out is two days. Time to scan can be changed by setting the
may vary depending on the MALWARE_SCAN_OPERATION_TIMEOUT
factors sch as workload type, configuration key.
network bandwidth, backup
■ Minimum value: 1 hour
size.
■ Maximum value: 30 days
Failed to mount the IA share is not accessible from Check IA configuration on storage
backup image. the scan host. server. Verify on activity monitor
that IA job is successful.
Failed to run a scan. Generic failure during the scan Refer to nbmalwarescanner
of a backup image. logs on the media server.
■ Failed to an open file. ■ Not enough space is ■ On windows scan host, check
■ Unable to create a available on the scan host. space is availability in C:\
directory. ■ SSH user does not have ■ On Linux scan host check
■ Failed to generate access to the required space is availability in /tmp
the result file. directories on the scan
■ Failed to open output host.
file.
■ Unable to create
directory for result
file.
■ Failed to open the
result file.
■ Unable to create
mount destination
directory.
■ Unable to create
directory for a log file.
All mount drives are Only five backup images can ■ Ensure that scan host is not
exhausted. be mounted at the same time part of multiple NetBackup
on windows scan host. domains.
■ Check if there are any Stale
mounts on the Scan host by
running net use.
■ Following drive letters are
used for mounting the IA
shares on the windows scan
host. Ensure that they are not
in use. L:\ M:\ N:\ O:\
P:\
Troubleshooting procedures 175
Issues with malware scanning
Either the Windows Microsoft Windows Defender Ensure that Microsoft Windows
Defender is not installed is not installed on the scan Defender is installed on scan
or the environment host or not configured host.
variable is not set. properly.
Refer NetBackup™ Web UI
Administrator’s Guide for the scan
host configuration.
Failed to perform Generic error for Scan failure. Contact NetBackup support.
malware scan of the
backup image.
Net bios name can be at Storage server hostname Ensure the character limit.
most 15 chars long. cannot be more than 15
characters for the SMB share
support.
Failed to run a scan. Generic failure during Check for the following errors:
scanning backup image.
■ Refer to nbmalwarescanner
logs on the media server.
■ Check for space on media
server storage.
■ Check for NFS service failure
on media server.
Troubleshooting procedures 176
Issues with malware scanning
Too many infected files Review the Update the date range or
in the selected time nbmalwarescanner to view recovery files and folders
range. the infected files list for the selection to reduce the number
backup images in the selected of infected files. Retry the
date range. operation. You can also perform
one of the following:
Large number of infected There are too many infected Export the infected file list in
files. files in the selected scan .csv format and download it to
result. If the scan result has view it.
infected files greater than
5000, the following message
is displayed:
Large number of
infected files. To
view the complete list
of infected files,
export the list.
Large number of
infected files.
Scan operation is divided For large size backup, scan Each divided part details can be
into parts. operation is divided into parts. obtained by using the REST API.
Options Fields
Copies: Copy2
Workaround:
To view the images that are backed up, ensure that you select the Malware scan
status option as All to scan the NAS-Data-Protection backup images created on
earlier version of NetBackup media server.
Troubleshooting procedures 178
Issues with NetBackup jobs that are enabled for data-in-transit encryption
Operation Logs
Cause
The NetBackup web service took more time to restart as a result of which the global
DTE cache of bpcd is not refreshed. It results into a failure of the given operation
while determining the DTE mode.
Troubleshooting procedures 179
Issues with NetBackup jobs that are enabled for data-in-transit encryption
Resolution
Retry the operation after 2 minutes of the service restart so that the global DTE
mode is successfully refreshed by the web service in the next iteration.
Operation Logs
Cause
Failure in retrieving the media server DTE setting from EMM, resulting in failure of
the operation.
Resolution
Retry the operation to successfully retrieve the media server DTE mode.
Troubleshooting procedures 180
Issues with NetBackup jobs that are enabled for data-in-transit encryption
Operation Logs
Cause
There is a failure while retrieving the pre-shared key that is required for TLS
handshake between hosts. This is because of either of the following issues in
bpclntcmd such as:
Resolution
Stop the existing bpclntcmd -store process and retry the operation.
Operation Logs
Jan 19, 2022 8:49:36 PM - Error bpdm (pid=18607) cannot connect to the
writing side process for duplication, Success Jan 19, 2022 9:37:02 PM - Error
(pid=1028) listen protocol error - couldn't accept from data socket,
The operation completed successfully. Jan 19, 2022 9:37:03 PM - Info bptm
(pid=1028) EXITING with status 25 <----------
Cause
When data-in-transit encryption (DTE) is enabled, vnetd process is responsible for
setting up the pre-requisites required for DTE TLS handshake. On a busy machine,
if vnetd spends more time doing this, bptm times out before vnetd forwards the
connection. As a result of this, duplication fails.
Solution
On the target host, increase the timeout for accepting connection from vnetd. Use
the nbgetconfig and nbsetconfig commands to increase the timeout of the
VNET_OPTIONS configuration option.
For example, to change the timeout from 120 seconds to 300 run the following
commands:
nbgetconfig VNET_OPTIONS VNET_OPTIONS = 120 3600 200 40 3 1 30 10
1793 32 0 0
{
"errorCode": 4001,
"errorMessage": "Failed to create the instant access mount.",
"attributeErrors": {},
"fileUploadErrors": [],
"errorDetails": [
"Failed to provision the backup. /usr/openv/pdde/vpfs/bin/vpfs_
actions failed (1): Unable to getCatalog: ('Could not get
catalog of backup
(test-mssql1_1654780591): /usr/openv/pdde/vpfs/bin/cata2map failed
(255): ', {'statusInfo': {'msgId': 'Failed to
get catalog', 'parameters': [{'type':
'string', 'name': 'backupId', 'value':
'test-mssql1_1654780591'}]}})\n"
]
}
Resolution:
Check if the MSDP engines are healthy in your AKS or EKS environment.
Alternatively, you can create a new backup and create Unstructured Data Instant
Access again using its API interface.
Chapter 3
Using NetBackup utilities
This chapter includes the following topics:
Utility Description
Network troubleshooting They verify various aspects of the network configuration inside
utilities and outside NetBackup to ensure that there is no
misconfiguration.
NetBackup support utility It queries the host and gathers appropriate diagnostic
(nbsu) information about NetBackup and the operating system.
Note: The utilities must be initiated on supported platforms. However, the utilities
can analyze debug log files from most NetBackup client and server platforms for
UNIX and Windows.
Utility Description
backupdbtrace Consolidates the debug log messages for specified NetBackup database backup jobs and
writes them to standard output. It sorts the messages by time. backupdbtrace attempts to
compensate for time zone changes and clock drift between remote servers and clients.
At a minimum, you must enable debug logging for admin on the master server, and for bptm
and bpbkar on the media server. For best results, set the verbose logging level to 5 and
enable debug logging for the following: bpdbm on the master server and bpcd on all servers
in addition to the processes already identified.
Utility Description
backuptrace Copies to standard output the debug log lines relevant to the specified backup jobs, including
online (hot) catalog backups.
The backuptrace utility can be used for regular file system, database extension, and
alternate backup method backup jobs. It consolidates the debug logs for specified NetBackup
jobs. The utility writes the relevant debug log messages to standard output and sorts the
messages by time. backuptrace attempts to compensate for time zone changes and clock
drift between remote servers and clients. The format of the output makes it relatively easy
to sort or grep by timestamp, program name, and server or client name.
The backuptrace utility works with the nbpem, nbjm, and nbrb logs on the master server.
You should enable debug logging for bpbrm and bptm or bpdm on the media server and for
bpbkar on the client. For best results, set the verbose logging level to 5. Enable debug
logging for the following: bpdbm and bprd on the master server and for bpcd on all servers
and clients in addition to the processes already identified.
bpgetdebuglog A helper program for backuptrace and restoretrace. It can also be useful as a
standalone program and is available for all NetBackup server platforms.
bpgetdebuglog prints to standard output the contents of a specified debug log file. If only
the remote machine parameter is specified, bpgetdebuglog prints the following to standard
output: the number of seconds of clock drift between the local computer and the remote
computer.
duplicatetrace Consolidates the debug logs for the specified NetBackup duplicate jobs and writes them to
standard output. It sorts the messages by time. duplicatetrace attempts to compensate
for time zone changes and clock drift between remote servers and clients.
At a minimum, you must enable debug logging for admin on the master server and for bptm
or bpdm on the media server. For best results, set the verbose logging level to 5 and enable
debug logging for the following: bpdbm on the master server and bpcd on all servers and
clients in addition to the processes already identified.
Utility Description
importtrace Consolidates the debug log messages for the specified NetBackup import jobs and writes
them to standard output. It sorts the messages by time. importtrace attempts to compensate
for time zone changes and clock drift between remote servers and clients.
At a minimum, you must enable debug logging for admin on the master server. And for
bpbrm, you must enable debug logging for bptm and tar on the media server.
For best results, set the verbose logging level to 5 and enable debug logging for the following:
bpdbm on the master server and bpcd on all servers and clients in addition to the processes
already identified.
restoretrace Copies to standard output the debug log lines relevant to the specified restore jobs.
The restoretrace utility consolidates the debug logs for specified NetBackup restore jobs.
The utility writes debug log messages relevant to the specified jobs to standard output and
sorts the messages by time. restoretrace attempts to compensate for time zone changes
and clock drift between remote servers and clients. The format of the output makes it relatively
easy to sort or grep by timestamp, program name, and server or client name.
At a minimum, you must enable debug logging for bprd on the master server. Enable debug
logging for bpbrm and bptm or bpdm on the media server and tar on the client. For best
results, set the verbose logging level to 5. Enable debug logging for bpdbm on the master
server and for bpcd on all servers and clients.
verifytrace Consolidates the debug log messages for the specified verify jobs and writes them to standard
output. It sorts the messages by time. The verifytrace command attempts to compensate
for time zone changes and clock drift between remote servers and clients.
At a minimum, you must enable debug logging as follows: for admin on the master server
and for bpbrm, bptm (or bpdm) and tar on the media server. For best results, set the verbose
logging level to 5 and enable debug logging for the following: bpdbm on the master server
and bpcd on all servers and clients in addition to the processes already identified.
UNIX /usr/openv/netbackup/logs/<PROGRAM_NAME>/log.mmddyy
Using NetBackup utilities 188
About the Logging Assistant
Windows install_path\NetBackup\Logs\<PROGRAM_NAME>\mmddyy.log
An option may be added later that allows the analyzed log files to reside on
alternate paths.
Note: For the processes that use unified logging, log directories are automatically
created.
■ The consolidated debug log may contain messages from unrelated processes.
You can ignore messages with timestamps outside the duration of the job from
the following: bprd, nbpem, nbjm, nbrb, bpdbm, bpbrm, bptm, bpdm, and bpcd.
An output line from the log analysis utilities uses the following format:
Utility Description
bptestnetconn Performs several tasks that aid in the analysis of DNS and
connectivity problems with any specified list of hosts. This list
includes the server list in the NetBackup configuration. To help
troubleshoot connectivity problems between the services that use
CORBA communications, bptestnetconn can perform and report
on CORBA connections to named services.
nbdna (NetBackup Evaluates the host names in the NetBackup domain. The nbdna
Domain Network utility self-discovers the NetBackup domain and evaluates host
Analyzer) name information, then tests connectivity to these host names and
validates their network relationship status.
UNIX /usr/openv/netbackup/bin/support/nbsu
Windows install_path\NetBackup\bin\support\nbsu.exe
Note: The NetBackup support utility (nbsu) has been updated in NetBackup 8.1.1.
The previous version of nbsu (renamed old_nbsu) is deprecated and will be removed
in a future NetBackup release. Veritas recommends use of the newer version (nbsu).
Veritas recommends that you run the NetBackup support utility (nbsu) in the following
circumstances:
■ To obtain baseline data on your NetBackup installation. If you encounter
problems later, this data can be useful.
■ To document changes in your NetBackup or operating system environment.
Run nbsu periodically to keep your baseline data up to date.
■ To help isolate a NetBackup or operating system issue.
■ To report issues to Veritas technical support.
The following suggestions can help you run the nbsu utility more effectively:
■ For a complete description of nbsu including examples and how to gather
diagnostic information to send to Veritas Technical Support, see the NetBackup
Commands Reference Guide.
If you have a case ID from Technical Support of the form ########, rename
the log files with the case ID number. Then manually upload the files to the
Veritas Evidence Server. For additional assistance, see:
http://www.veritas.com/docs/000097935
■ For troubleshooting, run nbsu when the system is in the same state as when
the problem occurred. For example, do not stop and restart the NetBackup
processes after the error occurs or make a change to the server or network. If
you do, nbsu may not be able to gather key information about the problem.
Using NetBackup utilities 191
About the NetBackup support utility (nbsu)
■ To generate the debug messages that relate to nbsu, enter the following:
# nbsu -debug
For example:
■ UNIX/Linux: NBSU_mylinuxvm_master_11072017_152100.tgz
■ Windows: NBSU_mywindowsvm_master_11072017_152100.cab
The NetBackup environment where nbsu runs determines the particular files that
nbsu creates. nbsu runs only those diagnostic commands that are appropriate to
the operating system and the NetBackup version and configuration. For each
diagnostic command that it runs, nbsu writes the command output to a separate
file. As a rule, the name of each output file reflects the command that nbsu ran to
obtain the output. For example, nbsu created the NBU_bpplclients.txt by running
the NetBackup bpplclients command and created the OS_set.txt file by running
the operating system’s set command.
Each output file begins with a header that identifies the commands that nbsu ran.
If output from more than one command was included in the file, the header identifies
the output as an “internal procedure.”
The following is an example of part of the nbsu output file for the bpgetconfig
command. The STDERR is shown as the output of the command and is captured
in the output file. Exit status is outputted into the output file as follows: Exit status:
<exit status code>
#######Command used:
/usr/openv/netbackup/bin/admincmd/bpgetconfig -g sivbl17.domain.com -L#######
Client/Master = Master
NetBackup Client Platform = Linux, RedHat2.6.18
NetBackup Client Protocol Level = 8.1.0
Product = NetBackup
Version Name = 8.1
Version Number = 810000
NetBackup Installation Path = /usr/openv/netbackup/bin
Client OS/Release = Linux 3.10.0-229.el7.x86_64
Exit status: 0
WEB_SERVER_CONNECTION_TIMEOUT = 30
WEB_SERVER_TUNNEL_USE = AUTO
WEB_SERVER_TUNNEL_ENABLED = YES
WEB_SERVER_TUNNEL
TRUSTED_MASTER
KNOWN_MASTER
MASTER_OF_MASTERS
USEMAIL =
BPBACKUP_POLICY = any
BPBACKUP_SCHED = any
Exit status: 0
If a supported archive program is available on the host where nbsu runs, nbsu
bundles its output files into an archive file. If a supported compression utility is
available, nbsu compresses the archive file. Otherwise, the individual output files
remain unarchived and uncompressed.
An example of a compressed archive file that nbsu created is as follows:
/usr/openv/netbackup/bin/support/NBSU_host1_master_01172018_220505.tgz
where host1 is the name of the host on which nbsu ran, and master indicates that
the host is a NetBackup master server. The date is embedded in the file name in
the mmddyyyy format.
nbsu supports tar for archive and gzip for compression.
.
.
Collecting OS_filesystem info
Collecting OS_process_list info
Collecting OS_set info
CAB file created successfully.
■ Queries the EMM database to obtain the primary host name, associated host
names, and server attributes for host name normalization
■ Through examination of the NetBackup configuration, identifies cluster,
application cluster and servers
■ Gathers the information on the database and catalog
■ Analyzes the consistency of the gathered configuration and database and catalog
information
■ Creates a packaged bundle for Veritas technical support to review
NBCC resides in the following location:
UNIX /usr/openv/netbackup/bin/support/NBCC
Windows install_path\NetBackup\bin\support\NBCC.exe
This section indicates the server types in EMM that NBCC detects. It begins with
“Summary of NBCC <type> processing”.
See “Example of an NBCC progress display” on page 196.
A complete description of NBCC is in the NetBackup Commands Reference Guide.
Windows install_path\NetBackup\bin\support\output
\nbcc\hostname_NBCC_timestamp
If a supported archive program is available on the host where NBCC runs, NBCC
bundles its output files into an archive file. If a supported compression utility is
available, NBCC compresses the archive file. Otherwise, the individual output files
remain unarchived and uncompressed.
An example of a compressed (UNIX) archive file that NBCC created is as follows:
/usr/openv/netbackup/bin/support/output/NBCC/host1_NBCC_20060814_
164443/host1_NBCC_20060814_164443.tar.gz
where host1 is the name of the host where NBCC had been run.
On UNIX platforms, NBCC supports the tar, compress, and gzip utilities for UNIX file
archiving and compression. On Windows platforms, NBCC supports the tar, Makecab,
and gzip utilities for Windows file archiving and compression.
A complete description of NBCC is in the NetBackup Commands Reference Guide.
server1
NBCC version 8.1 Gather mode = full
NBCC command line = C:\Veritas\NetBackup\bin\support\NBCC.exe -nozip
OS name = MSWin32
OS version = Microsoft Windows [Version 6.1.7601]
NetBackup Install path = C:\Program Files\Veritas\
> dir output\nbcc\server1_NBCC_20130227_091747 2>&1
Parsed output for "bytes free"
lidabl11
Detected Disk Pool
lidabl11_pdde_pool
...
2.6 Obtaining Disk Pool information...
Detected Disk Pool
lidabl11_pdde_pool
host
lidabl11
Detected Disk Pool lidabl11_pdde_pool member
lidabl11
...
2.7 Obtaining tpconfig Storage credential information...
Detected the master server hostname
lidabl11
and associated Storage server hostname
lidabl11
...
2.8 Obtaining tpconfig NDMP configuration information...
Detected the EMM NDMP Host hostname
fas3240a
Detected the EMM NDMP Host hostname
fas3240b
...
2.9 Analyzing EMM master and/or media servers and configured
Storage Units...
The following EMM server entries do not have configured
Storage Units or Disk Pools:
C:\Program Files\Veritas\netbackup\bin\support\output\nbccr
No repair file output directory detected.
...
Stage 1 Data collection NBCCR first collects the information that is required to
perform a repair.
Using NetBackup utilities 203
About the NetBackup consistency check repair (NBCCR) utility
Stage 2 Repair qualification Immediately before the suggested repair is applied, NBCCR
verifies that the current status of the tape still qualifies for
the requested repair. It recognizes that time has passed
and the environment may have changed since the data
was collected. If so, it reports in a history file that the repair
is not qualified.
Stage 3 Repair Finally, NBCCR performs up to three steps of repair for every
repair entry in the SRA file. An element may be modified
to enable the repair and steps may be necessary after the
repair. If the repair fails during the repair operation, NBCCR
tries to roll back the repair so that the corrective action
does not introduce any new errors.
UNIX /usr/openv/netbackup/bin/support/NBCCR
Windows install_path\NetBackup\bin\support\NBCCR.exe
NBCCR accepts one input file, creates two output files, and uses one temporary file.
Input file NBCCR accepts as input the Suggested Repair Action (SRA) file named
mastername_NBCCA_timestamptxt. Technical Support analyzes
the NBCC support package and generates this file which is sent to the
end-user. This file is placed in the following directory for NBCCR
processing:
On UNIX:
/usr/openv/netbackup/bin/support/input/nbccr/SRA
On Windows:
install_path\NetBackup\bin\support\input\nbccr\SRA
Using NetBackup utilities 204
About the NetBackup consistency check repair (NBCCR) utility
Output files NBCCR automatically creates a separate directory for each SRA file
processed. The file name is based on the contents of the SRA file. The
name of the directory is as follows:
On UNIX: /usr/openv/netbackup/bin/support/output/
nbccr/mastername_nbccr_timestamp
On Windows: install_path\NetBackup\bin\support\output\
nbccr\mastername_nbccr_timestamp.
NBCCR also creates the following output files and places them in the
same directory.
Temporary file While it runs, the NBCCR utility uses KeepOnTruckin.txt, which
appears in the same location as the output files described in this table.
The following sample NBCCR.output.txt files show the results of two MContents
repairs. One where all images were found on tape and one where one or more
images were not found on the tape:
■ Example 1: NBCCR found all images on the tape. The MContents repair action
is successful.
■ Example 2: NBCCR did not find one or more images on the tape. The MContents
repair action was not performed.
found on tape
MContents ULT000 status: ActionFailed
■ --filecopy. File copy is the default condition. It copies the entire log file. File
copy with compression is usually enough to get the job done.
■ --fast. Fast search uses a binary search to strip out the lines that are outside
the time frame of the file. This mechanism is useful for copying large log files
such as bpdbm. This option is seldom needed and should be used with caution.
The default condition is the file copy, which copies the entire log file. A fast search
algorithm uses a binary search to strip out the lines that are outside the time frame
of the file. This mechanism is useful for copying large log files such as bpdbm.
The nbcplogs utility is intended to simplify the process of copying logs by specifying
the following options:
■ A time frame for the logs.
■ The log types that you want to collect.
■ Bundling and in-transit data compression.
In addition, you can preview the amount of log data to be copied.
Using NetBackup utilities 206
About the robotic test utilities
Note: Do not use the robotic test utilities when backups or restores are active. The
tests lock the robotic control path and prevent the corresponding robotic software
from performing actions, such as loading and unloading media. If a mount is
requested, the corresponding robotic process times out and goes to the DOWN
state. This usually results in a media mount timeout. Also, be certain to quit the
utility when your testing is complete.
/usr/openv/volmgr/bin/robtest
for acstest to work on UNIX and Linux, acssel and acsssi must
be running
install_path\Volmgr\bin\robtest.exe
Note: If the robot is not configured, you cannot use robtest. You must execute
the command that applies to the robot you want to test (see following list).
Note: The nbsmartdiag service is supported only on Windows and Linux (RHEL
and SUSE) platforms.
Evidence
Evidence is a set of information which is collected to help troubleshoot the
performance of NetBackup.
A single set of evidence collected that contains:
On Windows
■ A process dump of the process exhibiting performance issues.
■ Memory Performance counters in the form of a CSV file.
■ Network Performance counters in the form of a CSV file.
■ Disk Performance counters in the form of a CSV file.
■ netstat command outputs for network issues.
On Linux
■ A process dump of the process exhibiting performance issues.
■ vmstat, free, top, etc. command output for memory details.
■ gstack, pmap of the process.
■ mpstat, iostat command output for Disk I/O details.
■ netstat command outputs for network issues.
Evidence examples:
■ Sample of collected evidence on Windows.
Using NetBackup utilities 209
About the NetBackup Smart Diagnosis (nbsmartdiag) utility
Directory of NBSD_EVIDENCE_PATH\nbsmartdiag\bpdbm\5004\Evidence1
04/08/2021 02:07 AM <DIR> .
04/08/2021 02:07 AM <DIR> ..
04/08/2021 02:08 AM 197,979,709 5004_08-04_02.07.38_Deadlock.dmp
04/08/2021 02:07 AM 4,363 5004_08-04_02.07.38_DiskPerf_Deadlock.csv
04/08/2021 02:07 AM 1,530 5004_08-04_02.07.38_MemeoryPerf_Deadlock.csv
04/08/2021 02:07 AM 5,572 5004_08-04_02.07.38_Netstat_Deadlock.log
04/08/2021 02:07 AM 23,249 5004_08-04_02.07.38_NetworkPerf_Deadlock.csv
5 File(s) 198,014,423 bytes
2 Dir(s) 188,446,031,872 bytes free
■ Sample of collected evidence on Linux:
Cmd$ ls -l /root/NBTestData/nbsd.evd/nbsmartdiag/vnetd/29696/Evidence1 total
1154144
-rw-r--r-- 1 root root 1180858264 Apr 8 15:25 29696_08-04_15.24.43_CPU.29696
-rw-r--r-- 1 root root 197 Apr 8 15:24 29696_08-04_15.24.43_CPU.DiskPerf_iostat
-rw-r--r-- 1 root root 193 Apr 8 15:24 29696_08-04_15.24.43_CPU.DiskPerf_mpstat
-rw-r--r-- 1 root root 560374 Apr 8 15:24 29696_08-04_15.24.43_CPU.MemoryPerf
-rw-r--r-- 1 root root 185787 Apr 8 15:24 29696_08-04_15.24.43_CPU.Netstat
-rw-r--r-- 1 root root 214191 Apr 8 15:24 29696_08-04_15.24.43_CPU.ProcessData
Step Description
■ Windows
■ RHEL
■ SUSE
http://www.veritas.com/docs/DOC5332
nbsmartdiag demo $
/usr/openv/netbackup/bin/nbsmartdiag
-install.
Step Description
Nbsmartdiag demo $
/usr/openv/netbackup/bin/nbsmartdiag -start
Performing the start operation.
Info:Daemon is running.
http://www.veritas.com/docs/DOC5332
Legacy logs and try file logs may include logs outside of job run time frame as those
logs do not honor the time duration filter. Logs from all the hosts that are involved
in a job hierarchy are gathered by specifying a job ID of the hierarchy. Veritas
recommends that you use time synchronization for log collection on all hosts that
are included in the job time duration. A valid job ID must be present in the Activity
monitor. By default, a job ID is removed one week after the job is completed. The
nblogadm utility cannot gather the logs of a job ID if bpdbjobs or the Activity monitor
cannot retrieve the job details of the specified job ID. In addition, the logs gathering
command line interface and API option do not support Backup Now jobs. The VxUL
logs are not gathered from a back-level media server or a client.
The gathered logs include NetBackup product and NetBackup support utility (nbsu)
logs. The log gathering supports one record ID at a time, no concurrent log gathering
from multiple record IDs.
To avoid filling up the file system on primary server, media server, and client during
log gathering, Veritas recommends that you use the KEEP_LOGS_SIZE_GB option.
Veritas recommends that you specify the size of NetBackup logs that are retained
before you gather the logs. See the NetBackup Administrator’s Guide, Volume I for
more information.
A time-based log cleanup process is introduced in NetBackup 10.2. When logs are
not removed 7 days after they are gathered, this process removes those gathered
logs and the log record. To have a shorter log retention period of 5 days on a primary
server or a media server, set the LOG_RECORD_EXPIRY_DAYS to 5 with bpsetconfig.
To have a shorter log retention period of 5 days on a client, set the
LOG_RECORD_EXPIRY_DAYS to 5 with nbsetconfig. The smaller number takes
precedence. NetBackup may not remove logs from a back-level media server or a
client if it encounters errors during the log cleanup process. Veritas recommends
that you remove the left behind logs manually when you encounter this situation.
To avoid the gathered logs filling up the file system on a primary server, a predefined
10GB free space watermark is used. NetBackup uses this watermark to check and
prevent the start of log gathering when the available disk space is less than the
sum of watermark and the estimated size of the gathered logs. Additionally, the log
gathering process stops when the available space on a primary server falls under
the sum of watermark and the estimated size of the gathered logs. In this release,
the check of available space is extended to media servers and clients. To reduce
the free space watermark to 5GB, set the HIGH_WATERMARK_TRB_LOG_RECORDS =
5 with bpsetconfig command.
You have two options to gather higher verbosity logs. You can manually enable
logging and configure the desired logging level as documented in the NetBackup
Logging Reference Guide. Or you can use the command line interface and API
option to gather and to configure the logging level values on a primary server, a
media server, or a client. Then restart the job and start a log gathering task. The
Using NetBackup utilities 213
About log collection by job ID
feature includes an API option to retrieve the job ID of a new job after the originally
specified job is restarted.
Two log record IDs are required to gather higher verbosity logs. The first log record
ID (Record ID 1) is used for enabling logging and configuring the desired logging
levels to the hosts of a job ID (Job ID 1). After logging levels are configured and
the original job (Job ID 1) is restarted, a new job ID (Job ID 2) is generated. The
second log record ID (Record ID 2) is used for gathering logs within the new restarted
job (Job ID 2) run time frame from the primary server, media server, and clients if
reachable. On a backup domain that consists of multiple media servers and clients,
the media servers or clients of Record ID 1 and Record ID 2 may not be the same
due to the job scheduling algorithms.
In NetBackup 10.2, a SHA256 checksum of each collected log is included in the
Progress.txt file of the directory shown. The checksum fails to compute on a
media server or a client with back-level of NetBackup installed.
Location of the Progress.txt file:
■ Linux and UNIX
/usr/openv/netbackup/logs/nblastaging/record ID-timestamp:
YYYYMMDD-HHMMSS
■ Windows
install_path\Veritas\NetBackup\logs\nblastaging\record
ID-timestamp: YYYYMMDD-HHMMSS
NetBackup 10.2 includes a space usage enhancement to the required log storage
space on a primary server. The log files that are gathered from a primary server, a
media server, and a client are no longer stored on the primary server. The files
reside with each host in the directory shown.
■ Linux and UNIX
/usr/openv/netbackup/logs/nblaevidence/nbla-hash
■ Windows
install_path\Veritas\NetBackup\logs\nblaevidence\nbla-hash
■ Microsoft Exchange (logs are only collected from Primary and Media servers)
■ Microsoft Cluster Server (MSCS)
■ Microsoft SQL Availability Group
■ NDMP (logs are only collected from Primary and media servers)
■ Oracle
■ Snapshot Manager (logs are only collected from Primary and media servers)
■ VMware
When you set the disableIPResolution option on a primary server, logs on the
protected virtual machines are not gathered when you specify the job ID of the
VMware workload type. See
https://www.veritas.com/content/support/en_US/doc/21902280-158271263-0/
v38310204-158271263 for more details of the setting.
Gathering logs from distributed workloads with multiple clients is supported with
this release. Examples of distributed workloads include Oracle RAC and MSSQL
Availability Groups.
You can upload the gathered logs to the Veritas Technical Support organization
with the command line interface and the API options as well as a valid support case
ID. See https://www.veritas.com/support/en_US/article.100038665 for more details.
The password that is provided to the API to upload the logs is stored as a credential
object in the NetBackup credential management pane. It is removed after logs
are uploaded.
A single tar file consisting of gathered logs is uploaded to the Veritas Technical
Support organization’s SFTP server or the specified SFTP server. If the Veritas
Technical Support organization does not manage the SFTP server, the upload
operation fails if a tar file with the same name exists on the SFTP server.
Use the nblogadm log to debug or troubleshoot log collection by job ID. Use the
nblogadm log for both the command line interface and API option. To collect logs
from the nblogadm process, confirm that the directory that is shown is present:
■ Linux and UNIX
/usr/openv/netbackup/logs/nblogadm
■ Windows
install_path\Veritas\NetBackup\logs\nblogadm
Using NetBackup utilities 215
About log collection by job ID
Table 3-6 New command line interface flags introduced to nblogadm utility
nblogadm --action createrecord Take a job ID, create an empty log record,
--jobid job ID --json and return the created record ID.
nblogadm --action startupload Create a task to upload the logs for the
--recid record ID --sftp_host sftp specified record ID and SFTP server access
host --sftp_port sftp port information.
--supportcase support case ID
--target_folder sftp host folder
--fingerprint sftp host
fingerprint, use comma as
delimiter without spaces
--passcredentials --json
nblogadm --action deleterecord Delete the collected logs and record for the
--recid record ID --json specified record ID. This action also
terminates any in-progress task.
nblogadm --action casedetail Get the log gather and the log upload task
--recid record ID --json details for the specified record ID.
nblogadm --action getlogging Get the list of hosts, their components, and
--recid record ID --json the corresponding logging level values for the
specified record ID.
Table 3-6 New command line interface flags introduced to nblogadm utility
(continued)
Warning: Before you try any of the disaster recovery procedures in this chapter,
Veritas recommends that you contact technical support.
This topic provides information about NetBackup installation and (if necessary),
catalog recovery after a system disk failure. Veritas assumes that you recover to
the original system disk or one configured exactly like it.
Disaster recovery 219
About disaster recovery requirements
Warning: NetBackup may not function properly if you reinstall and recover to a
different partition or to one that is partitioned differently due to internal configuration
information. Instead, configure a replacement disk with partitioning that is identical
to the failed disk. Then reinstall NetBackup on the same partition on which it was
originally installed.
The specific procedures that replace failed disks, build partitions and logical volumes,
and reinstall operating systems can be complicated and time consuming. Such
procedures are beyond the scope of this manual. Appropriate vendor-specific
information should be referenced.
Note: Certificates for active and inactive nodes are not recovered during catalog
recovery. Therefore, you must manually deploy certificates on all cluster nodes
using a reissue token after you install NetBackup in a disaster recovery mode.
See “Generating a certificate on a clustered primary server after disaster recovery
installation” on page 248.
Note: If a non-privileged user (or service user) account is configured, ensure that
the service account has the write access permissions on the directory where the
disaster recovery package resides.
For more information on the service user account, refer to the NetBackup Security
and Encryption Guide.
Disaster recovery 220
Disaster recovery packages
Note: Be aware that NetBackup does not support push, remote, or silent installation
for the disaster recovery of primary servers. Exception: NetBackup supports these
installation methods for hosts in a NetBackup primary server cluster.
Note: By default, the KMS configuration is not backed up during catalog backup.
Set the KMS_CONFIG_IN_CATALOG_BKUP configuration option to 1 to include
the KMS configuration as part of the disaster recovery package during catalog
backup.
Disaster recovery 221
About disaster recovery settings
Note: You must set a passphrase for the disaster recovery package for the catalog
backups to be successful.
Setting Description
Caution: Ensure that the passphrase contains only the supported characters. If
you enter a character that is not supported, you may face problems during disaster
recovery package restore. The passphrase may not be validated and you may not
be able to restore the disaster recovery package.
■ If you change the passphrase anytime, it is not changed for the previous disaster
recovery packages. Only new disaster recovery packages are associated with
the new passphrase.
■ Passphrase that you provide while you install NetBackup on the primary server
in a disaster recovery mode after a disaster must correspond to the disaster
recovery package from which you want to recover the primary server host identity.
Selecting files to back up In addition to backing up files on a regular basis, it is important to select the correct
files to back up. Include all files with records that are critical to users and the
organization. Back up system and application files, so you can quickly and accurately
restore a system to normal operation if a disaster occurs.
Include all Windows system files in your backups. In addition to the other system
software, the Windows system directories include the registry, which is needed to
restore the client to its original configuration. If you use a NetBackup exclude list for a
client, do not specify any Windows system files in that list.
Do not omit executables and other application files. You may want to save tape by
excluding these easy-to-reinstall files. However, backing up the entire application
ensures that it is restored to its exact configuration. For example, if you have applied
software updates and patches, restoring from a backup eliminates the need to reapply
them.
Bare Metal Restore NetBackup Bare Metal Restore (BMR) protects client systems by backing them up with
a policy configured for BMR protection. A complete description of BMR backup and
recovery procedures is available.
http://www.veritas.com/docs/DOC5332
Critical policies When you configure a policy for online catalog backup, designate certain NetBackup
policies as critical. Critical policies back up systems and data deemed critical to end-user
operation. During a catalog recovery, NetBackup verifies that all of the media that is
needed to restore critical policies are available.
Full backup after catalog If the configuration contains Windows clients that have incremental backup
recovery configurations set to Perform Incrementals Based on Archive Bit, run a full backup
of these clients as soon as possible after a catalog recovery. The archive bit resets on
the files that were incrementally backed up after the catalog backup that was used for
the catalog recovery. If a full backup of these clients is not run after a catalog recovery,
these files could be skipped and not backed up by subsequent incremental backups.
Disaster recovery 223
Recommended backup practices
Online catalog backups Online, hot catalog backup is a policy-driven backup that supports tape-spanning and
incremental backups. Online catalog backups may be run while other NetBackup activity
occurs, which provides improved support for environments in which continual backup
activity is typical.
Online catalog backup Veritas recommends saving the disaster recovery files that are created by the online
disaster recovery files catalog backup to a network share or removable device. Do not save the disaster
recovery files to the local computer. Catalog recovery from an online catalog backup
without the disaster recovery image file is a more complex procedure and
time-consuming procedure.
Automated recovery The catalog disaster recovery file (created during an online catalog backup) is intended
to automate the process of NetBackup recovery. If you recover a system other than
the one that originally made the backups, it should be identical to the original system.
For example, the system that performs the recovery should include NetBackup servers
with identical names to those servers where the backups were made. If not, the
automated recovery may not succeed.
Disaster recovery 224
Recommended backup practices
Online catalog disaster Configure the online catalog backup policy to email a copy of the disaster recovery
recovery information email information to a NetBackup administrator in your organization. Configure this policy as
part of every catalog backup. Do not save the disaster recovery information emails to
the local computer. Catalog recovery without the disaster recovery image file or the
disaster recovery information email available is exceedingly complex, time consuming,
and requires assistance.
NetBackup emails the disaster recovery file when the following events occur:
You may tailor the disaster recovery email process by using the mail_dr_info notify
script. More details are available.
http://www.veritas.com/docs/DOC5332
If you are not able to receive the disaster recovery packages over emails even after
you have configured your email, then ensure the following:
■ Your email exchange server is configured to have the attachment size equal to or
greater than the disaster recovery package size. You can check the size of the
package (.drpkg file size) on the disaster recovery file location that you have
specified in the catalog backup policy.
■ The firewall and antivirus software in your environment allow the files with the
.drpkg extension (which is the extension for a disaster recovery package file).
■ If BLAT is used as email notification application, it is of v2.4 or later version.
Identifying the correct catalog Ensure that you identify and use the appropriate catalog backup for your recovery. For
backup example, if you recover from your most recent backups, use the catalog from your
most recent backups. Similarly, if you recover from a specific point in time, use the
catalog backup from that specific point in time.
Catalog recovery time System environment, catalog size, location, and backup configuration (full and
incremental policy schedules) all help determine the time that is required to recover
the catalog. Carefully plan and test to determine the catalog backup methods that result
in the desired catalog recovery time.
Disaster recovery 225
About disk recovery procedures for UNIX and Linux
Primary and media server The NetBackup catalog backup protects your configuration data and catalog data. Set
backups up backup schedules for the primary servers and media servers in your NetBackup
installation. These schedules protect the operating systems, device configurations,
and other applications on the servers.
Primary or media server recovery procedures when the system disk has been lost
assume that the servers are backed up separately from the catalog backup. Backups
of primary and media servers should not include NetBackup binaries, configuration or
catalog files, or NetBackup database data.
About recovering the primary server disk for UNIX and Linux
Two procedures explain how to recover data if the system disk fails on a UNIX or
Linux NetBackup primary server, as follows:
■ The root file system is intact. The operating system, NetBackup software and
some (if not all) other files are assumed to be lost.
See “Recovering the primary server when root is intact” on page 226.
■ The root file system is lost along with everything else on the disk. This situation
requires a total recovery. This recovery reloads the operating system to an
Disaster recovery 226
About disk recovery procedures for UNIX and Linux
alternate boot disk and starts from this disk during recovery. You then can
recover the root partition without risking a crash that is caused by overwriting
the files that the operating system uses during the restore.
See “Recovering the primary server when the root partition is lost” on page 228.
For NetBackup primary and media servers, the directory locations of the NetBackup
catalog become an integral part of NetBackup catalog backups. Any recovery of
the NetBackup catalog requires identical directory paths or locations be created
during the NetBackup software reinstallation. Disk partitioning, symbolic links, and
NetBackup catalog relocation utilities may be needed.
NetBackup Bare Metal Restore (BMR) protects client systems by backing them up
with a policy configured for BMR protection. Information is available that describes
BMR backup and recovery procedures.
See the NetBackup Bare Metal Restore System Administrator's Guide:
http://www.veritas.com/docs/DOC5332
Note: For the NetBackup Web Services, you must use the same user account
and credentials that were used when you backed up the NetBackup catalog.
More information is available:
http://www.veritas.com/docs/000081350
Note: You must use the same service user account that was used when you
backed up the NetBackup catalog.
For more information on the service user account, refer to the NetBackup
Security and Encryption Guide.
Disaster recovery 227
About disk recovery procedures for UNIX and Linux
3 Install any NetBackup patches that had been previously installed. See the
documentation that was included with the patch software.
Note: Veritas does not support the recovery of a catalog image that was backed
up using an earlier version of NetBackup.
4 If the catalog directories differ from those in the NetBackup catalog backups,
recreate that directory structure on the disk before you recover the catalog.
For example, if you used symbolic links as part of the NetBackup catalog
directory structure.
5 If the recovery scenario involves restoring policy or catalog backups, the
appropriate recovery devices must be configured, which may involve the
following tasks:
■ Install and configure the robotic software for the devices that read backups
of the NetBackup catalog and regular backups of the disk being restored.
If a non-robotic drive is available that can read these backups, then no robot
is required. Although manual intervention is required if multiple pieces of
media are required.
See the NetBackup Device Configuration Guide:
http://www.veritas.com/docs/DOC5332
■ Use the NetBackup Device Configuration Wizard to discover and configure
the recovery device in NetBackup.
See the NetBackup Administrator's Guide, Volume I:
http://www.veritas.com/docs/DOC5332
■ Use the NetBackup tpautoconf command to discover and configure the
recovery device in NetBackup.
See the NetBackup Commands Reference Guide:
http://www.veritas.com/docs/DOC5332
■ Update the device mapping files.
See the NetBackup Administrator’s Guide, Volume I:
http://www.veritas.com/docs/DOC5332
6 If you must restore from the policy backups or catalog backups that were done
to media, the appropriate media may have to be configured in NetBackup
See the NetBackup Administrator’s Guide, Volume I:
http://www.veritas.com/docs/DOC5332
Configuring the media may require some or all of the following tasks:
■ Manually load the required media into a standalone recovery device.
Disaster recovery 228
About disk recovery procedures for UNIX and Linux
/usr/openv/netbackup/bin/bp.kill_all
/usr/openv/netbackup/bin/bp.start_all
9 Start the NetBackup Backup, Archive, and Restore interface (or the bp
command) and restore other files to the server as desired. When the files are
restored, you are done.
4 Install NetBackup on the alternate disk. Install only the robotic software for the
devices that are required to read backups of the NetBackup catalogs and
regular backups of the disk being restored. If a non-robotic drive can read these
backups, no robot is required.
Note: For the NetBackup Web Services, you must use the same user account
and credentials that were used when you backed up the NetBackup catalog.
More information is available:
http://www.veritas.com/docs/000081350
5 Install any NetBackup patches that had been previously installed. See the
documentation that was included with the patch software.
6 If the catalog directories differ from those in the NetBackup catalog backups,
recreate that directory structure on the disk before you recover the catalog.
For example, if you used symbolic links as part of the NetBackup catalog
directory structure.
7 If the recovery scenario involves restoring policy or catalog backups, the
appropriate recovery devices must be configured.
Device configuration may include the following tasks:
■ Install and configure the robotic software for the devices that read backups
of the NetBackup catalog and regular backups of the disk being restored.
If a non-robotic drive is available that can read these backups, then no robot
is required. Although manual intervention is required if multiple pieces of
media are required.
See the NetBackup Device Configuration Guide:
http://www.veritas.com/docs/DOC5332
■ Use the NetBackup Device Configuration Wizard to discover and configure
the recovery device in NetBackup.
See the NetBackup Administrator's Guide, Volume I.
http://www.veritas.com/docs/DOC5332
■ Use the NetBackup tpautoconf command to discover and configure the
recovery device in NetBackup.
See the NetBackup Commands Reference Guide manual:
http://www.veritas.com/docs/DOC5332
■ Update the device mapping files.
See the NetBackup Administrator’s Guide, Volume I:
http://www.veritas.com/docs/DOC5332
Disaster recovery 230
About disk recovery procedures for UNIX and Linux
8 If you must restore from the policy backups or catalog backups that were done
to media, the appropriate media may have to be configured in NetBackup.
See the NetBackup Administrator’s Guide, Volume I:
http://www.veritas.com/docs/DOC5332
Configuring the media may require some or all of the following tasks:
■ Manually load the required media into a standalone recovery device.
■ Use the NetBackup utilities such as robtest or vendor-specific robotic
control software to load media into the required recovery device or devices.
■ Use the NetBackup Volume Configuration Wizard to inventory the media
contents of a robotic device.
■ Use the vendor-specific robotic control software to load the media into the
required recovery devices.
11 Stop all NetBackup processes that you started from NetBackup on the alternate
disk.
/usr/openv/netbackup/bin/bp.kill_all
12 Maintaining the same directory structure, copy the NetBackup catalogs from
the alternate disk to the disk that you recover. These are the catalogs recovered
in step 9.
13 Make the recovered disk the boot disk again and restart the system.
Disaster recovery 231
About clustered NetBackup server recovery for UNIX and Linux
14 Start and test the copy of NetBackup on the disk that you have recovered.
/usr/openv/netbackup/bin/bp.start_all
Try the NetBackup Administration utilities. Also, try some backups and restores.
15 When you are satisfied that the recovery is complete, delete the NetBackup
database directories from the alternate disk. Or, unhook that disk, if it is a spare.
Warning: Before attempting any of the recovery procedures in this topic, contact
technical support.
Scenario Procedure
Shared disk failure See “Recovering the entire UNIX or Linux cluster” on page 234.
Cluster failure See “Recovering the entire UNIX or Linux cluster” on page 234.
Note: For the NetBackup Web Services, you must use the same user account
and credentials that are used on the other nodes of the cluster. More information
is available:
http://www.veritas.com/docs/000081350
7 Install any Maintenance Packs and patches that are required to bring the newly
installed node to the same patch level as the other cluster nodes.
8 Bring the NetBackup Resource group online on a node other than the freshly
installed node.
9 Log onto the node on which the NetBackup resource group is online and run
the following command:
/usr/openv/netbackup/bin/cluster/cluster_config -s nbu -o
add_node -n node_name
14 Check that the robot numbers and robot drive numbers for each robot are
consistent across all nodes of the cluster. Repeat for any other servers that
are connected to that robot and correct if necessary.
See the NetBackup Administrator’s Guide , Volume 1:
http://www.veritas.com/docs/DOC5332
15 Test the ability of NetBackup to perform restores using the configured devices
on the replacement node.
16 Unfreeze the NetBackup resource group.
Note: For the NetBackup Web Services, you must use the same user account
and credentials that were used when you backed up the NetBackup catalog.
More information is available:
http://www.veritas.com/docs/000081350
5 Install any Maintenance Packs and patches that are required to bring the newly
installed NetBackup server to the same patch level as the server being replaced
6 Configure required devices and media and recover the NetBackup catalogs.
See “Recovering the primary server when root is intact” on page 226.
7 Bring the NetBackup resource group on each node in turn and run the Device
Configuration Wizard to configure the devices.
Configuration information on your particular cluster is available.
Refer to the NetBackup High Availability Guide:
http://www.veritas.com/docs/DOC5332
Note: When the disk image is imported, NetBackup does not recover the original
catalog entry for the image. Instead, a new catalog entry is created.
■ Windows is intact and not corrupted. The system still starts Windows, but some
or all other partitions are lost. NetBackup software is assumed to be lost.
See “Recovering the primary server with Windows intact” on page 236.
■ All disk partitions are lost. Windows must be reinstalled, which is a total recovery.
These procedures assume that the NetBackup primary disk was running a
supported version of Windows and that the defective hardware has been
replaced.
See “Recovering the primary server and Windows” on page 239.
For NetBackup primary and media servers, the directory locations of the NetBackup
catalog become an integral part of NetBackup catalog backups. Any recovery of
the NetBackup catalog requires the identical directory paths or locations be created
before the catalog recovery.
Note: For the NetBackup Web Services, you must use the same user account
and credentials that were used when you backed up the NetBackup catalog.
More information is available:
http://www.veritas.com/docs/000081350
5 Install any NetBackup patches that had been previously installed. See the
documentation that was included with the patch software.
6 If the catalog directories differ from those in the NetBackup catalog backups,
recreate that directory structure on the disk before you recover the catalog.
Disaster recovery 237
About disk recovery procedures for Windows
8 If the recovery scenario involves restoring the policy backups or catalog backups
that were done to media, the appropriate recovery devices must be configured.
Configuring the media may involve the following actions:
■ Manually load the required media into a standalone recovery device.
■ Use NetBackup utilities such as robtest or vendor-specific robotic control
software to load media into the required recovery devices.
■ Use the NetBackup Volume Configuration Wizard to inventory the media
contents of a robotic device.
■ Use the vendor-specific robotic control software to load the media into the
required recovery devices.
10 When catalog recovery is complete, stop and restart the NetBackup services.
Use the following bpdown and bpup commands or the Services application in
the Windows Control Panel.
install_path\NetBackup\bin\bpdown
install_path\NetBackup\bin\bpup
These directories were recovered in step 10 and overwriting them with regular
backups leaves the catalogs in an inconsistent state. If the databases were
relocated using nbdb_move from install_path\NetBackupDB\data, they are
recovered in step 10 and should not be restored in step 12.
12 Restart the system, which replaces any files that were busy during the restore.
When the boot process is complete, the system is restored to the state it was
in at the time of the last backup.
Disaster recovery 239
About disk recovery procedures for Windows
Note: For the NetBackup Web Services, you must use the same user account
and credentials that were used when you backed up the NetBackup catalog.
More information is available:
http://www.veritas.com/docs/000081350
6 Install any NetBackup patches that had been previously installed. See the
documentation that was included with the patch software.
7 If the catalog directories differ from those in the NetBackup catalog backups,
recreate that directory structure on the disk before you recover the catalog.
Disaster recovery 240
About disk recovery procedures for Windows
9 If you must restore from the policy backups or catalog backups that were done
to media, the appropriate media may have to be configured in NetBackup.
See the NetBackup Administrator’s Guide, Volume I:
http://www.veritas.com/docs/DOC5332
When you configure the media, you may have to do some or all of the following:
■ Manually load the required media into a standalone recovery device.
■ Use the NetBackup utilities such as robtest or vendor-specific robotic
control software to load media into the required recovery devices.
■ Use the NetBackup Volume Configuration Wizard to inventory the media
contents of a robotic device.
■ Use the vendor-specific robotic control software to load the media into the
required recovery devices.
11 When catalog recovery is complete, stop and restart the NetBackup services.
Use the following bpdown and bpup commands, the Activity Monitor in the
NetBackup Administration Console, or the Services application in the
Windows Control Panel.
install_path\NetBackup\bin\bpdown
install_path\NetBackup\bin\bpup
These directories were recovered in step 10 and overwriting them with regular
backups leaves the catalogs in an inconsistent state. If the databases were
relocated using nbdb_move from install_path\NetBackupDB\data, they are
recovered in step 10 and should not be restored in step 12.
12 To restore all other files, do the following steps in the order presented:
■ Start the NetBackup Administration interface on the primary server.
■ Start the Backup, Archive, and Restore client interface.
■ Browse for restores and select only the partitions that were lost. Select the
system directory (typically C:\Windows), which ensures that all registry files
are restored.
■ Deselect the following directories:
install_path\NetBackup\db install_path\NetBackupDB (or relocated
NetBackup database path)
install_path\NetBackup\var
install_path\Volmgr\database
See the caution in this procedure.
■ If you reinstall Windows, select the Overwrite existing files option, which
ensures that existing files are replaced with the backups.
■ Start the restore.
13 Restart the system, which replaces any files that were busy during the restore.
When the boot process is complete, the system is restored to the state it was
in at the time of the last backup.
Disaster recovery 242
About disk recovery procedures for Windows
6 Enable debug logging by creating the following debug log directories on the
client:
install_path\NetBackup\Logs\tar
install_path\NetBackup\Logs\bpinetd
Warning: Contact technical support before you try these recovery procedures.
Note: For the NetBackup Web Services, you must use the same user account
and credentials that are used on the other nodes of the cluster. More information
is available:
http://www.veritas.com/docs/000081350
Disaster recovery 246
About clustered NetBackup server recovery for Windows
4 Ensure that the node is a member of an existing cluster and that it performs
the necessary configuration automatically.
5 Install any Maintenance Packs and patches that are required to bring the newly
installed node to the same patch level as the other cluster nodes.
6 Unfreeze the NetBackup service and verify that it can be brought up on the
replacement node.
bpclusterutil -ci
tpext
bpclusterutil -online
7 Configure required devices and media and recover the NetBackup catalogs.
8 Manually shut down and restart NetBackup on the active node.
9 Re-enable monitoring of the NetBackup resource group.
10 Verify that the NetBackup server can now be brought online on all configured
nodes.
Note: For the NetBackup Web Services, you must use the same user account
and credentials that were used when you backed up the NetBackup catalog.
More information is available:
http://www.veritas.com/docs/000081350
6 Configure required devices and media and recover the NetBackup catalogs.
See “Recovering the primary server and Windows” on page 239.
7 Bring the NetBackup resource group on each node in turn and run the Device
Configuration Wizard to configure the devices.
Configuration information on your cluster (WSFC or VCS) is available.
Refer to the NetBackup High Availability Guide:
http://www.veritas.com/docs/DOC5332
4 Use the nbcertcmd command to create a reissue token. The hostname is the
local node name. When the command runs, it displays the token string value.
A unique reissue token is needed for each cluster node.
nbcertcmd -createtoken -name token_name -reissue -host hostname
5 Use the reissue token with the nbcertcmd command to store the host certificate.
This command prompts you for the token string value. Enter the token string
from the nbcertcmd -createToken command.
nbcertcmd -getCertificate -token
Important notes
■ Catalog recovery does not recover the host identity. To restore the host identity
or disaster recovery package, you must install NetBackup in the disaster recovery
mode and import the required package. Once you have recovered the disaster
recovery package, you can recover the catalog.
■ After you have restored the disaster recovery package or the primary server
host identity, you must immediately perform catalog recovery.
See “About recovering the NetBackup catalog” on page 257.
You can restore the disaster recovery package of the NetBackup primary server
either during installation or after installation.
■ To restore the package during installation, select the disaster recovery mode
of installation.
You need to specify the disaster recovery package passphrase during installation.
If you specify a wrong passphrase or the passphrase is lost, you need to deploy
security certificates on all hosts after installation. The disaster recovery package
cannot be restored during installation. To restore the disaster recovery package
after installation, refer to the following article:
http://www.veritas.com/docs/000125933
Disaster recovery 250
About the DR_PKG_MARKER_FILE environment variable
Note: To restore the disaster recovery package of the NetBackup Appliance, use
the nbhostidentity command.
Note: This marker file should be used only in case of DR installation failures.
Important notes
■ In a clustered primary server setup:
■ The disaster recovery package contains the identity files and configuration
only for the virtual name.
■ After the DR installation, the virtual name's certificate is restored.
■ Cluster node-specific certificates and configuration options are not backed
up and therefore are not recovered. You need to redeploy or reconfigure
NetBackup or external certificates after the DR installation.
Prerequisites
If external CA-signed certificates are used in your NetBackup domain, ensure the
following:
■ The certificate file path is configured, accessible, and is the same as the one
that was backed up.
■ You have configured the required certificate revocation lists (CRL) before you
begin the disaster recovery installation, if applicable.
Refer to the NetBackup Security and Encryption Guide.
■ You have copied the required external certificates in Windows certificate store,
if applicable.
■ If an external certificate was configured on the primary server before the disaster
and DR installation fails, you can set the environment variable called
DR_PKG_MARKER_FILE to enable you to correct the external certificate
configuration towards the end of the DR installation.
See “About the DR_PKG_MARKER_FILE environment variable” on page 250.
To restore the disaster recovery package during NetBackup installation
1 Start the NetBackup software installation.
Refer to the Installing server software on Windows systems section from the
NetBackup Installation Guide.
2 On the NetBackup License Key and Server Type screen, select the Disaster
Recovery Master Server option.
3 On the NetBackup Disaster Recovery screen, specify the location of the
disaster recovery package. Click Browse to select the package location that
you want to restore.
4 Specify the passphrase that is associated with the disaster recovery package
that you want to restore.
Ensure that you specify the appropriate passphrase:
Disaster recovery 252
Restoring disaster recovery package on Windows
■ Depending on the value that is specified for the ECA_CRL_PATH option, make
the required CRLs available.
■ If ECA_CRL_PATH is not specified, NetBackup uses the CRLs from CRL
distribution point (CDP) of the peer host's certificate. Ensure that the
URLs that are available in the CDP are accessible.
■ If ECA_CRL_PATH is specified, NetBackup uses the CRLs that are
available in the directory specified for this option. Copy the valid CRLs
in the directory that you specify for ECA_CRL_PATH.
■ In case Windows certificate store was used to store the external CA-signed
and this certificate was not backed up in the DR package, you can see a
warning to configure the external CA-signed certificates. Configure the
following external certificate configuration options on the primary server as
per the values provided in the Installer or in the corresponding disaster
recovery email:
■ ECA_CERT_PATH
■ ECA_PRIVATE_KEY_PATH
■ ECA_KEY_PASSPHRASEFILE
■ ECA_TRUST_STORE_PATH
Disaster recovery 253
Restoring disaster recovery package on UNIX
■ ECA_CRL_PATH
For more information on catalog backup and external certificate configuration
options, refer to the NetBackup Administrator's Guide, Volume I.
■ In case the DR_PKG_MARKER_FILE environment variable was set before the
DR installation, the installer displays a message, which conveys that the
touch file exists. Once the external certificate configuration is done, delete
the touch file that you have set for the DR_PKG_MARKER_FILE environment
variable.
NetBackup services are started.
6 Refer to the Installing server software on Windows systems section from the
NetBackup Installation Guide.
To restore the disaster recovery package after NetBackup installation
1 Run the nbhostidentity -import -infile file_path command after
NetBackup installation.
Refer to the NetBackup Commands Reference Guide.
2 Clean up the allowed list cache and restart the NetBackup services on all hosts
in the domain.
3 Carry out this step to remove the NetBackup certificate files in the following
scenario:
NetBackup was configured to use only external CA-signed certificates before
the disaster and NetBackup was configured to use NetBackup certificates or
both NetBackup and external certificates before you manually imported the
disaster recovery package.
Run the following command to remove NetBackup certificate files:
configureWebServerCerts -removeNBCert
Important notes
■ In a clustered primary server setup:
■ The disaster recovery package contains the identity files and configuration
only for the virtual name.
Disaster recovery 254
Restoring disaster recovery package on UNIX
Prerequisites
If external CA-signed certificates are used in your NetBackup domain, ensure the
following:
■ In case of file-based external certificates, ensure that the certificate file path is
configured, accessible, and is the same as the one that was backed up.
■ If you used Windows certificate store as a certificate store before the disaster
and the certificate files were not backed up during catalog backup, you need to
manually configure the external certificate for the host after the disaster. Refer
to the following article:
https://www.veritas.com/support/en_US/article.100044249
■ You have configured the required certificate revocation lists (CRL) before you
begin the disaster recovery installation, if applicable.
For more information on the CRLs, refer to the NetBackup Security and
Encryption Guide.
■ If an external certificate was configured on the primary server before the disaster
and DR installation fails, you can set the environment variable called
DR_PKG_MARKER_FILE to enable you to correct the external certificate
configuration towards the end of the DR installation.
See “About the DR_PKG_MARKER_FILE environment variable” on page 250.
To restore the disaster recovery package during NetBackup installation
1 Start the NetBackup software installation.
Refer to the Installing server software on UNIX systems section from the
NetBackup Installation Guide.
2 When the following message appears, press Enter to continue:
Is this host a master server? [y/n] (y)
4 When the following message appears, provide the name and the path of the
disaster recovery package that you want to restore.
Enter the name of your disaster recovery package along with the
path, or type q to exit the install script:
■ Depending on the value that is specified for the ECA_CRL_PATH option, make
the required CRLs available.
■ If ECA_CRL_PATH is not specified, NetBackup uses the CRLs from CRL
distribution point (CDP) of the peer host's certificate. Ensure that the
URLs that are available in the CDP are accessible.
■ If ECA_CRL_PATH is specified, NetBackup uses the CRLs that are
available in the directory specified for this option. Copy the valid CRLs
in the directory that you specify for ECA_CRL_PATH.
7 Refer to the Installing server software on UNIX systems section from the
NetBackup Installation Guide.
To restore the disaster recovery package after NetBackup installation
1 Run the nbhostidentity -import -infile file_path command after
NetBackup installation.
Refer to the NetBackup Commands Reference Guide.
2 Clean up the allowed list cache and restart the NetBackup services on all hosts
in the domain.
3 Carry out this step to remove the NetBackup certificate files in the following
scenario:
NetBackup was configured to use only external CA-signed certificates before
the disaster and it was configured to use NetBackup certificates or both
NetBackup and external certificates before you manually imported the disaster
recovery package.
Run the following command to remove NetBackup certificate files:
configureWebServerCerts -removeNBCert
Disaster recovery 257
About recovering the NetBackup catalog
Before you recover the NetBackup catalog, you must do the following:
■ Ensure that NetBackup is running in the recovery environment.
■ Configure the recovery devices in NetBackup.
■ Ensure that the media on which the catalog backups exist are available to
NetBackup.
■ If the NetBackup primary server is part of a cluster, ensure that the cluster is
functional.
■ Restore the NetBackup host identity by restoring the disaster recovery package.
See “About restoring disaster recovery package” on page 249.
Caution: After successful catalog recovery, you must set the disaster recovery
package passphrase, because the passphrase is not recovered during the
catalog recovery.
The NetBackup catalog consists of several parts. How you recover the catalog
depends on which part or parts of the catalog you want to recover, as follows:
Recover the entire Veritas recommends that you recover the entire catalog. Doing so helps ensure consistency
catalog among the various parts of the catalog. This method is most useful for recovering a catalog
to the same environment from which it was backed up.
Recover the catalog The image database contains information about the data that has been backed up.
image files and
This type of recovery also restores the data and the metadata for the NetBackup databases
configuration files
(BMRDB, NBAZDB, and NBDB) so that it is available for further recovery processing.
See “About recovering the NetBackup catalog image files” on page 278.
Disaster recovery 258
About recovering the NetBackup catalog
Recover the NetBackup NetBackup stores information in the NetBackup database (NBDB). The metadata includes
databases information about the data that has been backed up, and about where the data is stored.
Recover the NetBackup database if it is corrupt or lost but the catalog image files exist and
are valid.
Recovery of the entire catalog or the catalog image files relies on the disaster
recovery information. That information is saved in a file during the catalog backup.
The location of the disaster recovery file is configured in the catalog backup policy.
See “NetBackup disaster recovery email example” on page 262.
If you do not have the disaster recovery file, you still can recover the catalog.
However, the process is much more difficult and time-consuming.
See “Recovering the NetBackup catalog without the disaster recovery file”
on page 305.
Note: After a catalog recovery, NetBackup freezes the removeable media that
contains the catalog backup. This operation prevents a subsequent accidental
overwrite action on the final catalog backup image on the media. This final image
pertains to the actual catalog backup itself, and its recovery is not part of the catalog
recovery. You can unfreeze the media.
See “Unfreezing the NetBackup online catalog recovery media” on page 310.
■ Creates a staging directory where the command can store the temporary copies:
On Windows: install_path\NetBackupDB\staging
On UNIX: /usr/openv/db/staging
Once the copy is made, NetBackup can back up the catalog files.
■ A child job backs up files in a single stream as follows:
■ Configuration files
■ Databases
bmrdb\
nbazdb\
nbdb\
db/images directory If the NetBackup db/images directory resides on the storage that
is the target of a symbolic link, that symbolic link must exist in the
recovery environment. The symbolic link also must have the same
target in the recovery environment.
Catalog recovery of To recover the NetBackup catalog from a clustered primary server
clustered primary server to a single primary server at a disaster recovery site, you must
create the following symbolic links on the recovery host before
you recover the catalog:
If the symbolic links and their targets do not exist, catalog recovery fails.
Because the recovery consumes job numbers, you must specify the number
before the catalog recovery.
2 Recover the NetBackup catalog.
From: [email protected]
Sent: Thursday, January 3, 2019 05:48
To: NetBackup Administrator
Subject: NetBackup Catalog Backup successful on host
primary.example.com status 0
Attachments: cat_backup_1438271286_INCR
cat_backup_1438271286_INCR.drpkg
Server
primary.example.com
NetBackup Version
8.1.X
Date
4/27/2017 05:46:45 AM
Policy
cat_backup
DR file written to
/dr/nbu_dr_file/cat_backup_1438271286_INCR
* - Primary Media
Disaster Recovery Procedure using the DR Package file and DR Image File
Important Notes:
- If new hardware is required, make sure that the devices contain
drives capable of reading the media and that the drive controllers are
capable of mounting the drives.
- Keep the passphrase associated with the DR Package file handy.
This passphrase is set before the catalog backup policy configuration
using the NetBackup Administration Console or the nbseccmd command.
- If this catalog backup is encrypted using a key from a Key Management Server,
ensure that the Key Management Server is online before doing any of the following steps.
1. Install NetBackup.
a. The installation procedure prompts you to confirm if this is a DR
scenario.
i. On the UNIX installer, you can see a prompt as "Do you want to
do a disaster recovery on this master server? [y,n] (y)".
Select "y"
ii. On the Windows installer click the
"Disaster Recovery Master Server" button.
b. The installation procedure prompts you for the master server’s DR
Package
(refer to the /dr/nbu_dr_file/cat_backup_1438271286_INCR.drpkg
mentioned earlier).
Make sure that the Master Server can access the attached DR package
file.
c. Type the passphrase associated with the Master Server’s DR Package,
when prompted.
i. The installer validates the DR package using that passphrase
ii. In case of errors in validation, the installer aborts the
operation. To work around the issue, refer to the following
article: http://www.veritas.com/docs/000125933
Disaster recovery 265
About recovering the NetBackup catalog
1. Install NetBackup.
2. Configure the devices necessary to read the media listed above.
3. Inventory the media.
4. Run
To recover from copy 1:
bpimport -create_db_info -stype AdvancedDisk -dp dp-advdisk
-dv /storage/advdisk
5. Run:
cat_export -client client1.example.com
6. Go to the following directory to find the DR image file
cat_backup_1438271286_INCR:
/usr/openv/netbackup/db.export/images/master.example.com/1438000000
7. Open cat_backup_1438271286_INCR file and find the BACKUP_ID
(for example: master.example.com_1438271286).
8. Run:
bpimport [-server name] -backupid master.example.com_1438271286
9. Run:
bprestore -T -w [-L progress_log] -C master.example.com -t 35
-p cat_backup -X -s 1438271286 -e 1438271286 /
10. Run the BAR user interface to restore the remaining image database
if the DR image is a result of an incremental backup.
11. To recover the NetBackup relational database, run:
bprecover -r -nbdb
Disaster recovery 266
About recovering the NetBackup catalog
Full backup The NetBackup database that is identified by the DR file is restored.
The images and configuration files that are identified by the disaster
recovery file are restored.
Note: If the catalog was backed up on a NAT media server, you must carry out
certain steps to establish a connection with the NAT media server before catalog
recovery.
See “Establishing a connection with NAT media server before catalog recovery”
on page 277.
Note: Full catalog recovery restores the device and the media configuration
information in the catalog backup. If you must configure storage devices during the
recovery, Veritas recommends that you recover only the NetBackup image files.
See “About recovering the NetBackup catalog image files” on page 278.
Warning: Do not run any client backups before you recover the NetBackup catalog.
To recover the entire catalog by using the NetBackup catalog recovery wizard.
1 Review the prerequisites before starting the catalog recovery.
See “Prerequisites for recovering the NetBackup catalog or NetBackup catalog
image files” on page 259.
2 If NetBackup is not running, start all of the NetBackup services by entering the
following:
■ On UNIX and Linux:
/usr/openv/netbackup/bin/bp.start_all
■ On Windows:
install_path\NetBackup\bin\bpup
3 Sign into the primary server on which you want to recovery the catalog. You
must have the Administrator role or similar permissions.
4 Start the NetBackup web UI.
5 If the catalog backup and the recovery devices are not available, do the
following:
■ Configure the necessary recovery device in NetBackup.
For tape storage or BasicDisk storage, see the NetBackup Administrator's
Guide, Volume I. For disk storage types, see the guide that describes the
option. See the following website for NetBackup documentation:
Disaster recovery 268
About recovering the NetBackup catalog
http://www.veritas.com/docs/DOC5332
■ If the catalog backup was written to an immutable (MSDP WORM) storage
server, add the storage server back to the primary server's configuration
with the CLI nbdevconfig command. See the NetBackup Commands
Reference Guide for more information about the command.
■ Make the media that contains the catalog backup available to NetBackup:
Inventory the robot or the disk pool, add the media for standalone drives,
configure the storage server and disk pool, or so on.
For tape storage or BasicDisk storage, see the NetBackup Administrator's
Guide, Volume I. For disk storage types, see the guide that describes the
option.
http://www.veritas.com/docs/DOC5332
Not successful Consult the log file messages for an indication of the problem.
Click Cancel, fix the problem, and then run the wizard again.
15 Stop and restart NetBackup services on the primary server and other hosts,
as follows:
■ On UNIX and Linux:
/usr/openv/netbackup/bin/bp.kill_all
/usr/openv/netbackup/bin/bp.start_all
■ On Windows:
install_path\NetBackup\bin\bpdown
install_path\NetBackup\bin\bpup
If the NetBackup web UI is active on any of the hosts, the command that stops
the NetBackup services shuts it down.
16 After the services are restarted, run one of the following commands:
■ If NetBackup (or host ID-based) certificates are used in your NetBackup
domain, do the following:
On a non-clustered setup:
UNIX:
/usr/openv/netbackup/bin/nbcertcmd -renewcertificate
Windows:
install_path\netbackup\bin\nbcertcmd -renewcertificate
On a clustered setup:
UNIX:
Windows:
/usr/openv/netbackup/bin/nbcertcmd -enrollCertificate
Windows:
install_path\netbackup\bin\nbcertcmd -enrollCertificate
Disaster recovery 271
About recovering the NetBackup catalog
On a clustered setup:
UNIX:
Windows:
Note: Full catalog recovery restores the device and the media configuration
information in the catalog backup. If you must configure storage devices during the
recovery, Veritas recommends that you recover only the NetBackup image files.
See “About recovering the NetBackup catalog image files” on page 278.
Warning: Do not run any client backups before you recover the NetBackup catalog.
■ Windows:
install_path\NetBackup\bin\bpup.exe
Disaster recovery 273
About recovering the NetBackup catalog
4 If the catalog backup and the recovery devices are not available, do the
following:
c Make available to NetBackup the media that contains the catalog backup:
Inventory the robot or the disk pool, add the media for standalone drives,
configure the storage server and disk pool, or so on.
■ Windows:
install_path\Veritas\NetBackup\bin\admincmd\bprecover.exe
-wizard
Please make sure the devices and media that contain catalog
disaster recovery data are available
Are you ready to continue?(Y/N)
7 Enter the fully qualified pathname to the disaster recovery file for the backup
that you want to restore. For example:
/mnt/hdd2/netbackup/dr-file/Backup-Catalog_1318222845_FULL
If the most recent catalog backup was an incremental backup, use the disaster
recovery file from the incremental backup. (There is no need to first restore
the full backup and then follow with the incremental backup.) Alternately, you
can recover from earlier version of the catalog.
If the pathname is to a valid DR file, a message similar to the following is
displayed:
vm2.example.com_1318222845
All media resources were located
Do you want to recover the entire NetBackup catalog? (Y/N)
If the DR file or the pathname is not valid, the command-line wizard exits.
8 Enter Y to continue. The following is displayed:
Do you want to startup the NetBackup relational database (NBDB)
after the recovery?(Y/N)
The image file is restored to the proper image directory and the NetBackup
databases (NBDB, NBAZDB, and optionally BMRDB) are restored and
recovered.
9 Enter Y or N to continue.
The following is displayed while the restore is in progress:
WRN - NetBackup will not run scheduled backup jobs until NetBackup
is restarted.
When the recovery job is finished, each image file is restored to the proper
image directory, and the NetBackup databases (NBDB, NBAZDB, and optionally
BMRDB) have been restored and recovered.
Disaster recovery 275
About recovering the NetBackup catalog
10 Important: After successful catalog recovery, you must set the disaster recovery
package passphrase, because the passphrase is not recovered during the
catalog recovery.
Do one of the following to set the passphrase:
■ Open the web UI. At the top, click Settings > Global security. On Disaster
recovery tab specify the passphrase.
■ Use the nbseccmd -drpkgpassphrase command to specify the passphrase.
/usr/openv/netbackup/bin/bp.kill_all
/usr/openv/netbackup/bin/bp.start_all
■ On Windows:
install_path\NetBackup\bin\bpdown
install_path\NetBackup\bin\bpup
UNIX:
/usr/openv/netbackup/bin/nbcertcmd -renewcertificate
Windows:
install_path\netbackup\bin\nbcertcmd -renewcertificate
On a clustered setup:
UNIX:
Windows:
/usr/openv/netbackup/bin/nbcertcmd -enrollCertificate
Windows:
install_path\netbackup\bin\nbcertcmd -enrollCertificate
On a clustered setup:
UNIX:
Windows:
■ Importing the backups from the backup media into the catalog
■ Write protecting the media
■ Ejecting the media and setting it aside
■ Freezing the media
Recovery recommendations
See “About NetBackup catalog recovery and symbolic links” on page 260.
Veritas recommends that you recover the catalog images files in the following
scenarios:
■ The NetBackup database is valid, but NetBackup policy, backup image, or
configuration files are lost or corrupt.
■ You recover the catalog using different storage devices. It may be to the same
environment after storage hardware failure or replacement. It may be another
site to which you replicate the catalog backups and the client backups.
Regardless, the catalog backups and the client backups reside on different
hardware.
This recovery does not overwrite the new storage device configuration with the
old, no longer valid storage device information from the catalog backup.
Full backup The image files and configuration files that are listed in the disaster
recovery file are recovered.
Disaster recovery 279
About recovering the NetBackup catalog
/usr/openv/netbackup/db/* install_path\NetBackup\db\*
/usr/openv/netbackup/db/class/* install_path\NetBackup\db\class\*
(optional) (optional)
/usr/openv/netbackup/vault/ install_path\NetBackup\vault\sessions\*
sessions*
/usr/openv/volmgr/database/* install_path\Volmgr\database\*
/usr/openv/volmgr/vm.conf install_path\Volmgr\vm.conf
Warning: Do not run any client backups before you recover the NetBackup catalog.
To recover the catalog image files using the NetBackup catalog recovery
wizard
1 Run the nbgetconfig command and save the output. This output can be used
after the catalog recovery to recover the host-specific information that is
overwritten during the catalog recovery.
For example:
■ On Windows:
install_path\NetBackup\bin\bpup
4 If the catalog backup and the recovery devices are not available, perform the
following steps:
■ Configure the necessary recovery device in NetBackup.
■ Make available to NetBackup the media that contains the catalog backup:
Inventory the robot or the disk pool, add the media for standalone drives,
configure the storage server and disk pool, or so on.
■ Create symbolic links to match those in the original environment.
See “About NetBackup catalog recovery and symbolic links” on page 260.
For tape storage or BasicDisk storage, see the NetBackup Administrator's
Guide, Volume I. For disk storage types, see the guide that describes the
option. See the following website for NetBackup documentation:
5 Open the NetBackup web UI.
6 At the top, click Settings > NetBackup catalog recovery.
Disaster recovery 281
About recovering the NetBackup catalog
7 Specify where the disaster recovery file is stored. You can browse to select
the file or enter the full pathname to the disaster recovery file.
In most cases, you specify the most recent disaster recovery information file
available. If the most recent catalog backup is an incremental backup, use the
disaster recovery file from the incremental backup. (There is no need to first
restore the full backup and then follow with the incremental backup.)
If some form of corruption has occurred, you may want to restore to an earlier
state of the catalog.
Click Next to continue.
8 NetBackup searches for the media that are required to recover the catalog. It
then informs you of the progress and locates the necessary backup ID of the
disaster recovery image. If the media is not located, NetBackup indicates which
media is needed to update the database.
If necessary, follow the wizard instructions to insert the media that is indicated
and run an inventory to update the NetBackup database. The information that
is displayed on this panel depends on whether the recovery is from a full backup
or an incremental backup.
When the required media sources are all found, click Next.
9 Select Recover only NetBackup catalog image and configuration files.
Select a Job priority if wanted and then click Next to initiate the recovery.
10 NetBackup displays the recovery progress.
Your action depends on the outcome of the recovery, as follows:
Not successful Consult the log file messages for an indication of the problem.
Click Cancel, fix the problem, and then run the wizard again.
■ Step c - Export the image header data that you want to import from the
backup.
For example, the following command exports export all image header data.
The data is exported to the netbackup/db.export directory.
cat_export -all
■ Step f- If you recovered the catalog from a disk device, you may have to
fix the disk media ID references. Run the following command:
14 Recover the host settings that you backed up in step 1. Run the following
command.
./nbsetconfig sample.txt
15 Stop and restart the NetBackup services on the primary server, as follows:
■ On UNIX and Linux:
/usr/openv/netbackup/bin/bp.kill_all
/usr/openv/netbackup/bin/bp.start_all
■ On Windows:
install_path\NetBackup\bin\bpdown
install_path\NetBackup\bin\bpup
install_path\netbackup\bin\nbcertcmd -renewcertificate
UNIX:
/usr/openv/netbackup/bin/nbcertcmd -renewcertificate
On a clustered setup:
Disaster recovery 284
About recovering the NetBackup catalog
Windows:
UNIX:
Warning: Do not run any client backups before you recover the NetBackup catalog.
See “About recovering the NetBackup catalog image files” on page 278.
To recover the catalog image files using bprecover -wizard
1 Run the nbgetconfig command and save the output. This output can be used
after the catalog recovery to recover the host-specific information that is
overwritten during the catalog recovery.
For example:
4 Start the NetBackup services on the primary server by entering the following
command:
■ On Windows:
install_path\NetBackup\bin\bpup
6 Enter Y to continue. You are prompted to enter the full path name of the disaster
recovery file, as follows:
Please specify the full pathname to the catalog disaster recovery
file:
Disaster recovery 286
About recovering the NetBackup catalog
7 Enter the fully qualified path name to the disaster recovery file for the backup
that you want to restore. For example:
/mnt/hdd2/netbackup/dr-file/Backup-Catalog_1318222845_FULL
If the most recent catalog backup was an incremental backup, use the disaster
recovery file from the incremental backup. (There is no need to first restore
the full backup and then follow with the incremental backup.) Alternately, you
can recover from earlier version of the catalog.
If you specified a DR file for a full backup, a message similar to the following
appears:
vm2.example.com_1318222845
All media resources were located
vm2.example.com_1318309224
All media resources were located
However, all of the catalog backup images up to the last full catalog
backup are restored. Then you can restore the remaining NetBackup
catalog files from the Backup, Archive, and Restore user interface.
If catalog backup images already exist, all files that were included
in the related set of catalog backups are restored.
WRN - NetBackup will not run scheduled backup jobs until NetBackup
is restarted.
11 When the recovery job is finished, each image file is restored to the proper
image directory and the configuration files are restored. If you chose to recover
the policy data and licensing data, it is restored also.
Disaster recovery 288
About recovering the NetBackup catalog
12 If you want to recover the image header information without recovering the
entire NetBackup database, perform the following steps:
■ Step a - Back up the target database. Run the following command.
nbdb_backup -online directory
Make sure that you do not specify the staging folder as the output directory.
(The staging folder contains the schema data and configuration data for
the NetBackup database from the catalog backup. Image .f and
configuration files are recovered to their final destinations.)
■ Step b - Recover the NetBackup database from the staging directory.
■ Step c - Export the image header data that you want to import from the
backup.
For example, the following command exports export all image header data.
The data is exported to the netbackup/db.export directory.
cat_export -all
■ Step f- If you recovered the catalog from a disk device, you may have to
fix the disk media ID references. Run the following command:
■ Step g - If the result of the dry run is satisfactory, run the following command:
./nbsetconfig sample.txt
16 Stop and restart NetBackup services on the primary server and other hosts,
as follows:
■ On Windows:
install_path\NetBackup\bin\bpdown
install_path\NetBackup\bin\bpup
■ On UNIX:
/usr/openv/netbackup/bin/bp.kill_all
/usr/openv/netbackup/bin/bp.start_all
install_path\netbackup\bin\nbcertcmd -renewcertificate
UNIX:
/usr/openv/netbackup/bin/nbcertcmd -renewcertificate
On a clustered setup:
Windows:
UNIX:
/usr/openv/netbackup/bin/nbcertcmd -enrollCertificate
Windows:
install_path\netbackup\bin\nbcertcmd -enrollCertificate
Clustered setup:
UNIX:
Windows:
Recover from a backup See “Recovering the NetBackup database from a backup”
on page 291.
Recover from the staging See “Recovering the NetBackup database from staging”
directory on page 296.
The database is not If the database is available and the NetBackup Scale-Out
corrupted Relational Database server is running, you do not need to create
a database. Do only step 10 and step 12 in the following procedure.
The database is Follow all of the steps in the procedure only if the NBDB database
corrupted has been corrupted or does not exist. You must create a valid,
empty database, which is included in the full procedure.
Disaster recovery 292
About recovering the NetBackup catalog
UNIX: /usr/openv/netbackup/bin/bp.kill_all
Windows: install_path\NetBackup\bin\bpdown
install_path\NetBackupDB\data\nbdb
install_path\NetBackupDB\data\nbazdb
install_path\NetBackupDB\data\bmrdb
/usr/openv/db/data/nbdb
/usr/openv/db/data/nbazdb
/usr/openv/db/data/bmrdb
4 Create the database. The command that you run depends on your scenario,
as follows:
Run the command from the following directory:
UNIX: /usr/openv/db/bin
Windows: install_path\NetBackup\bin
UNIX: /usr/openv/netbackup/bin/bp.start_all
Windows: install_path\NetBackup\bin\bpup
6 Load the default device protocols and settings into the NetBackup Enterprise
Media Manager (EMM) database by running the following command:
install_path\NetBackupDB\data\nbdb
install_path\NetBackupDB\data\nbazdb
install_path\NetBackupDB\data\bmrdb
/usr/openv/db/data/nbdb
/usr/openv/db/data/nbazdb
/usr/openv/db/data/bmrdb
UNIX: /usr/openv/volmgr/bin/ltid -v
9 If the catalog backup and the recovery devices are not available, do the
following:
http://www.veritas.com/docs/DOC5332
b Make available to NetBackup the media that contains the catalog backup:
Inventory the robot or the disk pool, add the media for standalone drives,
configure the storage server and disk pool, or so on.
http://www.veritas.com/docs/DOC5332
http://www.veritas.com/docs/DOC5332
Disaster recovery 295
About recovering the NetBackup catalog
10 Recover the catalog by running the following command on the primary server:
UNIX: /usr/openv/netbackup/bin/bp.kill_all
/usr/openv/netbackup/bin/bp.start_all
Windows: install_path\NetBackup\bin\bpdown
install_path\NetBackup\bin\bpup
/usr/openv/netbackup/bin/nbcertcmd -renewcertificate
Windows:
install_path\netbackup\bin\nbcertcmd -renewcertificate
On a clustered setup:
UNIX:
Windows:
/usr/openv/netbackup/bin/nbcertcmd -enrollCertificate
Disaster recovery 296
About recovering the NetBackup catalog
Windows:
install_path\netbackup\bin\nbcertcmd -enrollCertificate
On a clustered setup:
UNIX:
Windows:
If the command fails with the exist status 5988, refer to the following topic:
See “Steps to carry out when you see exit status 5988 during catalog recovery”
on page 311.
The database is not See “To recover the NetBackup database from staging if the
corrupted database is not corrupted” on page 297.
The database is See “To recover the NetBackup database from staging if the
corrupted database is corrupted” on page 297.
Disaster recovery 297
About recovering the NetBackup catalog
/usr/openv/netbackup/bin/bp.kill_all
/usr/openv/netbackup/bin/bp.start_all
Windows:
install_path\NetBackup\bin\bpdown
install_path\NetBackup\bin\bpup
install_path\NetBackupDB\data\nbdb
install_path\NetBackupDB\data\nbazdb
install_path\NetBackupDB\data\bmrdb
/usr/openv/db/data/nbdb
/usr/openv/db/data/nbazdb
/usr/openv/db/data/bmrdb
/usr/openv/netbackup/bin/bp.kill_all
/usr/openv/netbackup/bin/bp.start_all
Windows:
install_path\NetBackup\bin\bpdown
install_path\NetBackup\bin\bpup
6 Run the NetBackup tpext command to update the device mapping files, as
follows:
UNIX: /usr/openv/volmgr/bin/tpext -loadEMM
Windows: install_path\Volmgr\bin\tpext -loadEMM
7 If you used the nbdb_move command to relocate NetBackup database, recreate
the directory where the database was located when you backed up the catalog.
8 Start the NetBackup Device Manager, as follows:
UNIX: /usr/openv/volmgr/bin/ltid -v
Windows: Start the device manager service.
9 Run the following command on the primary server to recover NBDB from
staging:
UNIX: /usr/openv/db/bin/nbdb_restore -dbn NBDB -recover -staging
Windows: install_path\NetBackup\bin\nbdb_restore -dbn NBDB -recover
-staging
/usr/openv/netbackup/bin/bp.kill_all
/usr/openv/netbackup/bin/bp.start_all
Windows:
install_path\NetBackup\bin\bpdown
install_path\NetBackup\bin\bpup
/usr/openv/netbackup/bin/nbcertcmd -renewcertificate
Windows:
install_path\netbackup\bin\nbcertcmd -renewcertificate
On a clustered setup:
UNIX:
Windows:
/usr/openv/netbackup/bin/nbcertcmd -enrollCertificate
Windows:
install_path\netbackup\bin\nbcertcmd -enrollCertificate
On a clustered setup:
UNIX:
Windows:
If the command fails with the exist status 5988, refer to the following topic:
See “Steps to carry out when you see exit status 5988 during catalog recovery”
on page 311.
Warning: Veritas recommends that you process the NetBackup NetBackup database
only when directed to do so by Veritas Technical Support. For help with NetBackup
domain merges and splits, contact the Veritas Information Management Consulting
Services:
http://www.veritas.com/business/services/consulting_services.jsp
4 Stop the NetBackup Scale-Out Relational Database Manager service with the
following command:
install_path\NetBackup\bin\bpdown -e vrtsdbsvc_psql
6 Stop the NetBackup Scale-Out Relational Database Manager service with the
following command:
/usr/openv/netbackup/bin/nbdbms_start_stop stop
Step 1 If recovering to a primary server on which NBAC See the NetBackup Security and Encryption Guide:
is configured and operational, disable NBAC (that
http://www.veritas.com/docs/DOC5332
is, set it to PROHIBITED mode).
Step 2 Recover the NetBackup catalog from the online See “About recovering the entire NetBackup
catalog backup using the Catalog Recovery Wizard catalog” on page 266.
or the bprecover command.
Step 3 Configure NetBackup to use NBAC by setting it to See the NetBackup Security and Encryption Guide:
AUTOMATIC or REQUIRED as per the security
http://www.veritas.com/docs/DOC5332
level desired.
Step 1 If recovering to a primary server on which NBAC is See the NetBackup Security and Encryption Guide:
configured and operational, disable NBAC (that is,
http://www.veritas.com/docs/DOC5332
set it to PROHIBITED mode).
Step 3 In Windows, change the startup type of the Instructions for configuring Microsoft Windows are
NetBackup Authentication Service and NetBackup beyond the scope of the NetBackup documentation.
Authorization Service to Disabled. Refer to the appropriate Microsoft documentation.
Step 5 Recover the NetBackup catalog from the online See “About recovering the entire NetBackup catalog”
catalog backup using the bprecover command. on page 266.
Step 6 In Windows, change the startup type of the Instructions for configuring Microsoft Windows are
NetBackup Authentication Service and NetBackup beyond the scope of the NetBackup documentation.
Authorization Service to Automatic. Refer to the appropriate Microsoft documentation.
Step 7 Configure NetBackup to use NBAC. The procedure depends on the environment, as
follows:
http://www.veritas.com/docs/DOC5332
Disaster recovery 304
About recovering the NetBackup catalog
BasicDisk Make sure that the disk that contains the backup is mounted against
the correct mount path (as displayed in the disaster recovery file).
Disk pool For a catalog backup file in a disk pool, do the following:
■ Create the disk storage server for the storage by using the Storage
Server Configuration Wizard.
■ Create the disk pool for the storage by using the Disk Pool
Configuration Wizard.
■ Run the following command to synchronize the disaster recovery
file to the new disk pool.
nbcatsync -sync_dr_file disaster_recovery_file
This command recovers all disaster recovery files from the specified media ID
and places them in the specified directory. The ID can be either a tape media
ID or the fully qualified location of a disk storage unit.
4 Verify that the correct disaster recovery file is available in the specified directory
and that it is available from the NetBackup master server.
5 Continue with the normal catalog recovery procedure by running the Catalog
Recovery Wizard or bprecover command, providing the disaster recovery
file location when prompted.
Refer to the email as your primary source for recovery instructions, because
they are the most current instructions for recovering your catalog. The
instructions are sent when the catalog backup is completed, or when a catalog
backup image is duplicated.
The name of the online catalog backup policy is CatalogBackup. The email
is written to the following file:
/storage/DR/CatalogBackup_1123605764_FULL.
The file name itself indicates if the backup was full or not.
See “NetBackup disaster recovery email example” on page 262.
Disaster recovery 306
About recovering the NetBackup catalog
Note: Use this procedure only if you want to restore the minimal NetBackup catalog
information that lets you begin to recover critical data.
To recover the user-directed online catalog from the command line interface
1 Verify the location of the disaster recovery files that are created from full and
incremental hot catalog backups. These files can be stored in a specified path
of the file system on the primary server and in email attachments to the
NetBackup administrator.
2 Set up each primary server and media server in the same configuration as the
configuration that is used during the last catalog backup. The primary server
and media servers have the following same properties as the backed up catalog
configuration: name, NetBackup version, operating system patch level, and
path to storage devices.
Configure any devices and volumes you may need for the recovery.
3 Locate the latest DR image file corresponding to the backups that are used for
recovery. Open the file in an editor and find values for the following:
media_server The location of the robot or disk storage unit that is used
for catalog backup.
timestamp The four most significant digits in the DR file name and
six zeroes attached.
Example:
file: Hot_Backup_1122502016_INCR
timestamp: 1122000000
Disaster recovery 307
About recovering the NetBackup catalog
/usr/openv/netbackup/db/images/primary_server/timestamp/tmp
Windows:
C:\Program Files\VERITAS\NetBackup\db\images\primary_server
\timestamp\tmp
6 If your catalog recover media is on tape, run the vmquery command to assign
the media to the media server.
Example:
7 To recover the catalog .f file from the hot catalog backup, run a Phase II import
on the media that is specified by the disaster recovery file.
8 If your catalog backup was incremental, recover all the other catalog backup
images up to and including the most recent full catalog backup.
■ Open the Backup, Archive, and Restore client interface for NetBackup.
Select NBU-Catalog as the policy type. Set the source clients and destination
clients to your primary server.
■ Search the backups and restore all files that are located in the following
directory:
install_path/netbackup/db/images/primary_server
■ Verify that all files are restored successfully on the primary server.
Disaster recovery 308
About recovering the NetBackup catalog
9 Restore your critical data by using the Backup, Archive, and Restore client
interface or the command line.
■ Restore the catalog backup images for each media server which requires
data recovery.
■ To restore the backup images, select NBU-Catalog as the policy type.
Source and destination clients should be your primary server. Refresh your
view in the BAR GUI. Traverse the file system for the primary server to the
following:
install_path/netbackup/db/images
Restore the images for each configured media server. Verify that your
images are present by searching for them in the catalog.
10 Recover backup data from each media server in the previous step. Change
the policy type, source client, and destination client to match the client that is
used to back up the desired data. Select the wanted files from the Backup,
Archive, and Restore client interface and restore them.
11 To recover the NetBackup database, run the following:
bprecover -r -nbdb
13 Recover your policies and configuration data on each primary server and media
server.
Before recovering NetBackup policy files, ensure that you have recovered all
of your critical data, or protected the media that contains your critical data.
When policy information is recovered, NetBackup starts to run the scheduled
jobs that may overwrite the media that was written after the last catalog backup.
Open the Backup, Archive, and Restore client interface for NetBackup and
select NBU-Catalog as the policy type.
For each server to be restored, set the source clients and destination clients
to your server, starting with the primary server.
Restore all files that are backed up by the hot catalog backup on each server.
14 Clean up allowedlist cache for all hosts.
15 Stop and restart the NetBackup services on all hosts.
16 After the services are restarted, renew the certificates:
■ If NetBackup (or host ID-based) certificates are used in your NetBackup
domain, do the following:
On a non-clustered setup:
UNIX:
/usr/openv/netbackup/bin/nbcertcmd -renewcertificate
Windows:
install_path\netbackup\bin\nbcertcmd -renewcertificate
On a clustered setup:
UNIX:
Windows:
/usr/openv/netbackup/bin/nbcertcmd -enrollCertificate
Windows:
Disaster recovery 310
About recovering the NetBackup catalog
install_path\netbackup\bin\nbcertcmd -enrollCertificate
On a clustered setup:
UNIX:
Windows:
If the command fails with the exist status 5988, refer to the following topic:
See “Steps to carry out when you see exit status 5988 during catalog recovery”
on page 311.
bpimport
3 On the primary server, for each media that is identified in the disaster recovery
file or email, run the following command:
Steps to carry out when you see exit status 5988 during catalog
recovery
Use this procedure when you come across exit status 5988 during catalog backup.
To resolve the issue
1 Run the following command:
Windows: install_path\NetBackup\bin\nbcertcmd -ping
UNIX: /usr/openv/netbackup/bin/nbcertcmd -ping
■ If it is executed successfully, proceed to the next step.
■ If it fails with status 8509 (The specified server name was not found in the
web service certificate), follow the steps in this article:
https://www.veritas.com/support/en_US/article.000126751
Disaster recovery 312
About recovering the NetBackup catalog
2 Perform the user logon on the primary server. Use the following command:
install_path\netbackup\bin\bpnbat -login -loginType WEB
For example:
install_path\netbackup\bin\bpnbat -login -loginType WEB
3 Note the value of key Client_Name for the primary server. For a clustered
primary server, note the value of key Cluster_Name.
This value can be found at:
Windows:
HKEY_LOCAL_MACHINE\SOFTWARE\Veritas\NetBackup\CurrentVersion\Config
UNIX: /usr/openv/netbackup/bp.conf
This value can be either a FQDN or a short name.
For example:
abc.example.com
4 Note the host ID of the primary server. You can obtain its value by running the
following command:
install_path\netbackup\bin\nbcertcmd -listCertDetails
This command can return multiple records (if only one record is returned, select
the host ID provided in that record).
■ If the host name that was obtained in step 3 is the FQDN, pick the record
where the “Issued By” entry matches its short name.
■ If the host name that was obtained in step 3 is the short name, pick the
record where the “Issued By” entry matches its FQDN.
Disaster recovery 313
About recovering the NetBackup catalog
Example:
install_path\netbackup\bin\nbcertcmd -listCertDetails
install_path\netbackup\bin\admincmd\nbhostmgmt -a -i
78f9eed4-xxxx-4c6a-bb40-xxxxxxxxx -hm abc.example.com
abc.example.com is successfully mapped to
78f9eed4-xxxx-4c6a-bb40-xxxxxxxxx.
Alternately, you can also add this host-ID-to-host name mapping in the
NetBackup web UI. Open Security > Host mappings.
6 Do the following to renew the certificates:
■ To renew the NetBackup (or host ID-based) certificate of the primary server,
use the following command:
install_path\netbackup\bin\nbcertcmd -renewCertificate
For a clustered primary server, run the following command:
Disaster recovery 314
About recovering the NetBackup catalog