Error codes with resolution
Error codes with resolution
Review below general areas for more information on the Status Code 1:
DB backups
User_ops/dbext
Oracle: DBCLIENT log on client, RMAN script output log
Vault: vault log and vault session directory from the master
NDMP: bptm log, ndmp log from the master
■ Check the free space on directories where the files would be restored.
Destination Path
If the destination path must be changed, ensure that this destination restore path is typed with the proper path/syntax.
Regardless of the destination path, check the ownership and permissions on directories where files will be restored. For
restores of UNIX and Windows Clients, the bpcd daemon/NetBackup Client Service must have access to the directory structure
where the files will be restored to. Otherwise, a status 5 will occur.
■ Check if the destination exists or not. In case of NDMP check is the volume-qtree exists.
Page | 1
/usr/openv/netbackup/logs/user_ops/<user_name>logs
Verify the /etc/hosts file on the client should have the entries for the master/media server.
Page | 2
Migrate NetBackup Catalogs
Client images under /usr/openv/netbackup/db/images can be relocated to alternate volumes to reclaim space.
■ Verify that the system is not running out of virtual memory. If virtual memory is the problem, turn off unused applications or
increase the amount of virtual memory.
Page | 3
■ A bad network interface card on a NetBackup client or server.
■ Faulty routers.
■ The error occurs while you use the NetBackup-Java interface: The application server (bpjava processes) for the NetBackup-
Java interface probably ran out of disk space in the file system containing /usr/openv/netbackup/logs/user_ops. The application
server writes temporary files into directories in the /user_ops directory. Try clearing up disk space in the file system.
Page | 4
...
Shared VERITAS Processes
------------------------
root 827 1 0 Sep 23 ? 1:26 /opt/VRTSpbx/bin/pbx_exchange
This problem can occur in the following situation: when a process tries to connect to the NetBackup request daemon ( bprd) or
database manager daemon (bpdbm) and the daemon is not running.
Stop and restart NetBackup.
Unix
/usr/openv/netbackup/bin/bp.kill_all
/usr/openv/netbackup/bin/bp.start_all
Windows
install_path\NetBackup\bin\bpdown
install_path\NetBackup\bin\bpup
If the bpbrm or bptm process was killed the backup fails with status 40.
Status code 40 can also be due to denial of a mount request by the operator.
This status code may occur if nbjm was unable to connect to bpbrm or to bpmount.
NetBackup status code 41:
Message: network connection timed out
Explanation: The server did not receive any information from the client for too long a period of time.
The bpbkar client process has not replied to the server within the Client read timeout or Client connect timeout period. This
error occurs during incremental backups when directories have thousands of unmodified files.
For this case, use Host Properties on the NetBackup server to change Client connect timeout or Client read timeout. Else add
the following in the client’s bp.conf
CLIENT_READ_TIMEOUT or
CLIENT_CONNECT_TIMEOUT.
Page | 5
Before performing the backup Netbackup estimates the files to be backed up. In case we have directories inside directories and
directories and then files that too in large number, NetBackup will hang during the estimation. In such cases we need to exclude
such files.
Enable allow multiple data stream and see which FS is causing the issue and try excluding it or removing the unwanted data.
Else if the backup processes got killed at the client side when the backup is in progress, may be because of any other process.
At times the backup fails while backing up the system state backups with status 50.
Solaris: /var/adm/messages
HP-UX: /var/adm/syslog/syslog.log
This error can also occur if the volume is a cleaning tape but was not specified as a cleaning tape.
Another possible cause: the last available drive has a mount request for a non-backup (such as a restore). Then a
backup that requires the same drive is initiated before the mount completes. This error is due to the drive not being
reported as busy until the mount completes.
Else the drive went down during the backup.
Page | 6
Explanation: The server did not complete the connection to the client. The accept system or winsock call timed out after 60
seconds.
On a Windows NT/2000 or UNIX master server Verify NetBackup Server processes are running.
When using the NDMP protocol make sure that the Policy Type of the policy being used is set to NDMP. This is set in the
Attributes tab of the policy. Having the Policy Type set to something other then NDMP, such as Standard or MS-Windows-NT,
will result in a status code 57 when a backup job is initiated.
For catalog backups, make sure the file paths are valid entries. In the Java GUI go to the Catalog section and select Actions >
Configure NetBackup catalog. The Files tab will list the files that will be backed up as part of the catalog backup. Ensure these
are pointing to the correct path names.
■ If the server is a valid server, verify that it is in the server list on the client. If necessary add it as follows:
Page | 7
On Windows clients: Add the server on the Server to use for backups and restores drop-down in the Specify NetBackup
Machines and Policy Type dialog box. To display this dialog box, start the Backup, Archive, and Restore interface on the
client. Then click Specify NetBackup Machines and Policy Type on the File menu.
■ On UNIX and Linux, and Macintosh clients: add a SERVER entry in the bp.conf file.
If the master server entries are there in the client so you would be able to connect to the client from host properties -> clients -
> servers. So add the media server entries.
Try removing unwanted entries and the first entry should always be the master server.
If multi-streaming is enabled and a path does not exist, but is listed in the exclude list, Status
71 will be generated. This is the result of the order BPBKAR processes the backup information. With
multi-streaming enabled, the BPBKAR process first verifies that everything in the file list exists and
then checks the exclude_list.
Page | 8
In the above screen capture, if the path F:\Shared does not exist, and multi-streaming is enabled, the
backup will fail with a Status Code 71. This differs from non-multi-streamed backups, where
BPBKAR skips missing items in the file list and continues with the backup.
In case everything is fine try removing the selection list and add it again, it works in some cases.
When the backups are going to SAN using FT and settings are set to always use SAN. However the FT services on the media
server or the client are not running/hung, we get status 83.
Page | 9
Tape errors fall into two categories. Either there was an I/O error while writing or the tape was not at the correct position after
the write.
NetBackup keeps track of how much data it is sending to the operating system to write to the device. As an integrity check after
the end of each write, NetBackup will ask the tape device for its position. If this position does not match what NetBackup has
calculated the position should be, then the job will fail with a media write error.
This error will occur when NetBackup encounters an unexpected error while writing to disk. A disk full error will generally be
reported as a Status Code 129, but some disk-full conditions may be reported as the more general Status Code 84.
■ On Windows, make sure that the tapes are not write protected.
■ The wrong media type. Verify that the media matches the drive type you use.
■ Defective media. If it is defective, use the bpmedia command to set the volume to the FROZEN state so it is not used for
future backups.
■ Defective media. In this case, you may not be able to recover all the data on the media. Use the bpmedia command to set the
volume to the FROZEN state so it is not used for future backups.
■ The wrong media type. Verify that the media matches the drive type you use.
■ Defective media. In this case, some data may be lost. Use the bpmedia command to set the volume to the FROZEN state so it
is not used for future backups.
■ The wrong media type. Verify that the media matches the drive type you use.
Solaris: you may need to change one or more default System V Shared Memory settings to prevent jobs failing with the memory
allocation message, as follows:
■ For Solaris 9, the default shminfo_shmmax value is 8 megabytes. You can place the following line in your /etc/system file to
increase this setting. A value of 32 megabytes has been used in this example.
(This parameter is not applicable to Solaris 10).
set shmsys:shminfo_shmmax=33554432
■ For Solaris 9, the default shminfo_shmmni value is 100. You can place the following line in your /etc/system file to increase
this setting. A value of 220 has been used in this example. (This parameter is not applicable to Solaris 10).
Page | 10
set shmsys:shminfo_shmmni=220
Note: If you modify any of these values in the /etc/system file, you must reboot the system with boot -r for the new settings to
take effect.
Message: unable to allocate new media for backup, storage unit has none available
Explanation: The NetBackup resource broker (nbrb) did not allocate a new volume for backups. This error indicates that the
storage unit has no more volumes available in the volume pool for this backup.
Recommended Action:
■ If the storage unit is a robot with empty slots, add more volumes (remember to specify the correct volume pool).
■ If there are no empty slots, move some media to non robotic and then add new volumes.
■ If you have difficulty keeping track of your available volumes, try the available_media script:
This script lists all volumes in the volume configuration, and augments that list with information on the volumes currently
assigned to NetBackup.
■ Set up a scratch volume pool as a reserve of unassigned tapes. If NetBackup needs a new tape and none are available in the
current volume pool, it does the following:
moves a tape from the scratch pool into the volume pool that the backup uses.
To check how many volumes are currently available in the scratch pool:
Else click on media under media and device management and filter with ‘volume pool’ = “Scratch”. Check if the column robot
type shows TLD/ACS as if it shows NONE means that media is non robotic. You can also click on the particular robot no. under
media and then filter with “scratch”.
■ If the storage unit and volume pool appear to have media, verify the following:
■ Volume is not FULL, FROZEN or SUSPENDED.
Page | 11
Check for this condition by using the NetBackup Media List report. If the volume is frozen or suspended, use the bpmedia
command to unfreeze or unsuspend it (if that is wanted).
In case the media labels are failing with status 98. Verify the drives on the media server used, try using a different media
server. If the tapes are VTL tapes check all the physical adaptor links are up. Finally check the nbrb evaluation time and you
might need to restart the NetBackup services or the master server.
The NDMP backup job might fail with or without backing up any data:
Page | 12
Explanation: As NetBackup writes to the file system for the disk storage unit,
runs out of space. Until more file system space is available, images of similar size
or larger may fail with this error when written to this disk storage unit.
In a scheduled backup job that writes to a storage unit group (which contains this
disk storage unit), the following occurs: the NetBackup scheduler requests the
storage unit with the greatest available capacity when the job is retried.
Message: unable to process request because the server resources are busy
Explanation: Status code 134 is an informational message that indicates that all drives in the storage unit are currently in use. If
all drives are in use, NetBackup automatically tries another storage unit. If one is not available, NetBackup re-queues the job
with a status of 134 and retries it later.
Recommended Action: None.
The 134 code is an informational message only and is not considered an error. It can occur for a number of reasons in normal
operation. The 134 status code can occur more frequently in an SSO environment. No action is necessary.
A status 134 is not logged in the error logs. A 134 status causes a new try to appear in the Activity Monitor. It does not increase
the retry count that is associated with the allowed number of retries.
If the Status 134 error is exhibited extensively, this could indicate a possible problem in the implementation of VERITAS
NetBackup™, such as:
1. Too many jobs are running at once for the available server resources.
2. Poor network communication between hosts in the environment.
3. The library is too small to accommodate the environment’s policy configuration.
4. Multiple Storage Units are configured for the same host that also designates the same library and drives.
Page | 13
Explanation: The write to the catalog file failed because the disk that contains the catalog database is full.
Recommended Action: Free up space on the disks where NetBackup catalogs eside and retry the operation.
■ For backups using Snapshot Client and the NAS_Snapshot method, with or without SnapVault:
■ If the backup fails with status code 156, consult the bpfis legacy log, in
/usr/openv/netbackup/logs (UNIX and Linux) or
■ On Windows clients, when restoring files from a backup that is made with the NAS_Snapshot method, log into the NetBackup
Client Service as the Administrator account, not as the local system account. Otherwise, the backup fails with status 156.
If VSS/VSP are enabled a snapshot cache file would be created in the each drive before backup and in case the free space on the
drives is low and Netbackup fails to create the file. Backups might fail with 156.
Try disabling VSP/VSS from host properties - master server – client attributes – windows open file backup. In case the backups
are required with the snapshot try changing the VSP settings.
If the backups are not check pointed you can’t suspend the job. So only check pointed backups can be suspended.
During backups, NetBackup keeps track of the number of bytes sent to it from the hard drive, and pads all blocks it receives to
512 bytes. After a backup finishes, as an integrity check to confirm all data was received, NetBackup divides the total number
of bytes received by 512. If this number divides evenly by 512, it is assumed all data was correctly received. If the number
does not divide evenly by 512, then it is assumed that some amount of data was not received. NetBackup only concludes that
not all of the data was received. During a data transmission for backup, it is the responsibility of the TCP stack to re-transmit
Page | 14
any packets that it recognizes as being dropped. The problem comes when the TCP stack does not recognize that a packet has
been dropped and does not re-transmit it properly.
<32> fill_buffer: attempted to write 8 gigabytes, 1569582536 bytes, it is not a multiple of 512
The bpbkar log on the client will show how much data was sent:
Two typical causes of a Status Code 174 are Network Interface Card (NIC) drivers, and (on some platforms) a gigabit ethernet
card in a 33-mHz PCI slot.
This type of error can be very difficult to troubleshoot given the fact that the TCP stack was unable to detect the loss of data.
The system log will rarely show any possible causes of the problem. Network analysis equipment may be necessary to
determine where the packets are being dropped.
Tape/SCSI Causes
If data is not written to the storage media as requested by NetBackup, a Status Code 174 error can occur.
Example:
In such errors the image is already duplicated but Netbackup thinks its not
■ An automatic import job may cause this error. This job is an import job that shows a storage lifecycle policy name.
■ If a Vault job encountered the error responsible for the duplication, check the duplicate.log files in your sidxxx directories to
determine the root cause:
Page | 15
UNIX and Linux:
/usr/openv/netbackup/vault/sessions/vault_name/sidxxx
(where xxx is the session ID)
Message: client backup was not attempted because backup window closed
Explanation: A backup or an archive operation that the backup scheduler queued was not tried because the backup window
was no longer open.
Recommended Action: Do the following, as appropriate:
■ If possible, change the schedule to extend the backup window for this combination of policy and schedule so it does not occur
again.
■ If the backup must be run, use the Manual Backup command on the Policy menu in the Backup Policy Management window
to perform the backup.
Check if the resources are available till the backup window closes.
Within “Global Attributes” under the Master Server’s host properties, there is a setting for the “Maximum jobs per client”. This
has an effect similar to that of “Limit jobs per policy”, but on the client level.
We need to verify the multiplexing settings as well to ensure jobs will not remain in queue:
Multiplexing - Implementation
Multiplexing is configured in three places:
• Within the Storage Unit, under “Maximum multiplexing per drive”.
• Within the Policy Attributes through the “Allow Multiple Data Streams” checkbox.
• Within the Policy’s Schedule in the “Media Multiplexing” setting.
As mentioned above, the following configuration options might also be relevant when configuring multiplexing:
• Limit jobs per policy (set in Policy Attributes)
• Maximum Jobs per Client (set in Global Attributes)
• Maximum Data Streams (set in ‘Client Attributes’ within the Master Server’s host
properties)
Page | 16
NetBackup status code 213:
Policy has Storage Unit set to “Any Available” and the Drives are Down.
A status code 213 is returned for any policy where the policy’s storage unit is set to "Any available" and there are no available
drives. (A status code 219 is returned for any policy where the storage unit is specified and there are no available drives.)
Page | 17
Verify that bpcd listens on the port for connections. On a UNIX or Linux server where the storage unit is connected, if
you run netstat -a | grep bpcd, it should return something similar to the following:
*.bpcd *.* 0 0 0 0 LISTEN
Check the nbrb and the mds logs by using the vxlogview command.
On Demand Only
If the "On Demand Only" attribute is checked on a Storage Unit, please note that any policies that do not target that Storage
Unit directly will not use that Storage Unit. For example, if a policy is set to use “Any Available” Storage Unit, it will not utilize a
Storage Unit that has the “On Demand Only” box checked.
Multiplexing
In an SSO environment, when drives are shared, a job will typically requeue with a status code 134 if a drive is “over-
committed”. In some situations, due to timing, this scenario may result in a Status Code 219 (if a specific Storage Unit is
designated in the policy). To help minimize this, try increasing the “Maximum Multiplexing per drive” within the Storage Unit.
Allowing more streams per drive will increase drive availability (This also applies to non-SSO environments). Note that
Multiplexing can result in significantly longer restore times. See the Media Manager System Administrator’s Guide for more
information on this feature.
Page | 18
NetBackup status code 800:
Mostly happens when the robotic library is down on the robot and no drives are available.
Increasing the allowed threads for read to the database eliminated the status 800's.
An example of modifications that can be made are:
NUM_DB_CONN=NUM_DB_BROWSE_CONN+1
The above configuration settings increase the read threads to emm. There is only one write thread to the database. This is
done intentionally to ensure data integrity.
It is not recommended to increase the settings further unless directed to do so by a Symantec NetBackup Technical Support
representative as each additional read thread may consume between 200 to 400MB of system memory.
*** USE CAUTION WHEN MODIFYING THE emm.conf and server.conf files! Incorrect spacing or typos prevent normal process
startup for nbemm.
NetBackup status code 2001:
Message: All compatible drive paths are down but media is available
Explanation: All configured drives that can be used for this job are down.
Page | 19
RecommendedAction: Verify that ltid is running on the required media server, and the media server is active for tape. Using
device monitor, bring up the drive paths if they are down. If drives are downed again, clean the drives.
Page | 20