IBM Tivoli Monitoring Troubleshooting Guide V6.3 PDF
IBM Tivoli Monitoring Troubleshooting Guide V6.3 PDF
Version 6.3
Troubleshooting Guide
GC22-5449-00
IBM Tivoli Monitoring
Version 6.3
Troubleshooting Guide
GC22-5449-00
Note
Before using this information and the product it supports, read the information in “Notices” on page 339.
This edition applies to version 6, release 3 of IBM Tivoli Monitoring (product number 5724-C04) and to all
subsequent releases and modifications until otherwise indicated in new editions.
© Copyright IBM Corporation 2005, 2013.
US Government Users Restricted Rights – Use, duplication or disclosure restricted by GSA ADP Schedule Contract
with IBM Corp.
Contents
Figures . . . . . . . . . . . . . . xiii Diagnosing that the portal client does not
respond . . . . . . . . . . . . . . 20
Tables . . . . . . . . . . . . . . . xv Resolving storage or memory problems . . . . 21
Resolving client configuration problems . . . . 21
Historical data is missing or incorrect . . . . . 22
About this information . . . . . . . xvii Diagnosing that historical data is missing or
incorrect . . . . . . . . . . . . . . 22
Chapter 1. Introduction to Resolving warehouse proxy connection problems 23
troubleshooting . . . . . . . . . . . 1 Resolving warehouse proxy agent problems -
Sources of troubleshooting information . . . . . 1 configuration . . . . . . . . . . . . . 23
Problem classification . . . . . . . . . . . 1 Resolving warehouse proxy agent problems -
Viewing the IBM Support Portal . . . . . . . . 2 connectivity . . . . . . . . . . . . . 24
Subscribing to IBM support notifications . . . . . 2 Resolving summarization and pruning agent
Starting Manage Tivoli Enterprise Monitoring problems . . . . . . . . . . . . . . 24
Services . . . . . . . . . . . . . . . . 3 Resolving persistent data store for z/OS
problems . . . . . . . . . . . . . . 25
Chapter 2. Logs and data collection for Historical data does not get collected at the
z/OS-based agent . . . . . . . . . . . 27
troubleshooting . . . . . . . . . . . 5 A situation does not raise when expected . . . . 27
Appropriate IBM Tivoli Monitoring RAS1 trace Diagnosing that a situation does not raise when
output . . . . . . . . . . . . . . . . 5 expected . . . . . . . . . . . . . . 27
Running snapcore to collect information . . . . . 5 Resolving situation-specific problems . . . . . 28
Locating the core file . . . . . . . . . . . 6 A reflex automation script does not run when it
Getting Dr. Watson dumps and logs . . . . . . 7 should . . . . . . . . . . . . . . . . 30
KpcCMA.RAS files . . . . . . . . . . . . 7 Diagnosing that a reflex automation script does
Monitoring agent LG0 log . . . . . . . . . . 8 not run when it should . . . . . . . . . 30
Sources of other important information . . . . . 8 Resolving format and variable problems . . . . 31
High CPU usage on a distributed system . . . . 31
Chapter 3. Common problem solving 11 Diagnosing high CPU usage on a distributed
About the tools . . . . . . . . . . . . . 11 system . . . . . . . . . . . . . . . 31
I am trying to find out what software is supported 11 Resolving situation problems - diagnostic actions 32
Workspaces are missing or views are empty . . . 12 Resolving situation problems - corrective actions 33
Diagnosing that workspaces are missing or Resolving firewall problems - diagnostic actions 33
empty . . . . . . . . . . . . . . . 12 Resolving firewall problems - corrective actions 34
Resolving application support problems . . . . 13 Resolving Oracle DB Agent problems - diagnostic
Resolving monitoring server problems . . . . 14 actions . . . . . . . . . . . . . . . 34
Resolving monitoring agent problems . . . . 14 Resolving Oracle DB Agent problems - corrective
Status of a monitoring agent is mismatched between actions . . . . . . . . . . . . . . . 34
the portal client and tacmd command . . . . . 15
Diagnosing that the status of a monitoring agent Chapter 4. Tools. . . . . . . . . . . 37
is mismatched between the portal client and Trace logging . . . . . . . . . . . . . . 37
tacmd command. . . . . . . . . . . . 15 Log file locations . . . . . . . . . . . 37
Resolving monitoring agent problems . . . . 16 Installation log files. . . . . . . . . . . 42
Resolving monitoring server problems . . . . 16 Reading RAS1 logs . . . . . . . . . . . 45
The portal server does not start or stops responding 17 Setting traces . . . . . . . . . . . . . 46
Diagnosing that the portal server does not start Dynamically modify trace settings for a Tivoli
or stops responding . . . . . . . . . . 17 Monitoring component . . . . . . . . . . 61
Resolving database problems - missing table or IBM Tivoli Monitoring Service Console . . . . 61
portal server database . . . . . . . . . . 18 Starting the IBM Tivoli Monitoring service
Resolving database problems - user ID and console . . . . . . . . . . . . . . . 62
password . . . . . . . . . . . . . . 18 Service console RAS1 interface . . . . . . . 63
Resolving database problems - instance not Blocking access to the IBM Tivoli Monitoring
started . . . . . . . . . . . . . . . 19 Service Console . . . . . . . . . . . . 65
Diagnosing that portal server logon fails. . . . 20 Displaying portal server tasks in the command
The portal client does not respond. . . . . . . 20 prompt . . . . . . . . . . . . . . . . 66
Contents v
Portal desktop client does not work when Performance impacts of the HTTP and HTTPS
exporting DISPLAY . . . . . . . . . . 140 protocols . . . . . . . . . . . . . . . 155
Password problem using the LDAP Security Users who run the IBM HTTP Server do not have
option on Active Directory system . . . . . 141 permission to the content directory . . . . . . 155
Logon name not accepted after hub monitoring tacmd exportWorkspaces or importWorkspaces
server change . . . . . . . . . . . . 141 receives an out of memory error . . . . . . . 155
Using an administrator name with non-latin1 The portal server and Warehouse Proxy Agent fail
characters, cannot log onto the Tivoli Enterprise to connect to the database on a 64-bit Windows
Portal . . . . . . . . . . . . . . . 141 system . . . . . . . . . . . . . . . 156
On an Active Directory Server, sysadmin cannot Failed to log on as sysadmin with portal server
logon to the Tivoli Enterprise Portal client . . . 142 LDAP enabled . . . . . . . . . . . . . 156
Workspaces . . . . . . . . . . . . . . 142 On AIX systems, newly created users with
Workspaces opened in browser tabs are not auto-expire passwords cause installation failures . 156
displayed correctly . . . . . . . . . . 142 Linux portal server unable to FTP catalog/attribute
Data in the portal client is missing and you files . . . . . . . . . . . . . . . . 156
receive an error. . . . . . . . . . . . 144 Upgrading the Tivoli Enterprise Portal Server takes
Several enterprise workspaces are returning an a long time . . . . . . . . . . . . . . 156
error, KFWITM217E:Request Running the Tivoli Management Services
Error,SQL1_CreateRequest Failed, rc=350 . . . 145 Discovery Library Adapter, results in a book that
Link from Managed System Status workspace 145 does not contain the fully qualified host name . . 156
Historical data . . . . . . . . . . . . . 146 Portal server performance is slow . . . . . . 157
At the bottom of each view, you see a historical Cannot create a Tivoli Enterprise Portal Server
workspace KFWITM217E error . . . . . . 146 database . . . . . . . . . . . . . . . 157
Historical UADVISOR situations are started on You receive a KFW error when a query is sent to
the agent if historical collection is configured to more than 200 managed systems . . . . . . . 158
collect data . . . . . . . . . . . . . 146 Non-hub situations are not associated at the Tivoli
Clicking on the Timespan icon for one view Enterprise Portal Server level . . . . . . . . 159
brings up the data for another view . . . . . 146 Non-root stopping or starting agents causes
Historical Collection Configuration window problems . . . . . . . . . . . . . . . 159
error after changing warehouse database from Root password is not accepted during non-root
Oracle to DB2 on Linux or UNIX . . . . . . 146 Tivoli Enterprise Portal Server configuration . . . 159
Situations. . . . . . . . . . . . . . . 147 Corba user exception is included in the portal
Situations are not firing . . . . . . . . . 147 server log when creating situations . . . . . . 159
Restarting a situation for pure event does not Stopping or starting the eWAS subcomponent of
clear it from console . . . . . . . . . . 147 the portal server . . . . . . . . . . . . 159
Multiple events that occur at the same time are Setting the TEPS/e maximum heap size . . . . 160
loaded too slowly . . . . . . . . . . . 148
You cannot paste non-ASCII characters in the Chapter 9. Monitoring server
Situation editor . . . . . . . . . . . . 148 troubleshooting . . . . . . . . . . 161
Situation editor cannot display advanced advice
Installed packages exceed the 512 maximum . . . 161
help files . . . . . . . . . . . . . . 148
Messages related to the index file are displayed
Installation of situation data fails due to I/O on
when the agent fails back to a remote monitoring
VSAM data sets . . . . . . . . . . . 148
server . . . . . . . . . . . . . . . . 162
Help is not displayed. . . . . . . . . . . 149
A generic RPC communications error is received
Data is not returned to the portal client . . . . 149
when issuing a long-running tacmd execute
Cannot select the Create new group icon within the
command . . . . . . . . . . . . . . 162
Object group editor . . . . . . . . . . . 150
Troubleshooting monitoring server problems on
Monitoring agents show in an unexpected position
distributed systems . . . . . . . . . . . 162
in the Navigator . . . . . . . . . . . . 150
SOAP command failures . . . . . . . . 162
HEAPDUMPs and JAVACore files are placed on
Exposure of passwords in the clear . . . . . 163
desktops when running in browser mode . . . . 151
Receive a seeding failed message . . . . . . 163
Category and Message field of the universal
High monitoring server CPU after restarting
message does not accept DBCS . . . . . . . 153
with Warehouse Proxy Agents configured . . . 163
Agents display offline in the portal client but they
High virtual memory usage on Linux 64-bit
are still started . . . . . . . . . . . . . 153
systems . . . . . . . . . . . . . . 164
Browser client locale configuration . . . . . . 153
Upgrade inconsistency between the History and
Cannot load a ws_pres.css file in order to select a
Object windows . . . . . . . . . . . 165
language other than English . . . . . . . . 154
Attribute groups started for collection on the
managed systems should not be available on the
Chapter 8. Portal server monitoring server list. . . . . . . . . . 165
troubleshooting . . . . . . . . . . 155
vi IBM Tivoli Monitoring: Troubleshooting Guide
To decrypt a password, Kshsoap client failure on UNIX-based systems 172
KDS_VALIDATE_EXT='Y' is required . . . . 165 tacmd login fails after hub monitoring server is
Remote Tivoli Enterprise Monitoring Server recycled . . . . . . . . . . . . . . 173
consumes high CPU when large number of tacmd and SOAP are not able to connect . . . 173
agents connect . . . . . . . . . . . . 165 tacmd login fails when monitoring server is
Unable to start the Tivoli Enterprise Monitoring configured with LDAP authentication . . . . 174
Server after the kdsmain process is terminated Correcting tacmd login error after failover to
abnormally . . . . . . . . . . . . . 166 standby monitoring server . . . . . . . . 175
THRESHOLDS.XML and Tivoli Enterprise Tacmd login command fails with error
Monitoring Server table not cleaned when KUIC00017E. . . . . . . . . . . . . 175
managed system override is removed . . . . 166 In a hot standby (FTO) environment, commands
Situations fail to trigger for attributes by to a mirror hub might not return hub records
applying group function. . . . . . . . . 166 after reconnection . . . . . . . . . . . 176
Monitoring server application support A deleted object is redisplayed when two hot
completes all seeding functions but might crash standby (FTO) hubs reconnect . . . . . . . 176
as the program is exiting . . . . . . . . 166 Troubleshooting monitoring server problems on
Some agents are displayed in the Service z/OS systems . . . . . . . . . . . . . 176
Console list that are not accessible from that Receive Program KDFCINIT and Program
user interface . . . . . . . . . . . . 167 FAXCMON messages. . . . . . . . . . 177
The system crashes when attempting a bulk The Tivoli Enterprise Monitoring Server start
import or export command . . . . . . . . 168 task (CANSDSST default) encountered error
Monitoring server fails to start, but then does message 'KLVST044 LOADLIST MEMBER NOT
after a reboot . . . . . . . . . . . . 168 FOUND IN RKANPAR DATASET (KDSLLIST)
Remote monitoring server lost connection to the KppLLIST KLVST001 CANDLE ENGINE
hub and all agents display offline . . . . . 168 INITIALIZATION ERROR(S), ABEND U0012' in
After the set timeout, the Tivoli Enterprise the RKLVLOG at startup . . . . . . . . 177
Monitoring Server is still pending . . . . . 168 KDS Parameters not generated from the batch
Providing the wrong path to configuration files parm deck . . . . . . . . . . . . . 178
during LDAP configuration causes the Tivoli Cannot encrypt text. A call to CSNBSYE failed.
Enterprise Portal login window to hang . . . 169 Cannot encrypt contents of keyfile . . . . . 178
Crash on Linux remote monitoring server The error “KLVST005 MVS JOBSTEP
during hub failover to Hot Standby . . . . . 169 AUTHORIZATION REQUIRED KLVST001
HUB Tivoli Enterprise Monitoring Server CANDLE ENGINE INITIALIZATION
quiesce prevents the display of the data ERROR(S), ABEND U0012 CSV019I - Required
collected by the attached Tivoli Enterprise module KLVSTWTO not accessed, it is not APF
Monitoring Agents . . . . . . . . . . 169 Authorized (RKANMODL) CSV028I - ABEND
During installation of a remote Tivoli Enterprise 306-0C” occurs in the z/OS monitoring server
Monitoring Server on a Windows system, the RKLVLOG during startup . . . . . . . . 179
agent support is applied, but fails . . . . . 170 The error “KLVSQ000 carved mode in effect for
Using a Deploy Group with addSystem or extended storage” occurred in the RKLVLOG
updateAgent commands. . . . . . . . . 170 during startup . . . . . . . . . . . . 179
Tivoli Enterprise Monitoring Server requires Error message 'KDSMA013 OPEN VTAM for
restart if you issue itmcmd server stop/start VDM1APPL failed with status 8' occurs in the
commands when you are already logged on . . 170 Tivoli Enterprise Monitoring Server start task
Log indicates hub monitoring servers are down (CANSDSST default) . . . . . . . . . . 179
when they are up . . . . . . . . . . . 170
The Platform view in the Manage Tivoli Chapter 10. Automation server
Enterprise Monitoring Services panel shows the troubleshooting . . . . . . . . . . 181
Tivoli Enterprise Monitoring Server as running
Log and environment files . . . . . . . . . 181
as a 32 bit application, but my agents are shown
OSLC-PM service provider fails to connect to
as running as 64 bit applications . . . . . . 171
Registry Services application . . . . . . . . 182
Tivoli Enterprise Monitoring Server does not
OSLC-PM service provider fails to start . . . . 184
release memory after running a large SQL query 171
Linux automation server fails to start when quickly
SQL queries with more than 200 OR predicates
recycled . . . . . . . . . . . . . . . 185
do not complete . . . . . . . . . . . 172
Shutdown takes a long time . . . . . . . . 185
Tivoli Enterprise Monitoring Server aborts
unexpectedly when exiting the telnet session
used to start it . . . . . . . . . . . . 172 Chapter 11. Authorization Policy
KCIIN0084E Timeout appears while waiting for Server troubleshooting . . . . . . . 187
Tivoli Enterprise Monitoring Server to start on Trace and log information . . . . . . . . . 187
AIX 5.3 . . . . . . . . . . . . . . 172
Contents vii
Setting a trace for the dashboard data provider The target host name, platform, and version
or policy client . . . . . . . . . . . . 187 information is not displayed for the deployment
Authorization Policy Server trace setting . . . 188 status in the CLI or the workspace . . . . . 217
pdcollect tool . . . . . . . . . . . . 189 Deploy cannot tell if the installation failed. . . 217
Audit logs for the Authorization Policy Server 191 Warehouse agents . . . . . . . . . . . . 218
Validate correct configuration and operation . . 193 Unable to configure the Warehouse Proxy agent
Authorization Policy Server startup failure . . . 194 with modified parameters from the Tivoli
Authorization failures using the Policy CLI Enterprise Portal GUI . . . . . . . . . 218
commands . . . . . . . . . . . . . . 195 Self-describing agent . . . . . . . . . . . 218
CLI tivcmd commands for dashboard authorization Getting diagnostic information . . . . . . 219
diagnostics . . . . . . . . . . . . . . 195 Solving common problems . . . . . . . . 220
Policy distribution issues . . . . . . . . . 197 Self-describing agent operations no longer
function for remote monitoring server in a hot
Chapter 12. Infrastructure standby environment . . . . . . . . . . 221
management dashboards Local history migration tools move the agent
operation logs to multiple agent history locations . 222
troubleshooting . . . . . . . . . . 203 Unreadable tool tip information for Available EIF
Server dashboard trace settings . . . . . . . 203 Receivers list of the Situation editor . . . . . . 222
Authentication is required in web application . . 204 Unable to locate the file name of an exported
Resource not available or not authorized to see situation that begins with numerals . . . . . . 223
dashboards . . . . . . . . . . . . . . 204 Testing the connection to the Tivoli Data
Situation event results don't display . . . . . . 206 Warehouse database is valid even with an invalid
Dashboards and portal client show different password . . . . . . . . . . . . . . . 223
resources . . . . . . . . . . . . . . . 207 Configured non-root user agent starts up as root 223
No data or only partial data is displayed . . . . 208 Large historical collections slow monitoring agents 223
User's authorization policy updated but not Unable to access History Collection Configuration
showing in dashboards . . . . . . . . . . 209 for any agent . . . . . . . . . . . . . 223
Situation events not updated after changes to Agent names and icons are displayed incorrectly 224
security policy or group membership . . . . . 210 64 bit monitoring agents are not started . . . . 224
Unable to communicate with data provider . . . 210 Errors in the configuration xml file . . . . . . 224
Subnode Limitations for autonomous function . . 225
Chapter 13. Monitoring agent Installing pre-v6.2.1 Monitoring Agent for
troubleshooting . . . . . . . . . . 211 Windows OS onto a v6.2.1 or later monitoring
Startup failure with older agents . . . . . . . 211 server inadvertently unconfigures the monitoring
Command-line interface . . . . . . . . . . 212 server . . . . . . . . . . . . . . . . 226
Historical data . . . . . . . . . . . . . 213 Calendar entries overlap. . . . . . . . . . 226
Historical data is not collected for Agent Receive an error when deploying an System
Operations Log and ITM Audit . . . . . . 213 Service Monitor agent . . . . . . . . . . 226
Take action commands and reflex automation . . 213 The Agent Service Interface is not globalized . . . 227
Workspaces . . . . . . . . . . . . . . 214 Some attribute group names are unintelligible from
A workspace view is showing an error . . . . 215 the History Collection Configuration window . . 227
OS agents . . . . . . . . . . . . . . 215 History collection fails to display the most recent
Linux OS agent fails to start . . . . . . . 215 24 hours of data . . . . . . . . . . . . 227
OS agent start command fails . . . . . . . 215 Situations with attributes from more than 1 group
Specific events are not monitored by the not supported with autonomous agent . . . . . 227
Windows OS agent . . . . . . . . . . 215 Failure when importing situation xml file edited
32-bit Agent Builder agent will not start on with WordPad . . . . . . . . . . . . . 227
64-bit Windows with System Monitor Printer details of another system are displayed on
Agent-installed OS Agent . . . . . . . . 216 the Tivoli Enterprise Portal . . . . . . . . . 228
OS agent restarted unexpectedly on heavily CTIRA_MAX_RECONNECT_TRIES environment
loaded systems . . . . . . . . . . . . 216 variable is now obsolete . . . . . . . . . . 228
Binary Path attribute of the Windows OS agent Agent goes offline after removing history path . . 228
does not show a value . . . . . . . . . 216 Override button is not present for a situation. . . 228
Tivoli Enterprise Portal data for UNIX OS and Agent's Management Definition View columns are
Linux OS agents is not updated after stopping not showing data . . . . . . . . . . . . 228
the disk . . . . . . . . . . . . . . 216 There is a situation distribution discrepancy if
Installing backlevel Windows OS agent on there is a hub monitoring server outage when one
existing environment causes monitoring server or more remote monitoring servers remain active . 229
not to start . . . . . . . . . . . . . 217 Installing v6.2.2 agent application support on a
monitoring server for a prior release causes agents
to fail . . . . . . . . . . . . . . . . 229
Contents ix
Situations deleted from the CLI are still listed Endpoint fails to connect to monitoring server
on Tivoli Enterprise Portal Situation editor . . 251 when running createnode from a monitoring
The tacmd addBundles command returns an server in a different domain . . . . . . . 261
unexpected KUICAB010E error message . . . 251 The tacmd getdeploystatus command is not
Missing options for login -stdin results in returning status return codes . . . . . . . 262
unexpected behavior . . . . . . . . . . 251 tacmd createSit does not send errors if you
A system error occurs with the tacmd mistype the name of an attribute . . . . . . 262
editsystemlist -e command . . . . . . . . 251 tacmd viewUser . . . . . . . . . . . 262
Problem running the tacmd listsystemlist -d itmcmd commands . . . . . . . . . . . 262
command on Linux systems . . . . . . . 252 The itmcmd config -A hd command asks for
Problem running the tacmd listSystems Database Table Partitioning. . . . . . . . 263
command on Linux and UNIX . . . . . . 252 tivcmd commands. . . . . . . . . . . . 263
You receive a message when using a tacmd
command related to agents . . . . . . . . 253 Chapter 15. Performance Analyzer
Improving tacmd command response time when troubleshooting . . . . . . . . . . 265
using VMWare . . . . . . . . . . . . 253
Enabling logging for the agent . . . . . . . 265
Commands with embedded single quotation
Enabling logging for the monitoring portal . . . 266
marks fail . . . . . . . . . . . . . 253
Installation and configuration issues. . . . . . 266
Reconfiguring an agent and then getting the
Problems after upgrading . . . . . . . . . 267
deploy status yields a port number message . . 253
Tivoli Performance Analyzer graphical user
tacmd removeBundles command returns
interface for Tivoli Enterprise Portal fails when
unnexpected KUICRB010E error message . . . 253
downloading tasks list . . . . . . . . . . 268
The suggestbaseline or acceptbaseline
When tasks are started and when you should see
commands fail . . . . . . . . . . . . 254
data in the workspaces . . . . . . . . . . 268
Overrides set against an agent cannot be deleted
No data is displayed in the workspaces . . . . 269
from the command line . . . . . . . . . 254
The Tivoli Performance Analyzer workspaces are
Unexpected KUIC02013E error message . . . 254
not available or not displayed . . . . . . . . 269
Corrupted tacmd responses are displayed in the
No chart is visible on the Forecast Details
command-line interface . . . . . . . . . 255
workspace . . . . . . . . . . . . . . 269
TACMD xxxxAction commands fail on Japanese
The Performance Analyzer Agent Statistics
systems . . . . . . . . . . . . . . 256
workspace shows database errors indicating that
tacmd executecommand command run against
some tables or views are missing . . . . . . . 269
subnode fails . . . . . . . . . . . . 256
Nonlinear tasks take too long to complete . . . . 270
tacmd getfile or putfile command is failing . . 257
Agent never connects to the monitoring server . . 271
Temporary files remain when tacmd getfile or
The Tivoli Enterprise Monitoring Server does not
putfile is interrupted . . . . . . . . . . 257
restart after installation of Domain Support . . . 271
tacmd listsit -m UX Managed System gives no
result . . . . . . . . . . . . . . . 257
Receive an OutOfMemory exception when using Chapter 16. Database troubleshooting 273
the import or export commands . . . . . . 257 Data loss prevention . . . . . . . . . . . 273
addBundles command times out . . . . . . 257 Backing up the TEPS database for recovery
tacmd createNode . . . . . . . . . . . 258 purposes . . . . . . . . . . . . . . 273
tacmd suggestbaseline minimum, maximum, Restoring the original database contents . . . 273
and average function values are ignored . . . 259 If you modify your password or if it expires . . . 273
tacmd suggestbaseline command receives an DB2 pureScale environment . . . . . . . . 274
error . . . . . . . . . . . . . . . 259 Receive First Steps error at the end of a DB2
You receive a message when trying to use the installation . . . . . . . . . . . . . . 275
tacmd maintagent command . . . . . . . 259 Windows portal server cannot connect to the
listSit command with the type option fails with database . . . . . . . . . . . . . . . 275
a KUIC02001E message on Japanese Systems . . 259 Oracle problem with JDBC drivers prior to 11.1.0.7 276
When using the listSystems command, the last Database contents are incorrect after installation 276
two digits for the version appear as 'XX' . . . 259 Migration script fails on DB2 for Linux, UNIX and
The command tacmd restartAgent fails if the Windows . . . . . . . . . . . . . . . 277
agent is already stopped. . . . . . . . . 260 Errors when migrating non-partitioned tables to
tacmd addSystem fails if agent already exists 260 partitioned tables . . . . . . . . . . . . 277
The addSystem command fails with error Using DB2 V9.1 for z/OS, Warehouse Proxy agent
message KUICCR099E . . . . . . . . . 261 encounters a large number of disconnections . . . 281
The addbundles command fails . . . . . . 261 Historical data is not warehoused . . . . . . 282
The exportBundles command does not work for Historical data for logs is incorrect . . . . . . 282
patches . . . . . . . . . . . . . . 261 Incorrect data is collected in the warehouse for
filtering if using a wildcard. . . . . . . . . 282
Contents xi
Data retrieval delays/time-outs causing no data Related publications . . . . . . . . . . . 333
conditions . . . . . . . . . . . . . . 326 Other sources of documentation . . . . . . . 333
You can use this guide in conjunction with the other books for your product.
Troubleshooting is not the same as problem solving, although during the process of
troubleshooting, you can obtain enough information to solve a problem, such as
with end-user errors, application programming errors, and system programming
errors.
You might not always be able to solve a problem yourself after determining its
cause. For example, a performance problem might be caused by a limitation of
your hardware. If you are unable to solve a problem on your own, contact IBM
Software Support for a solution. See Chapter 2, “Logs and data collection for
troubleshooting,” on page 5 for information on the types of data to collect before
contacting Support.
Typically, text messages relay information about the state and performance of a
system or application. Messages also alert the system administrator to exceptional
conditions when they occur. Consult the explanation and operator response
associated with the displayed messages to determine the cause of the failure. See
the document IBM Tivoli Monitoring Messages for message information.
Trace data capture transient information about the current operating environment
when a component or application fails to operate as designed. IBM Software
Support personnel use the captured trace information to determine the source of
an error or unexpected condition. See “Trace logging” on page 37 for more
information about tracing.
Problem classification
The first task in troubleshooting is to determine the origin of the problem, or
which component or function is experiencing a problem. To assist you in
determining the origin of the problem, collect documentation at the time of the
error.
You might experience problems with IBM Tivoli Monitoring in the following areas:
v Installation
v Upgrading
v Configuration
v Connectivity
v Tivoli Enterprise Portal
v Tivoli Enterprise Portal Server
v Tivoli Enterprise Monitoring Server
v Tivoli Authorization Policy Server
Procedure
1. Open the http://ibm.com website and select Support & downloads >
Technical support. You can also launch an IBM support website, such as
http://www.ibm.com/support/us.
2. Enter your IBM user ID when prompted or, in the Quick start page or Support
home, click Sign in to sign in with your IBM user ID or to register if you have
not yet registered.
3. Enter a keyword or keywords for the information you want to find in the
Quick Find or Search support fields. You can also browse through the other
Support tabs.
In the Support portal, you can specify the products for which you want to receive
notifications; choose from flashes, downloads, and technotes; and set up to receive
email updates.
Procedure
1. Open the http://ibm.com website and select Support & downloads >
Technical support. You can also launch an IBM support website, such as
http://www.ibm.com/support/us.
2. In the Quick start page or Support home, click Sign in to sign in or to register
if you have not yet registered.
Results
You begin receiving “IBM My notifications” emails about the products you have
selected and at the interval you specified.
Start Manage Tivoli Enterprise Monitoring Services using one of the following
methods:
Procedure
v Click Start > Programs > IBM Tivoli Monitoring > Manage Tivoli
Enterprise Monitoring Services
v Change to the install_dir/bin directory and run ./itmcmd manage
[-h install_dir] where -h install_dir is optional and used to specify the
installation directory if it is not the one in which the script is located.
Results
The reliability, availability, and serviceability (RAS) trace logs are available on the
Tivoli Enterprise Monitoring Server, the Tivoli Enterprise Portal Server, and the
monitoring agent. By default, the logs are stored in the installation path for IBM
Tivoli Monitoring.
The following links to sections in this document supply more information on these
files:
v For information on where they are stored, see “Log file locations” on page 37
v For information on setting the trace option for an IBM Tivoli Monitoring
component, see “Setting traces” on page 46.
v For information on dynamically setting the trace settings, see “Dynamically
modify trace settings for a Tivoli Monitoring component” on page 61.
v For information on reading RAS1 logs, see “Reading RAS1 logs” on page 45.
v For information on the ras1log tool, see “ras1log tool” on page 68.
The snapcore command gathers a core file, program, and libraries used by the
program and compresses the information into a pax file. The file can then be
downloaded to disk or tape, or transmitted to a remote system.
Take the following steps to run the snapcore command and collect information
you might need to debug and analyze the problem:
Procedure
1. Change to the directory where the core dump file is located:
# ls -l
total 84176
-rw-r--r-- 1 root system 2704 Feb 21 09:52 core.18048.01084144
2. Run the snapcore command to collect all needed files:
# snapcore -d /tmp/myDir core.18048.01084144
The snapcore command gathers all information and creates a new compressed
pax archive in the/tmp/myDir directory. If you do not specify a special directory
If the system stops on UNIX-based systems, collect the core file from the directory
that stores the binary file, to which the process belongs. For example, if the failing
process is the Tivoli Enterprise Portal Server server process, KfwServices, the core
is created in the /opt/IBM/ITM/archtype/cq/bin/ directory.
Procedure
To retrieve information on where the core file is created, enter the errpt -a
command.
Results
A summary of information is displayed about the most recent crashes and also the
location of the core file:
-------------
LABEL: CORE_DUMP
IDENTIFIER: A63BEB70
Description
SOFTWARE PROGRAM ABNORMALLY TERMINATED
Probable Causes
SOFTWARE PROGRAM
User Causes
USER GENERATED SIGNAL
Recommended Actions
CORRECT THEN RETRY
Recommended Actions
RERUN THE APPLICATION PROGRAM
IF PROBLEM PERSISTS THEN DO THE FOLLOWING
CONTACT APPROPRIATE SERVICE REPRESENTATIVE
Detail Data
SIGNAL NUMBER
11
USER’S PROCESS ID:
32248
FILE SYSTEM SERIAL NUMBER
10
INODE NUMBER
655367
PROCESSOR ID
0
CORE FILE NAME
/opt/IBM/ITM/aix533/cq/bin/core
PROGRAM NAME
KfwServices
STACK EXECUTION DISABLED
---------------
Take the following steps to enable Dr. Watson and configure it to create a detailed
dump file:
Procedure
1. To enable Dr. Watson as the default debugger, at the command prompt, enter
the following command: drwtsn32 –i.
2. To open the Dr. Watson configuration dialog, at the command prompt, enter the
following command: drwtsn32
3. Set the following fields:
a. Set the Crash dump Type to FULL.
b. Clear the Dump Symbol Table check box.
c. Enable the Dump all Thread Contexts check box.
d. Enable the Create Crash Dump File check box.
KpcCMA.RAS files
IBM Tivoli Monitoring on Windows systems has ( where pc is the two-character
product or component code) KpcCMA.RAS files in the c:\windows\system32 directory
to collect information about monitoring process failures.
Note:
1. If instance_name:host_name.domain_name is greater than 32 characters, the
characters beyond that number are truncated from the domain name.
2. Messages related to the self-describing agent show the status code with no
description if the message was from a later version of IBM Tivoli Monitoring
than the agent version. For example, you run the tacmd addsdainstalloptions
command on your V6.2.3 monitoring agent but it does not succeed and you
review the LG0 file for that agent. The log file has an entry with SDA status
code 1024, which was introduced in V6.3.0. What shows as the message
description, 0x400, should be SDA Install Blocked.
Self-Describing Agent Register/Install failed with STATUS (1024/0x400)
for PRODUCT "NT", with TEMS "TVT6048:CMS", VERSION_INFO "product_vrmf=06230100;
tms_package_vrmf=06230100;tps_package_vrmf=06230100;tpw_package_vrmf=06230100;".
Note: The version number of the Tivoli Enterprise Portal and theTivoli
Enterprise Portal Server must always be synchronized.
Note: Use the trace settings indicated in these troubleshooting instructions only
while you are trying to diagnose a specific problem. To avoid generating excessive
trace data, go back to the default trace settings as soon as the problem is solved.
The following resources are available to determine the software that is supported:
v For specific information about the supported software for IBM Tivoli
Monitoring, see “Hardware and software requirements” in the IBM Tivoli
Monitoring Installation and Setup Guide
v For platform and database support information for most Tivoli products, consult
the matrix at Tivoli Supported Platforms (http://www-306.ibm.com/software/
sysmgmt/products/support/Tivoli_Supported_Platforms.html)
For more information on workspaces that relate to historical data, see “Historical
data is missing or incorrect” on page 22.
To diagnose the problem that workspaces are missing or empty, see “Diagnosing
that workspaces are missing or empty.”
You can also check that application support has been added.
To diagnose that workspaces are missing or empty, perform the following steps:
Procedure
Preliminary diagnostics
1. Refresh the Navigator by clicking View > Refresh.
2. Verify that the monitoring agent has been started. Restart if necessary. In the
Tivoli Enterprise Portal, right-click the Navigator item of the monitoring agent
and click Start or Restart
3. Verify that the monitoring agent configuration is correct.
4. If your data is missing in an Oracle Agent workspace, see “Resolving Oracle
DB Agent problems - diagnostic actions” on page 34. Similar problems might
exist for other monitoring agents.
5. Check that application support has been added. See “Resolving application
support problems” on page 13.
For more information on actions that relate to these diagnostics, see the problem
resolution tasks.
Complete one or both of the following tasks to ensure that this is an application
support problem:
v “Diagnosing that workspaces are missing or empty” on page 12
v “Diagnosing that a situation does not raise when expected” on page 27
Procedure
What to do next
Procedure
Procedure
Procedure
Preliminary diagnostics
1. Verify the state of the monitoring agent in Manage Tivoli Monitoring Services.
2. Compare the status of the node in the physical Navigator view with the status
reported in the Managed System Status workspace. If the status in the physical
Navigator view agrees with the status shown in the Managed System Status
workspace, then the problem is at the monitoring agent. See “Resolving
monitoring agent problems” on page 16.
3. To determine whether the problem is in the portal server or monitoring server,
set the following trace in the portal server: ERROR (UNIT:ctcmw IN ER)./
4. Then examine the portal server log for the following statement: Node Status
event received (managed system name).
v If the trace shows that the last node status record received for the managed
system matches the status shown in the portal client, then the problem is
located in the monitoring server. See
v If the trace shows that the last node status record received for the managed
system indicated the correct status, then the problem is located in the portal
server. Run the portal server trace, collect logs, and call IBMSoftware
Support.
For more information on actions that relate to these diagnostics, see the problem
resolution tasks.
Procedure
Procedure
To diagnose that the portal server does not start or stops responding, perform the
following steps:
Procedure
Preliminary diagnostics
1. For more information about any messages received, see the IBM Tivoli
Monitoring Messages (http://pic.dhe.ibm.com/infocenter/tivihelp/v61r1/topic/
com.ibm.itm.doc_6.3/messages/itm_messages.htm) reference guide. Operator
responses and general information are provided for each message.
2. Allow the portal client enough time to establish a connection with the portal
server.
3. Is DB2® running?
What to do next
For more information on actions that relate to these diagnostics, see the problem
resolution tasks.
To resolve database problems, you perform diagnostic and corrective actions. These
actions include reconfiguring the portal server.
Procedure
To resolve database problems, you perform diagnostic and corrective actions. These
actions include reconfiguring the portal server and ensuring that your portal client
user ID is the same as the logon user ID of your system.
Procedure
To resolve database problems, you perform diagnostic and corrective actions. These
actions include ensuring that the user ID and password are correct.
The logon failure might cause one or more of the following messages to display:
v KFWITM392E: Internal error occurred during logon.
v KFWITM009I: The Tivoli Enterprise Portal Server is still being
initialized and is not ready for communications.
v KFWITM010I: Tivoli Enterprise Portal Server not ready.
v KFWITM395E: User ID has been locked or disabled.
v KFWITM396E: User ID has been locked or disabled by Tivoli Enterprise
Portal Server.
To diagnose that the portal server logon Tivoli Enterprise Portal Logon fails,
perform the following steps:
Procedure
Preliminary diagnostics
1. For a guide to the messages and operator responses, refer to IBM Tivoli
Monitoring Messages (http://pic.dhe.ibm.com/infocenter/tivihelp/v61r1/topic/
com.ibm.itm.doc_6.3/messages/itm_messages.htm).
2. Look in the portal server or portal client logs for more information concerning
the message.
To diagnose that the Tivoli Enterprise Portal does not respond, perform the
following steps:
Procedure
Preliminary diagnostics
1. Verify that the monitoring server is started.
What to do next
For more information on actions that relate to these diagnostics, see the problem
resolution tasks.
Procedure
Reconfigure the Java Control Panel. See “Performance tuning > Tivoli Enterprise
Portal client” in the IBM Tivoli Monitoring Installation and Setup Guide for more
information.
Procedure
Procedure
Preliminary diagnostics
1. To verify component connectivity through the Self-Monitoring Topology
workspace, perform the following steps:
a. In the Navigator Physical view, click the Enterprise item.
b. Select Workspace > Self-Monitoring Topology.
Alternatively, review the Tivoli Data Warehouse workspaces for the Warehouse
Proxy agent and Summarization and Pruning agent.
2. Verify the historical data collection configuration in the portal client or by
issuing the following tacmd commands:
v tacmd histlistproduct
v tacmd histlistattributegroups
v tacmd histviewattributegroup
v tacmd histConfigureGroups
v tacmd histViewAttributeGroup
v tacmd histUnconfigureGroups
v tacmd histStartCollection
v tacmd histStopCollection
What to do next
For more information on actions that relate to these diagnostics, see the problem
resolution tasks. See also the troubleshooting topics in the IBM Tivoli Warehouse
Proxy Agent User's Guide (http://pic.dhe.ibm.com/infocenter/tivihelp/v61r1/
topic/com.ibm.itm.doc_6.3/wpa/wpagent_user.htm) and IBM Tivoli Warehouse
Summarization and Pruning Agent User’s Guide (http://pic.dhe.ibm.com/infocenter/
tivihelp/v61r1/topic/com.ibm.itm.doc_6.3/spa/spagent_user.htm).
The causes of Warehouse Proxy agent problems might be because the short-term
historical data is being stored at the monitoring agent or the Tivoli Enterprise
Monitoring Server and should be switched, or because the Warehouse Proxy agent
cannot connect to the data warehouse or to the monitoring server.
Procedure
To resolve warehouse proxy agent problems, you perform diagnostic and corrective
actions. These actions include verifying that the warehouse database password and
user ID are correct. Also, you can update the configuration parameters.
Procedure
To resolve persistent data store for z/OS problems, you perform diagnostic and
corrective actions. These actions include verifying that the data store is configured
properly.
Procedure
Example
For more information about the persistent data store, see the IBM Tivoli
OMEGAMON XE and Tivoli Management Services on z/OS: Common Planning and
Configuration Guide (http://pic.dhe.ibm.com/infocenter/tivihelp/v61r1/topic/
com.ibm.omegamon_share.doc_6.3/zcommonconfig/zcommonconfig.htm).
To diagnose that a situation does not raise when expected, perform the following
steps:
Procedure
Preliminary diagnostics
1. Verify that the monitoring agent has started.
2. Verify that the situation is associated with a Navigator item in the Tivoli
Enterprise Portal.
3. In the situation event console, confirm that the situation is true and an event
has opened.
What to do next
For more information on actions that relate to these diagnostics, see the following
tasks:
Related tasks:
“Resolving situation-specific problems”
Procedure
Procedure
Preliminary diagnostics
If the situation does not raise, see “Diagnosing that a situation does not raise when
expected” on page 27.
What to do next
For more information on actions that relate to these diagnostics, see the problem
resolution task.
Procedure
To diagnose that CPU usage is high on a distributed system, perform the following
steps:
Preliminary diagnostics
1. Determine whether an IBM Tivoli Monitoring component is the root cause.
Another application or process running on the system might be causing high
CPU usage.
2. Use the tools and data provided by Task Manager to identify the process
causing high CPU usage. In the Processes tab you can reorder the processes by
CPU usage. An example of a process name is kntcma.exe for the Windows OS
agent.
3. Use the top command to display processes using high CPU. For
UNIX, you can also use the ps auxww command.
4. Verify the following:
v Is historical data collection enabled?
v Is the database undergoing a backup?
5. Is the situation writing a lot of event logs?
v Yes: Disable all event log monitoring situations.
6. Select each of the workspaces in turn, to see which one is consuming high
CPU.
7. Running the following ITMSUPER tools might also provide more information:
v Stressed Resources tool
v Connectivity tool
v Situations tool
8. When the computer (where the monitoring agent is running) has multiple
Network Interface Cards (NICs), the agent might not be bound to the Primary
NIC. The agent might not be able to establish connectivity with the monitoring
server. High CPU usage can result from the agent's frequent attempts to
connect.
a. To correct this, you might need to set the environment variable
KDEB_INTERFACELIST = '!*' or KDEB_INTERFACELIST = IP_address, where
IP_address is the address of the NIC.
b. Make the changes in the associated agent *ENV configuration file for
Windows, or the *.ini configuration file for UNIX or Linux.
What to do next
For more information on actions that relate to these diagnostics, see the problem
resolution tasks.
Procedure
Diagnostic actions
1. Run the Situation Test ITMSUPER tool.
2. Find out which situations have been deployed to the monitoring agent.
3. Open the agent-specific.lg0 file to view a list of situations started for that
agent.
Procedure
Corrective actions
1. Change the formulation of any situations that are causing excessive processing.
2. For situations with wildcard * characters, perform one of the following steps:
v Rewrite the situation using the SCAN strcscan function instead of the
character-by-character pattern-matching function LIKE. For example,
situations with this simple LIKE"*/process" pattern can be rewritten as SCAN
"/process".
v Rewrite the situation using non-UTF8 columns. For example, *IF *LIKE
NT_System.User_Name_U *EQ ’*group’ can be rewritten as *IF *LIKE
NT_System.User_Name *EQ ’*group’ where User_Name is a non-UTF8 column
and User_Name_U is the corresponding UTF8 column.
v Rewrite the situation combining predicates with an OR. For example, *IF
*LIKE NT_System.User_Name_U *EQ ’group*’ can be rewritten as *IF ( (
*VALUE NT_System.User_Name_U *EQ ’groupA’ ) *OR ( *VALUE
NT_System.User_Name_U *EQ ’groupB’ ) *OR ( *VALUE
NT_System.User_Name_U *EQ ’groupC’ ) ).
Procedure
Diagnostic actions
1. Check connectivity between the monitoring agent and the monitoring server.
2. Use the ping command to verify whether communication exists between the
monitoring server and agents. Ping from the monitoring agent system to the
monitoring server, and then from the monitoring server system to the
monitoring agent.
v Use the IP address of the host name specified during agent configuration.
v If the communication is broken and you see high CPU, proceed to the
corrective actions.
3. Turn on the RAS1 trace log to verify whether the monitoring agent has made a
connection to the monitoring server. See “Setting traces” on page 46 for more
information.
Procedure
1. If you still have high CPU usage issues even after ensuring proper connectivity
across firewalls, open a problem report with IBM Software Support or refer to
the IBM Support Portal (http://www.ibm.com/support/entry/portal/
software).
2. For more information, see the “Firewalls” topics in the IBM Tivoli Monitoring
Installation and Setup Guide.
Procedure
Diagnostic actions
1. Collect the detail traces of collector and RAS1 log. See the problem
determination topics for enabling detailed tracing in the collector trace log and
setting RAS trace parameters in the IBM Tivoli Monitoring for Databases: Oracle
Agent User's Guide.
2. Identify the SQL query that caused the high CPU usage issues from the
collector logs.
3. You can identify the SQL query that caused the high CPU usage issue from
ITM Oracle Agent logs or the Oracle tools. Use the following procedure to
identify the problematic cursors from ITM Oracle Agent logs:
a. Open the collector logs and find CFE1645 messages. The messages show the
return time of each cursors. For example: CFE1645T (165929) Time =
2008/06/06 16:59:29, collected records in 6 seconds.
b. The default timeout value of ITM Oracle Agent is 45 seconds. If it takes
more than 45 seconds, it might cause a timeout problem and Open Probe
pipe error will be reported in the collector log. CFE1645T (170246) Time =
2008/06/06 17:02:46, collected records in 203 seconds
c. When a timeout happens, review the previous cursor that executed before
this message. For example:
PDR3000T (170002) Deleting (1) rows for cursor DB6
RPF0300T (170002) Doing prep_l_fet for cursor DB6
ORU0085I (170002) --------------------------------------------------
ORU0090I (170002) Starting new SQL query.
ORU0095I (170002) <SELECT /*+RULE*/ COUNT(*) EXTENTS FROM SYS.DBA_EXTENTS >
ORU0085I (170002) --------------------------------------------------
CAT1610I (170213) Dump of row 1
UPX0100T 000: 20202020 20202020 20202032 34313135 * 24115*
4. The previous cursor (DB6) took about 2 minutes and 11 seconds to return data
causing the performance problem.
5. Were you able to identify an SQL query?
v Yes: Continue to the corrective actions task.
Corrective actions
1. Disable the problematic cursors by setting an environment variable:
v COLL_DISABLE_CURSORS
a. Launch Manage Tivoli Enterprise Monitoring Services.
b. Right-click the row that contains the name of the monitoring agent whose
environment variables you want to set.
c. From the pop-up menu, select Advanced > Edit Variables.
d. If the agent is running, accept the prompt to stop the agent.
e. The list dialog box is displayed. When only the default settings are in
effect, variables are not displayed. The variables are listed after you
override them. Override the variable settings:
1) Click Add.
2) From the Variable menu, select COLL_DISABLE_CURSORS. If the variable
is not there, you can add it.
3) In the Value field, type a value and click OK twice.
4) Restart the agent.
v db_extparms
a. Use a text editor to enter a new value for the db_extparms in the
hostname_or_instance_name.cfg file in the install_dir/config directory.
b. The cursors that are listed below take longer to return data and consume
excessive system resources in some customer environments: DB3, DB6,
KF1, KF4, STATLTRN, TS1, TS3, TS5, , and TS6.
c. Each comma-delimited, no white space, value represents a change to the
SQL cursor that is executed during data gathering operations within the
agent. The values are the SQL cursor name. For example, setting the
Extended Parameters field to DB3, TS1 means that the DB3 and TS1 SQL
cursor is enabled for Set FREEBYTES to zero, Set TSNEXTS to zero, and
Set MAXEXTTS to zero. The SQL cursor name is not case sensitive.
2. Recycle the Monitoring Agent for Oracle to recognize these changes to the
Extended Parameters value.
3. Using the name of the SQL cursor, you can look in the korcoll.ctl file for the
SQL modification that is done when the SQL cursor is enabled. The
korcoll.ctl file is located in the following locations:
v install_dir\TMAITM6
v $CANDLEHOME/misc
When these cursors are enabled, the Monitoring Agent for Oracle displays
default attribute values of these cursors in the Tivoli Enterprise Portal,
meaning, the Monitoring Agent for Oracle no longer monitors the attributes of
the enabled cursors.
4. An example of an SQL cursor is displayed below:
SQL cursor: DB3 - ARCHIVE LOG DISPLAY
SQL:
SELECT TABLESPACE_NAME UTSNAME,
SUM(BYTES) FREEBYTES
FROM SYS.DBA_FREE_SPACE
GROUP BY TABLESPACE_NAME;
Trace logging
Trace logs capture information about the operating environment when component
software fails to operate as intended. IBM Software Support uses the information
captured by trace logs to trace a problem to its source or to determine why an
error occurred.
Trace logs are commonly referred to as RAS1 logs because RAS1 is the name of the
IBM Tivoli Monitoring component that manages trace logging. Furthermore, the
two tracing-related environment variables are KBB_RAS1, which sets a product's
tracing level, and KBB_RAS1_LOG, which assigns the name, size, count, and directory
location of RAS1 log files.
By default, RAS1 logs on distributed systems are stored in the /logs directory
under the installation path for IBM Tivoli Monitoring. On z/OS, the RAS1 log is
stored as a SYSOUT file associated with the RKLVLOG DDNAME.
When an IBM Tivoli Monitoring product is installed and configured, RAS1 tracing
for the product is configured by default to specify ERROR-level logging, which
means only the most important run-time messages are traced. There are different
methods for customizing the logging level, along with the size, count, and location
of log files.
RAS1 log files are maintained with the following naming, as determined by the
KBB_RAS1_LOG environment variable in the product's configuration file:
v RAS1 logs are stored in the \logs directory under the installation path for
IBM Tivoli Monitoring. The following is an example of a log file name that
includes the local system name, the two-character product code of the Tivoli
Enterprise Portal Server, the time stamp in hexadecimal format when the process
started, and the log sequence number:
ibm-kpmn803v01_cq_472649ef-02.log
v On UNIX-based systems, RAS1 logs are stored in the /logs directory
under the installation path for IBM Tivoli Monitoring. The following example of
a log file name includes the local system name, the two-character product code
of the UNIX OS Monitoring Agent, the agent's “stat_daemon” child process
name, the time stamp in hexadecimal format when the process started, and the
log sequence number:
f50pa2b_ux_stat_daemon_49ac1eee-01.log
Note: When you communicate with IBM Software Support, you must capture and
send the RAS1 log that matches any problem occurrence that you report.
Here are the location of the trace log files that are associated with the use of the
following components. The default paths for install_dir are C:\Program Files\IBM
on Windows and /opt/ibm/ on Linux of UNIX.
Tivoli Enterprise Portal Server
install_dir\logs
install_dir/logs/ hostname_CQ_timestamp.log
where:
install_dir specifies the directory where the portal server was installed.
hostname specifies the name of the system hosting the product.
CQ is the component code for the portal server.
timestamp is a decimal representation of the time when the process was
started.
Tivoli Enterprise Portal browser client and Java Web Start
Location of trace log files that are associated with the use of the Tivoli
Enterprise Portal when the client is deployed within a browser or as a Java
Web Start application:
The log location depends somewhat on the version of Windows
being used. Here are the two most common locations:
%USERPROFILE%\AppData\LocalLow\IBM\Java\Deployment\log
%USERPROFILE%\Application Data\IBM\Java\Deployment\log
${user.home}/.java/deployment/log
For the browser client, the file name is pluginnnnnn.trace. For Java Web
Start, the file name is javawsnnnnn.trace.
where nnnnn is a unique, randomly generated numeric suffix to support
generational logs.
Tivoli Enterprise Portal desktop client
install_dir\cnp\logs\kcjerror_n.log
install_dir\cnp\logs\kcjras1.log
install_dir\cnp\kcj.log
install_dir/logs/hostname_CJ_timestamp.log
where:
install_dir specifies the directory where the portal client was installed.
_n represents the circular sequence in which logs are rotated. The logs
range from no _n for the current log file, to 1 through 9 for the previous
logs. A new kcjerror.log file is generated each time the desktop client is
started, at which time the previous log is renamed kcjerror_1.log. If there
was a kcjerror_1.log, that one gets renamed to kcjerror_2.log, and so on
until 9 is reached and the logs start over with kcjerror_1.log.
hostname specifies the name of the system hosting the product.
CJ is the component code for the portal client.
timestamp is a decimal representation of the time when the process was
started.
Chapter 4. Tools 39
You can configure for multiple named instances of the desktop client. This
is typically done, for example, when you want to have multiple desktop
client instances connecting to different portal server environments. When
you use the “Create instance” action from either the Windows Manage
Tivoli Enterprise Monitoring Services utility or the Linux CandleManage
panel associated with the desktop client, you are prompted to provide a
name for the new instance. The default instance has no name. All the log
file names for the desktop client include the instance name, so the file
naming conventions for the 3 logs are as follows: kcjinstance_name.log,
kcjerrorinstance_name.log, and kcjras1instance_name.log. For Linux, the
kcjinstance_name.log file actually uses the standard Linux log filename
convention of hostname_CJ_timestamp.log.
Tivoli Enterprise Monitoring Server
install_dir\logs\hostnameMS_ HEXtimestamp-nn.log
install_dir/logs/hostname_MS_timestamp.log
where:
install_dir specifies the directory where the monitoring server was
installed.
hostname specifies the name of the system hosting the monitoring server.
MS is the component code for the monitoring server.
HEXtimestamp is a hexadecimal representation of the time when the process
was started.
nn represents the circular sequence in which logs are rotated. The logs
range from 1 to 5, by default, although the first is always retained because
it includes configuration parameters.
Dashboard Application Services Hub
C:\Program Files\IBM\JazzSM\profile\logs\server1\SystemOut.log
/opt/ibm/JazzSM/profile/logs/server1/SystemOut.log
tivcmd Command-Line Interface for Authorization Policy
C:\IBM\TivoliMonitoring\logs\kdqras1_51229cb8-01.log
/opt/IBM/TivoliMonitoring/logs/kdqras1_51229cb8-01.log
Audit logs for the Authorization Policy Server
C:\Program Files\IBM\JazzSM\AuthPolicyServer\PolicyServer\
audit\hostname_2013.02.18_16.04.38.458.w7_audit.log
/opt/IBM/JazzSM/AuthPolicyServer/PolicyServer/audit/
hostname_2013.02.18_16.04.38.458.w7_audit.log
where:
hostname is the name of the computer where the Authorization Policy
Server is installed.
2013.02.18_16.04.38.458.w7 is the log time stamp.
Automation Server
install_dir\logs\kasmain.msg
install_dir/logs/hostname_AS_HEXtimestamp-n.log
where:
Chapter 4. Tools 41
HEXtimestamp is a hexadecimal representation of the time when the process
was started.
nn represents the circular sequence in which logs are rotated. The logs
range from 1 to 5, by default, although the first is always retained because
it includes configuration parameters.
The following table lists and describes the log files created when installing when
installing a Tivoli Enterprise Monitoring Server, Tivoli Enterprise Portal Server,
Tivoli Enterprise Portal client, and Tivoli Enterprise Monitoring Agent:
Table 1. Installation log files
Windows UNIX-based systems
v ITM_HOME\InstallITM\Abort<Product_name><date_timestamp>.log $CANDLEHOME/logs/candle_
This log is created if an abort occurs for either a first time installation.log
installation or a modification of previous installation of IBM Tivoli
Monitoring.
v
ITM_HOME\InstallITM\<Product_name>_<timestamp>.log
This log is created during a normal clean installation.
v
ITM_HOME\InstallITM\MOD_<Product_name>timestamp.log
This log is created if you modify an existing product specified with
the PC, or when adding or deleting components.
where:
Product_name
Specifies the product name. IBM Tivoli Monitoring 20050923
1815.log is the log file name for the IBM Tivoli Monitoring
installation CD.
timestamp
A decimal representation of the time at which the process was
started.
You can find a log for uninstallation on Windows in the root directory where the
product was installed:
Uninstall<PC><date_timestamp>.log
You can set the degree of logging and tracing to one of three levels:
v DEBUG_MIN
v DEBUG_MID
v DEBUG_MAX
You can set the level of logging and tracing by using the /z flag when you execute
the setup.exe file in the CLI.
v For GUI installation use one of the following commands:
– setup.exe /zDEBUG_MAX
– setup.exe /zDEBUG_MID
– setup.exe /zDEBUG_MIN
v For silent installation use one of the following commands:
– start /wait setup /z"DEBUG_MAX/sfC:\temp\SILENT_SERVER.txt" /s
/f2"C:\temp\silent_setup.log"
– start /wait setup /z"DEBUG_MID/sfC:\temp\SILENT_SERVER.txt" /s
/f2"C:\temp\silent_setup.log"
– start /wait setup /z"DEBUG_MIN/sfC:\temp\SILENT_SERVER.txt" /s
/f2"C:\temp\silent_setup.log"
For tracing and logging Java code (that is run on UNIX systems), this mechanism
enables problem debugging. Two sets of information are created – logs and traces.
Logs (*.log) are globalized and traces (*.trc) are in English. They contain entry and
exit parameters of method and stack traces for exceptions. The amount of
information traced depends on the level of tracing set.
Chapter 4. Tools 43
v CH/config/ITMConfigRAS.properties (for configuration)
Callpoints are the only component that is handled differently, their logs and traces
always go to the directory CH/InstallITM/plugin/executionEvents. The default
location for installation is CH/logs/itm_install.log(.trc) and for configuration it
is CH/logs/itm_config.log(.trc).
To gather all the needed logs and environment information in case of an error, use
the pdcollect tool. See “pdcollect tool” on page 68.
Upgrade actions taken outside of the Upgrade Toolkit are not recorded in the log.
Table 2. Upgrading from Tivoli Distributed MonitoringTivoli log file
Windows UNIX-based systems
$DBDIR/AMX/logs/log_tool_ $DBDIR/AMX/logs/log_tool_
timestamp.log timestamp.log
where:
$DBDIR
The Tivoli Management Environment Framework environment variable
that specifies the directory where the Object Repository (odb.bdb) is
located.
tool Specifies the IBM Tivoli Monitoring Upgrade Toolkit tool: witmscantmr,
witmassess, or witmupgrade.
timestamp
Specifies a time stamp that includes data and time of execution.
The log file name displays when the Upgrade Toolkit tool completes the upgrade
operation. Each time a Upgrade Toolkit tool runs, its generates a new log file that
is never reused by any tool. The contents of the log file conform to the Tivoli
Message Standard XML logging format. The following example is an excerpt from
an Upgrade Toolkit tool log file:
Even with default ERROR-level logging, you can find information in RAS1 logs
about product configuration, security settings, network interfaces, listening ports,
key milestones during startup and shutdown, run-time errors, user logins,
commands issued, etc.
At the top of a product's first RAS1 log, with -01 in the log file name, there is a
banner section containing details about the general operating environment. Here is
a sample banner section from a Windows OS agent log:
!4F68BA8C.0000!=================> IBM Tivoli RAS1 Service Log <===================
+4F68BA8C.0000 System Name: ITMSYSZ Process ID: 7132
+4F68BA8C.0000 Program Name: kntcma User Name: SYSTEM
+4F68BA8C.0000 Task Name: kntcma System Type: WinXP;5.1-SP3
+4F68BA8C.0000 MAC1_ENV Macro: 0xC112 Start Date: 2012/03/20
+4F68BA8C.0000 Start Time: 10:12:44 CPU Count: 1
+4F68BA8C.0000 Page Size: 4K Phys Memory: 2039M
+4F68BA8C.0000 Virt Memory: 2048M Page Space: 3935M
+4F68BA8C.0000 Service Point: system.itmsysz_nt UTC Start Time: 4f68ba8c
+4F68BA8C.0000 ITM Home: C:\IBM\ITM ITM Process: itmsysz_nt
+4F68BA8C.0000 Executable Name: C:\IBM\ITM\TMAITM6\kntcma.exe
+4F68BA8C.0000 KBB_RAS1: ERROR
+4F68BA8C.0000 KBB_RAS1_LOG: C:\IBM\ITM\TMAITM6\logs\ITMSYSZ_NT_4f68ba8c-.log
INVENTORY=C:\IBM\ITM\TMAITM6\logs\ITMSYSZ_nt_kntcma.inv
COUNT=05 LIMIT=8 PRESERVE=1 MAXFILES=10
+4F68BA8C.0000 KBB_ENVPATH: C:\IBM\ITM\TMAITM6\KNTENV
+4F68BA8C.0000 =====================================================================
As you can see, the values of the two RAS1 environment variables, KBB_RAS1 and
KBB_RAS1_LOG, are echoed in the banner section. You can find the Process ID of
this invocation of the product, as well as statistics about available memory and
CPUs. On UNIX-based systems, you also see Stack Limit, Core Limit, and the
maximum number of file descriptors in the NoFile Limit field. The UTC Start
Time is the hexadecimal representation of the product's start time, which gets
included in the log file name.
After the banner section in the first RAS1 log, are messages concerning major
events during product startup. Although RAS1 logs are often analyzed for
Chapter 4. Tools 45
evidence of problems and errors, the logs also contain key messages indicating the
health of a product, such as its initialization status. For example, here is an excerpt
from a Tivoli Enterprise Portal Server's first RAS1 log:
("CTServer::startServerDll") KFW1002I Starting Service: ’CEV v1.0’
("JVMProxyServer::start") id of object to deploy is ’CTCEV’.
("BSS1_GetEnv") KFW_CEV_TEST_CONFIG="N"
("CTServer::startServerDll") KFW1003I Started Service: ’CEV v1.0’
("CTServer::startServerDll") KFW1002I Starting Service: ’MCSAttribute v1.0’
("JVMProxyServer::start") id of object to deploy is ’CTMCSAttribute’.
("BSS1_GetEnv") KFW_MCS_XML_FILES="c:\ibm\itm\cnps\teclib"
("CTServer::startServerDll") KFW1003I Started Service: ’MCSAttribute v1.0’
("CTServer::startServerDll") KFW1002I Starting Service: ’Startup Complete v2.0’
("CTServer::startServerDll") KFW1003I Started Service: ’Startup Complete v2.0’
("CTServer::runORB") KFW1020I ****** Waiting for requests. Startup complete ******
("BSS1_GetEnv") TEPS_SDA="Y"
The “Waiting for requests...”" trace message signifies that the portal server is ready
to accept users logging in, which is an important milestone in the portal server
startup. RAS1 logs also echo the value of each product environment variable. As
shown in the excerpt, the Self-Describing Agent (SDA) feature is enabled in the
portal server.
Setting traces
When you encounter an error with IBM Tivoli Monitoring that requires contacting
IBM Software Support, you might be asked to submit a copy of the RAS1 log for
the product encountering the error.
The RAS1 log is an essential part of the trace diagnostic tools in Tivoli Monitoring.
By default, the RAS1 tool is set to only log errors and other critical messages.
However, you can configure RAS1 to log more detailed product information, as
instructed by IBM Software Support. If you are modifying a product's
configuration file to change RAS1 logging levels, be sure to first back up the
configuration file.
RAS1 syntax
Follow the RAS1 syntax for setting traces in your environment file.
KBB_RAS1= global_class (COMP: component_type) (ENTRY: entry_point)
(UNIT: unit_name, class)
where:
global_class
Indicates the level of tracing that you want. This is a global setting that
applies to all RAS1 filters in the process. If you set this global class by
itself, it is global in scope and the trace cannot filter on any of the other
keywords. Separate combined classes with a space. The following values
are possible. Valid abbreviations are in parentheses.
ERROR (ER):
returns severe error messages only (this is the default for most
applications).
STATE (ST):
records the condition or current setting of flags and variables in the
process. If state tracing is enabled, you can see the current state of
particular variables or flags as the process is running.
Usage notes
Chapter 4. Tools 47
restrictive filters (ERROR or STATE) or fewer UNIT keywords. The i5 OS
monitoring agent process is particularly sensitive to overload (see the workaround
in “Setting the trace option for the i5/OS agent” on page 54.)
As a rule, syntax checking of RAS1 tracing filters is lenient, for the following
reasons:
v The RAS1 grammar is flexible, which makes it difficult to determine what the
coder really intended and whether a coding error has occurred.
v A Tivoli Monitoring process should not fail to start because of a RAS1 syntax
problem, thus every effort is made to revert to defaults to allow the process to
keep running with a valid RAS1 setting in effect, even if KBB_RAS1 was
specified incorrectly.
Because of the lenient syntax checking, it is not always obvious if there's a
typographical error or other mistake in a RAS1 filter. This issue applies whether
KBB_RAS1 was configured in an environment file before process startup, or if it was
dynamically modified using one of the methods, such as tacmd settrace, that are
described in other Tools topics (see also “Dynamically modify trace settings for a
Tivoli Monitoring component” on page 61). In the unit_name example above, if you
inadvertently specified (UNIT:kfasdALL) without a blank between “kfasd” and
“ALL”, the RAS1 syntax checker interprets it to mean that the filter applies to
compilation units whose names begin with “kfasdALL”. Because no class value is
present in (UNIT:xyzALL), a default class of NONE is used for those compilation
units. When KBB_RAS1 changes are implemented, it is important to carefully check
what was specified to ensure it matches what was intended.
This log file contains all of the RAS1 tracing for the portal client. Whenever you
start a new work session, the log file is purged and rewritten for the current work
session. If you want to preserve the log file from the last work session, you must
rename it or copy it to another directory before starting the portal client again. The
kcj.log file contains errors generated by the Java™ libraries used in the portal
client.
Procedure
1. Always backup the files before altering them.
2. From the Tivoli Enterprise Portal menu, select File > Trace Options.
3. Select a trace class from the list or as instructed by IBM Software Support (such
as UNIT:Workspace ALL):
v ALL provides data for all classes. Use the setting temporarily, because it
generates large amounts of data.
v ERROR logs internal error conditions. This setting provides the minimum
level of tracing, with little resource overhead, and ensures that program
failures will be caught and detailed.
v NONE turns off the error log so no data is collected.
4. Click OK to close the window and turn on logging.
Procedure
v On the computer where the portal server is installed, click Start >
Programs > IBM Tivoli Monitoring > Manage Tivoli Enterprise Monitoring
Services.
1. Right-click the Tivoli Enterprise Portal Server service.
2. Select Advanced > Edit Trace Parms to display the Trace Parameters
window.
3. Select the RAS1 filters. The default setting is ERROR.
4. Accept the defaults for the rest of the fields and click OK.
v Set the following variable in the install_dir/config/cq.ini where
filter is the component you want to trace and trace_level is the level of
tracing you want.
KBB_RAS1=ERROR (UNIT:filter trace_level)
What to do next
Procedure
v Windows:
1. On the computer where the Tivoli Enterprise Monitoring Server is installed,
select Start > Programs > IBM Tivoli Monitoring > Manage Tivoli
Enterprise Monitoring Services.
2. Right-click the Tivoli Enterprise Monitoring Server service and select
Advanced > Edit Trace Parms to display the Trace Parameters window.
3. Select the RAS1 filters, adding a space between each UNIT trace setting, such
as ERROR (UNIT:kdy all) (UNIT:kfaprpst all). RAS1 is the unit trace for the
monitoring server. The default setting is ERROR.
4. Accept the defaults for the rest of the fields.
5. Click OK to set the new trace options.
6. Click Yes to recycle the service.
v Linux and UNIX:
1. Change to the install_dir/config directory.
2. Open the ms.ini file in a text editor.
3. Set the following variable, adding a space between each UNIT trace setting,
such as KBB_RAS1=ERROR (UNIT:KDY ALL) (UNIT:KFAPRPST ALL):
KBB_RAS1=ERROR (UNIT:filter trace_level)
Chapter 4. Tools 49
where filter is the component to trace and trace_level is the level of
tracing. The following example traces everything in the Deploy component:
KBB_RAS1=ERROR (UNIT:KDY ALL)
4. To trace the monitoring server command line interface, set the following
variable in install_dir/bin/tacmd.
5. Regenerate the hostname_ms_tems_name.config file by running the ./itmcmd
config -S [ -h install_dir ] [ -a arch ] -t tems_name command. (For
more information in the itmcmd commands, see the IBM Tivoli Monitoring
Command Reference (http://pic.dhe.ibm.com/infocenter/tivihelp/v61r1/
topic/com.ibm.itm.doc_6.3/cmdref/itm_cmdref.htm).)
6. Recycle the monitoring server. The command syntax for starting and
stopping the monitoring server is ./itmcmd server [ -h install_dir ] [-l]
[-n] start|stop tems_name.
What to do next
For information on how to set trace levels dynamically, see “Dynamically modify
trace settings for a Tivoli Monitoring component” on page 61.
Due to the performance impact of enabling diagnostic tracing, do not enable RAS1
or KAS_DEBUG tracing unless instructed to do so by IBM Support.
KAS_DEBUG
The KAS_DEBUG facility writes its trace messages directly to the existing
automation server RAS1 log file. The following list shows the increasing
granularity of trace message levels that are supported for KAS_DEBUG.
I - for Inhibit (NONE)
N - for Normal (default value for Error tracing)
P - for Performance
Y - for Yes (the same as S)
S - for State
V - for Verbose
T - for Trace (the same as internal FLOW tracing)
D - for Detail
M - for Maximum
A - for ALL (turn on all available levels)
Each KAS_DEBUG trace level that you set displays messages for the specified
trace level, plus all lower trace levels. For example, setting KAS_DEBUG=P
displays debug trace messages for P (Performance) and N (Error); and
Complete these steps to set the automation server trace option in the kas
environment file:
Procedure
1. Stop the automation server.
2. Edit the environment file:
v
a. Click Start > Programs > IBM Tivoli Monitoring > Manage Tivoli
Enterprise Monitoring Services.
b. Right-click the Tivoli Enterprise Monitoring Automation Server and
select Advanced > Edit Variables.
c. Click Add and enter KAS_DEBUG for the variable name and one of the trace
levels for the value, such as Y or D (for the list of trace levels, see
KAS_DEBUG).
d. If you also want to create two log files that capture additional debug
messages, click Add and enter KAS_DEBUG_MAXLOGS for the variable name
and Y for the value.
e. Click OK to save your changes.
v
a. Change to the install_dir/config directory.
b. Open the as.ini file in a text editor, and add the following line to the
end of the file: KAS_DEBUG=N, where N is the trace level you want to
specify, such as S or V (for the list of trace levels, see KAS_DEBUG).
c. If you also want to create two log files that capture additional debug
messages, add the following line to the end of the file:
KAS_DEBUG_MAXLOGS=Y.
d. Save and close the environment file.
3. Restart the automation server for the changes to take effect.
Chapter 4. Tools 51
Results
After you restart the automation server, the trace level is changed and you can see
the following message in the log file, where ERROR is the trace level:
INFO: KAS DEBUG set to [ERROR]
What to do next
If you set a high trace level, repeat the procedure to return the level to the normal
setting after you are done with diagnostic tracing.
If you also set KAS_DEBUG_MAXLOGS, the two log files are created and continue to
grow until the server is stopped or recycled. Set the value back to
KAS_DEBUG_MAXLOGS=N and delete the resources.txt and itmevents.log files after
you are done using this variable.
Complete the following steps set the automation server trace level from your
browser:
Procedure
where
protocol is the http or https protocol used by the automation server.
host is the name of the computer where the automation server is installed.
port is the port number used by the automation server for incoming requests
(default for http is 10001).
level is the KAS_DEBUG trace level to use, such as Detail or D:
I - for Inhibit (NONE)
N - for Normal (default value for Error tracing)
P - for Performance
Y - for Yes (the same as S)
S - for State
V - for Verbose
Results
After the new trace level is accepted by the automation server, a message is
displayed in the browser confirming the trace level; an equivalent message appears
in the RAS log. Example web browser message:
INFO: KAS DEBUG set to [ERROR]
What to do next
If you set a high trace level, repeat the procedure to return the level to the normal
setting after you are done with diagnostic tracing.
On Windows systems:
1. On the computer where the Tivoli Enterprise Monitoring Server is installed,
select Start > Programs > IBM Tivoli Monitoring > Manage Tivoli Enterprise
Monitoring Services.
2. Right-click the Tivoli Enterprise Monitoring Server service.
3. Select Advanced > Edit Trace Parms > to display the Trace Parameters
window.
4. Type (UNIT:kdy all) in the Enter RAS1 Filters field.
5. Accept the defaults for the rest of the fields.
6. Click OK to set the new trace options.
7. Click Yes to recycle the service.
On Linux systems, set the following variable in $CANDLEHOME/config/lz.ini:
KBB_RAS1=ERROR(UNIT:kdy ALL)(UNIT:kdd ALL)
If the agent is configured to use SNMPv3 Encryption when emitting the SNMP
alerts, set (COMP:SNMP ALL) so that the trace setting would be the following:
ERROR (UNIT:KRA ALL) (COMP:SNMP ALL)
Chapter 4. Tools 53
Use (COMP:SNMP ALL) when you are focusing on SNMP traps. If you are
focusing on an agent communication error or crash, then use:
KBB_RAS1=(UNIT:KRA ALL) (UNIT:s_ ALL)
The (UNIT:s_ ALL) trace level includes tracing of system calls during SNMP
processing.
Configure the KBB_RAS1 variable in the i5/OS monitoring agent environment file
with one of the following settings to set RAS1 tracing:
Procedure
v KBB_RAS1=NONE
1. Set KBB_RAS1=NONE in QAUTOTMP/KMSPARM(KBBENV) for the i5/OS agent.
2. Recycle the agent.
This setting produces a limited RAS1 log and the agent ends gracefully during
shutdown.
v KBB_RAS1=ERROR
1. Set KBB_RAS1=ERROR in QAUTOTMP/KMSPARM(KBBENV) for the i5/OS agent.
2. Comment out the following two lines by adding an asterisk (*) at the
beginning of each line:
KBB_RAS1_LOG=(QAUTOTMP/KA4AGENT01 QAUTOTMP/KA4AGENT02
QAUTOTMP/KA4AGENT03)
INVENTORY=QAUTOTMP/KA4RAS.INV LIMIT=5 PRESERVE=1
Commenting out the lines removes them from processing and a spool file is
created under the QAUTOMON user with trace data, which can be discarded
unless requested by IBM Support.
3. Save the environment file and recycle the agent.
4. To verify that the agent is ending normally, use ENDOMA or select Option 3 on
GO OMA and accept the default options.
Note that it is possible for the spool file to become full based on the agent trace
configuration settings.
Then you need to create a file called kKHDconfig.sysprops.cfg under the directory
$CANDLEHOME\TMAITM6 for UNIX systems, and install_dir\TMAITM6 for
Windows systems, containing a link to the handler configuration file as shown
below:
DInstallRASConfig="ITMConfigRAS.properties"
When the Warehouse Proxy Agent configuration panel is executed, tracing appears
in the $CANDLEHOME/logs/config.trc file for UNIX systems, and
install_dir/logs/config.trc for Windows systems, as described by the handler
configuration file.
To trace the 2way translator, set the trace level to (UNIT: KDY ALL) (UNIT:
KHD_XA ALL) in the Warehouse Proxy Agent environment file for KBB_RAS1.
The Summarization and Pruning agent uses C-based RAS1 tracing, Java-based
RAS1 tracing and Java-based internal tracing. By default, Summarization and
Pruning agent trace data is written to a file in the logs subdirectory. The default
RAS1 trace level is ERROR for all Summarization and Pruning agent components
and modules.
The following trace options are available for the Summarization and Pruning
agent:
KBB_RAS1=ERROR
Trace general errors. KBB_RAS1=ERROR Affects the content of the C-based
RAS1 tracing (hostname_sy_HEXtimestamp-nn.log).
KBB_RAS1=ERROR (UNIT:ksz ALL)
Trace agent startup. Affects the content of the C-based RAS1 tracing
(hostname_sy_HEXtimestamp-nn.log).
KBB_RAS1=ERROR (COMP:com.tivoli.twh.ksy ALL)
Minimum level trace for summarization. Affects the content of the
Java-based RAS1 tracing (hostname_sy_ras1java_timestamp-nn.log).
Chapter 4. Tools 55
KBB_RAS1=ERROR (UNIT:ksy1 ALL)
Medium level trace for summarization. Affects the content of the
Java-based internal tracing (hostname_sy_java_timestamp-n.log)
KBB_RAS1=ERROR (UNIT:ksy2 ALL)
Connection level trace for summarization. Affects the content of the
Java-based internal tracing (hostname_sy_java_timestamp-n.log)
KBB_RAS1=ERROR (UNIT:ksy3 ALL)
Statement level trace for summarization. Affects the content of the
Java-based internal tracing (hostname_sy_java_timestamp-n.log).
KBB_RAS1=ERROR (UNIT:ksy4 ALL)
ResultSet level trace for summarization. Affects the content of the
Java-based internal tracing (hostname_sy_java_timestamp-n.log).
KBB_RAS1=ERROR (UNIT:ksy5 ALL)
Column value level trace for summarization. Affects the content of the
Java-based internal tracing (hostname_sy_java_timestamp-n.log).
KBB_RAS1=ERROR (UNIT:ksysql ALL)
Traces every SQL statement being executed. Affects the content of the
Java-based internal tracing (hostname_sy_java_timestamp-n.log).
KBB_RAS1=ERROR (UNIT:ksysql1 ALL)
Same as (UNIT:ksysql ALL) but also includes all the parameter values used
in the parameterized statements.
Note:
1. The following settings: (UNIT:ksy3 ALL) or (UNIT:ksy4 ALL) or (UNIT:ksy5
ALL) produce a high volume of trace output.
2. By default, the Java-based internal trace (hostname_sy_java_timestamp-n.log)
wraps at 5 files, and each file contains 300,000 lines. To change the defaults, use
the following settings in the KSYENV (Windows) or sy.ini (Linux or UNIX) file:
KSZ_JAVA_ARGS=-Dibm.tdw.maxNumberDetailTraceFiles=<A>
-Dibm.tdw.maxLinesForDetailTraceFile=<B>
where:
<A> Specifies the maximum number of Java-based internal trace files that
can exist at any one time for a single launch
<B> Specifies the maximum number of lines per Java-based internal trace
file.
To reduce the number and size of *_sy_java_*.log to the minimum, you can
set <A> and <B> to 1, which creates a single *_sy_java_*.log file with a
maximum file size of approximately 225 KB. (If you set <A> and <B> to 0, the
Summarization and Pruning agent stops working.)
Edit the handler configuration file to set the debug tracing level and other
parameters.
Then you need to create a file called kKSYconfig.sysprops.cfg in the (Linux and
UNIX) install_dir\TMAITM6 or (Windows) install_dir\TMAITM6 directory,
containing a link to the handler configuration file:
DInstallRASConfig="ITMConfigRAS.properties"
To trace the 2-way translator, set the trace level to (UNIT: KDY ALL) (UNIT: KHD_XA
ALL) in the Summarization and Pruning agent environment file for KBB_RAS1.
Procedure
v Manually edit the install_dir\KUIENV file with the standard KBB_RAS1
statement to include the following settings:
KBB_RAS1=ERROR(UNIT:ksh all) (UNIT:kui all)
To debug KT1 as well, edit the line to be like the following example:
KBB_RAS1=ERROR(UNIT:ksh all) (UNIT:kui all) (UNIT:kt1 all)
v Manually edit the install_dir/bin/tacmd shell script to add a line
like the following example:
KBB_RAS1=ERROR(UNIT:ksh all) (UNIT:kui all)
To debug KT1 as well, edit the line to be like the following example:
KBB_RAS1=ERROR(UNIT:ksh all) (UNIT:kui all) (UNIT:kt1 all)
Setting the trace option for the IBM Tivoli Monitoring upgrade
toolkit
Table 3. Setting the trace option for the Tivoli Monitoring upgrade toolkit
Trace option Instructions
Endpoint tracing Run the following command to set log_threshold=3 or higher on an endpoint and enable
endpoint tracing:
wep ep set_config log_threshold=3
Chapter 4. Tools 57
Table 3. Setting the trace option for the Tivoli Monitoring upgrade toolkit (continued)
Trace option Instructions
Tracing in a test A Boolean value of TRUE or FALSE default. The default is FALSE.
environment.
Run the following command from a Tivoli Management Environment command prompt to
enable tracing:idlcall oid _set_debug TRUE
where:
oid Specifies the object ID of the Upgrade Manager object.
Run the wlookup Framework command to locate the Upgrade Manager object ID in the Tivoli
Management Environment:
wlookup -a | grep Upgrade
Note: Setting the trace value to TRUE sets all Upgrade Toolkit tools to TRUE, affecting all
users running Upgrade Toolkit tools.
Use the event forwarding trace facility to diagnose problems with event
forwarding.
The event forwarding trace facility uses RAS1 tracing. Event forwarding is set
during installation. The acceptable values include:
v STATE
v DETAIL
v ALL
The default trace value is STATE. If you change the trace level, you must restart
the monitoring server for the change to take effect.
:
1. In Manage Tivoli Enterprise Monitoring Services, right-click the Tivoli
Enterprise Monitoring Server.
1. Edit install_dir/config/
hostname_ms_Tivoli_Enterprise_Monitoring_Server_ID.config
where:
install_dir
Specifies the installation directory of the monitoring server.
hostname
Specifies the host name value supplied during installation.
2. Add (UNIT:kfaot trc_class) to the line KBB_RAS1=’ERROR’
where:
trc_class
Specifies one of the following levels of trace detail:
v STATE - minimum detail.
v DETAIL - medium detail.
v ALL - maximum detail.
For example, ’KBB_RAS1=’ERROR (UNIT:kfaot STATE)’
3. Save the file.
4. Recycle the monitoring server for the trace to take effect.
5. The monitoring server log can be found in install_dir/logs/
hostname_ms_nnnnnnn.log where is a time stamp. There might be multiple files
with different time stamps in the logs directory.
Setting the trace option for the IBM Tivoli Enterprise Console
Situation Update Forwarder
If your monitoring environment is configured for the IBM Tivoli Enterprise
Console, you can forward situation events to the Tivoli Enterprise Console event
server. You can also view events on the event server through the Tivoli Enterprise
Portal. If you want to forward situation events to and view updates from IBM
Tivoli Enterprise Console in the Tivoli Enterprise Portal, you can set the trace for
the Situation Update Forwarder on the IBM Tivoli Enterprise Console event server.
The default trace setting is low. You can edit the trace setting using the sitconfig
command.
$BINDIR/TME/TEC/OM_TEC/bin/sitconfig.sh update
fileName=configuration_file_name logLevel=trace_level
where:
Chapter 4. Tools 59
configuration_file_name
The file name of the actively loaded configuration file as indicated by the
situpdate.properties file.
trace_level
Specifies the level of trace as low, med, or verbose.
Use the IBM Tivoli Enterprise Console Situation Update Forwarder trace facility to
diagnose problems with the IBM Tivoli Enterprise Console Situation Update
Forwarder. The trace for the IBM Tivoli Enterprise Console Situation Update
Forwarder is set during installation. The acceptable values include:
v low
v med
v verbose
The default trace value is low. If you change the trace level after the Situation
Update Forwarder is started, you must restart the Situation Update Forwarder for
the change to take effect. There are two trace files:
synch_trace.log
is always created.
synch_msg.log
is created if an error occurs while running the Situation Update Forwarder.
where:
configuration_file_name
The file name of the actively loaded configuration file as indicated by the
situpdate.properties file.
trace_level
Specifies the level of trace as low, med, or verbose.
This syntax is used to specify a RAS1 trace in the KppENV file (where pp is the
product code: HL for the OMEGAMON z/OS Management Console or DS for the
Tivoli Enterprise Monitoring Server). After you add this configuration setting to
the KppENV file, you must stop and restart the address space for the setting to take
effect. After that, it remains in effect for the life of the address space. To end the
trace, you must edit the KppENV file again to reset the trace level, and stop and
restart the address space.
The KBB_RAS1 environment variable setting follows the RAS1 trace setting syntax
as described in “RAS1 syntax” on page 46.
The tacmd settrace command is described in the IBM Tivoli Monitoring Command
Reference (http://pic.dhe.ibm.com/infocenter/tivihelp/v61r1/topic/
com.ibm.itm.doc_6.3/cmdref/itm_cmdref.htm). The IBM Tivoli Monitoring Service
Console and its RAS1 interface are described here.
Note: When a product is restarted, its trace settings are read again from the
product's configuration file, either KpcENV or pc.ini where pc is the two-character
product code. Dynamically modifying trace settings does not change the KBB_RAS1
environment variable in the product's configuration file. To persist tracing changes
across product restarts, you must update KBB_RAS1 in the product's configuration
file.
The service console is available on all platforms and it offers a variety of system
administrator utilities to display component status and to query and modify
configuration information for an active Tivoli Monitoring process.
Service consoles are uniquely identified by their service point names. If you point a
browser to the integrated web server's 1920 listening port on a system that is
running any IBM Tivoli Monitoring processes, you will see a web page like the one
shown here.
Table 4. IBM Tivoli Monitoring Service Index. In this representation of the IBM Tivoli
Monitoring Service Console are two monitoring agents running on the AIX system, the UNIX
Log File agent and the UNIX OS agent. To access the service console for the UNIX Log
File agent, you would click the “IBM Tivoli Monitoring Service Console” link under the
service point name, “root.tameaix5_ul”.
IBM Tivoli Monitoring Service IndexWed, 21 Mar 2013 14:31:02 GMT
Chapter 4. Tools 61
In addition to service console links, the example has links to the Agent Service
Interface, which allows you to get reports for an installed agent. These reports
include information about an agent's private situations, private history data,
attribute descriptions, and current values. You can also make a service interface
request by supplying XML elements.
A Service Index might include other types of links. For example, if you are running
a hub Tivoli Enterprise Monitoring Server, the Service Index on that system
includes a link to the “IBM Tivoli Monitoring Web Services”, which provides an
interface to the hub monitoring server's SOAP server.
By default, users are permitted five unsuccessful login attempts to the service
console before they are locked out. The default lockout period is 30 minutes.
Procedure
1. Open a browser window.
2. Enter the following URL where hostname is the fully qualified name or IP
address of the computer where the Tivoli Monitoring process is running, and
1920 is the well-known HTTP listening port of the integrated web server:
http://hostname:1920
If you type ras1 in the command input area, you can see usage information for
this command. The ras1 sub-commands include set, list, and units.
Turn on dynamic tracing: ras1 set
The ras1 set command dynamically replaces a product's current KBB_RAS1
value with the new value that you specify. As an example, if the default
KBB_RAS1=ERROR is in effect and you want to enable full tracing of the xxx
compilation unit and low-level detailed tracing of the yyy compilation unit,
you would enter this command after logging in to the service console:
ras1 set ERROR (UNIT:xxx ALL) (UNIT:yyy ERROR DETAIL)
. The two additional UNIT filters take effect immediately, without requiring
a product recycle.
Display current tracing levels: ras1 list
The ras1 list command displays a product's current tracing levels.
Best practice is to issue a ras1 list before ras1 set for the following
reasons:
v To not omit an important tracing parameter in the new ras1 set value.
(Note that each ras1 set command is a complete replacement.)
v To record the original tracing level so that you can restore it after you
finish capturing diagnostic data.
Setting trace to ALL includes every trace point defined for a particular
component or compilation unit. It is the equivalent of setting “Error Detail
Flow State Input Output Metrics”. This might result in a large amount of
trace. ALL can sometimes be necessary when you begin troubleshooting a
problem but if you have been given a more specific setting, use the specific
setting instead of ALL.
The ras1 listras1 list command displays information in a slightly
different format that requires some explanation. For example, here is the
ras1 list ras1 list output against a Universal Agent running with
KBB_RAS1=ERROR (UNIT:kumamain Error State) (UNIT:kt1 Error Detail
Flow).
00000003, Unit="kt1", Class=EVERYE+EVERYU+ER+FL+DET
00000001, Unit="kumamain", Class=EVERYE+EVERYU+ER+ST
00000005, Comp="KLX", Class=EVERYE+EVERYU+ER
00000007, Comp="KDE", Class=EVERYE+EVERYU+ER+ST+IN+OUT+ME
00000009, Comp="NCS", Class=EVERYE+EVERYU+ER
0000000B, Comp="KDH", Class=EVERYE+EVERYU+ER
Chapter 4. Tools 63
In this sample Universal Agent configuration, there is a component debug
variable active, KDE_DEBUG=Y. This environment variable provides a
short-hand notation for activating ERROR STATE INPUT OUTPUT METRICS
tracing (listed above as ER+ST+IN+OUT+ME) in all compilation units belonging
to the KDE component. The components KLX, NCS, and KDH are running
with a Kxx_DEBUG value of ‘N’, which defaults to ERROR
(EVERYE+EVERYU+ER).
Determine which UNITs and COMPs are active: ras1 units
The ras1 units command helps you determine which UNITs and COMPs
are active in an IBM Tivoli Monitoring product. The first column contains
the available UNIT values, and the last column shows the corresponding
COMP values. Here is an excerpt of ras1 units command output for the
Windows OS agent:
kbbcre1.c, 630, Mar 20 2013, 14:42:14, 1.1, *
kbbcrn1.c, 630, Mar 20 2013, 14:42:13, 1.1, *
kdhb1de.c, 630, Mar 20 2013, 14:52:47, 1.1.1.1, KDH
kdh0med.c, 630, Mar 20 2013, 14:52:28, 1.1.1.1, KDH
kdhsrej.c, 630, Mar 20 2013, 14:53:37, %I%, KDH
kdhb1fh.c, 630, Mar 20 2013, 14:52:45, 1.1, KDH
kdhb1oe.c, 630, Mar 20 2013, 14:52:53, 1.2, KDH
kdhs1ns.c, 630, Mar 20 2013, 14:53:38, 1.3.1.2, KDH
kbbacdl.c, 630, Mar 20 2013, 14:41:46, 1.2, ACF1
kbbaclc.c, 630, Mar 20 2013, 14:41:45, 1.4.1.1, ACF1
kbbac1i.c, 630, Mar 20 2013, 14:41:47, 1.11.1.1, ACF1
kdhsfcn.c, 630, Mar 20 2013, 14:53:42, 1.3, KDH
kdhserq.c, 630, Mar 20 2013, 14:53:15, 1.2, KDH
kdhb1pr.c, 630, Mar 20 2013, 14:52:54, 1.1, KDH
kdhsgnh.c, 630, Mar 20 2013, 14:53:12, 1.1.1.4, KDH
kdh0uts.c, 630, Mar 20 2013, 14:52:27, 1.1, KDH
kdhsrsp.c, 630, Mar 20 2013, 14:53:46, 1.2, KDH
kdhs1rp.c, 630, Mar 20 2013, 14:53:44, 1.1, KDH
kdhscsv.c, 630, Mar 20 2013, 14:53:25, 1.14.1.1, KDH
kdebbac.c, 630, Mar 20 2013, 14:50:03, 1.11, KDE
kdebsac.c, 630, Mar 20 2013, 14:50:34, 1.2, KDE
The output shows that KDH, ACF1, and KDE are among the active
components. Several Tivoli Monitoring components have their own
DEBUG environment variable that provides equivalent tracing functionality
to the COMP option. For example, adding KDH_DEBUG=A to an agent's
KpcENV or pc.ini file would have the same effect as adding (COMP:KDH
ALL) to KBB_RAS1. It activates ALL level tracing for each source file that has
KDH listed in the last column.
Suppose that you want to capture additional tracing for the first two
source files in the ras1 units output. We know that the UNIT value
matches any compilation unit that starts with the specified character string.
Therefore, you can use the ras1 set command to add (UNIT:kbbcr ERROR
FLOW). ERROR tracing is already in effect because it is the global class, but
the addition of FLOW tracing causes function entry and exit data to be
logged for the two source files that begin with “kbbcr”, namely,
“kbbcre1.c” and “kbbcrn1.c”.
Turn off dynamic tracing: ras1 set (UNIT:name ANY)
After you have used ras1 set to activate additional tracing and captured
the required diagnostic data, you can use the ANY option to turn off the
tracing. Continuing with the previous example, you would deactivate
kbbcr FLOW tracing by entering the following command: ras1 set
(UNIT:kbbcr ANY). This command has the effect of removing kbbcr from
You can prevent users from accessing the service console that is available through
the integral web server.
If you need to prevent users from accessing the service console, use one of the
following steps. Each step is listed in order from most general to most specific:
Procedure
v Disable the entire integrated web server so that it is not initialized during
process startup:
1. Update the KDE_TRANSPORT environment variable that configures a product's
networking options. (Note that KDC_FAMILIES is an older environment
variable that serves the same purpose and follows the same syntax rules as
KDE_TRANSPORT)
2. At the end of the KDE_TRANSPORT string, add a space and the following
parameter:
http_server:n
The resulting value might look like this: KDE_TRANSPORT=IP.PIPE PORT:1918
use:y IP.SPIPE PORT:3660 use:y IP use:n SNA use:n http_server:n. As a
consequence of setting this environment variable, the service console facility and
its listening ports are not started. For more information on the KDE_TRANSPORT
parameters, see the “Tivoli Monitoring protocol usage and protocol modifiers” in
the IBM Tivoli Monitoring Installation and Setup Guide.
v Disable only the service console and allow the integrated web server to start:
1. Update the KDE_TRANSPORT environment variable that configures a product's
networking options.
2. At the end of the KDE_TRANSPORT string, add a space and the following
parameter:
http_console:n
The resulting value might look like this: KDE_TRANSPORT=IP.PIPE PORT:1918
use:y IP.SPIPE PORT:3660 use:y IP use:n SNA use:n http_console:n. After
you update the environment variable and recycle the product, if you go to
http://hostname:1920, you do not see “IBM Tivoli Monitoring Service Console”
listed under the product’s service point.
v Disable a particular listening port so that the service console cannot be accessed:
1. By default, the integrated web server starts an http listener on port 1920 and
an https listener on port 3661. Assigning a port number of 0 has the effect of
disabling that listener.
2. If you want to enforce SSL-only web access, you can update the
KDE_TRANSPORT environment variable: KDE_TRANSPORT=IP.PIPE PORT:1918
use:y IP.SPIPE PORT:3660 use:y IP use:n SNA use:n http:0
Chapter 4. Tools 65
After setting this environment variable, you can access the service console only
through the https://hostname:3661 URL. To block both listening ports, and
prevent all service console access, add a space and the follow parameter to the
KDE_TRANSPORT value:
https:0
Procedure
1. From your Windows desktop, select Start > Programs > IBM Tivoli
Monitoring > Manage Tivoli Enterprise Monitoring Services.
2. Right-click Tivoli Enterprise Monitoring Server, then select Change Startup
from the menu.
3. Select the Allow Service to Interact with Desktop check box.
Results
The next time the portal server is started, the process tasks are shown in a
command prompt window.
KfwSQLClient utility
This utility provides an optional cleanup step if any of the portal server-generated
workspace queries must be deleted. A sample scenario where this might be
necessary is if you initially create a metafile application called DISKMONITOR for
the Tivoli Universal Agent that has five attribute groups in it. Assume that you
subsequently remove two of the attribute groups, which results in a new
application version suffix. You then decide to run um_cleanup to reset the
DISKMONITOR version back to 00. After completing the cleanup process, the
Navigator tree still shows workspaces for each of the five original attribute groups,
even though the metafile contains only three attribute groups.
This mismatch is caused by the fact that the portal server saves workspace queries
in the KFWQUERY table of the portal server database, which is not updated by the
um_cleanup script. Therefore, the original 00 version of the queries, which knows
about the five original attribute groups, is still being used when you view the
DISKMONITOR00 application.
If you determine that you need to delete one or more portal server-generated
queries for your Tivoli Universal Agent applications, there is a Tivoli Universal
Agent-provided script called um_cnpsCleanup.bat, which is installed on Windows
computers, that demonstrates how to perform the delete. The script is very short
and uses only the following command:
kfwsqlclient /d TEPS2 /e "delete from kfwquery where id like ’zkum.%%’;"
Note that this command deletes all portal server-generated Universal Agent
queries, which always begin with zkum. To confirm that portal server-generated
Tivoli Universal Agent queries have been deleted, or to see which queries are
currently defined, run the following select command against the KFWQUERY
table:
kfwsqlclient /d TEPS2 /e "select id, name from kfwquery where id like ’zkum.%%’;"
Procedure
1. If the Tivoli Enterprise Portal is running, exit by closing the browser window.
2. Start the Java Control Panel:
v Click Start > Settings > Control Panel and double-click the Java
Control Panel icon.
v Run the following command: java_install_dir/jre/bin/ControlPanel
3. In the Java Control Panel's General panel, click the Settings button and the
Delete Files button. Select all the check boxes that contain references to
Applications and Applets. Click OK to clear the cache.
4. When a message indicates that the JAR cache is cleared, click OK.
What to do next
If you want to start browser mode again, restart your browser and type the URL
for the Tivoli Enterprise Portal. The Java Extension Installation progress bar shows
as each Java archive file is downloaded. Upon completion, the logon window
opens and prompt you to enter a user ID.
Chapter 4. Tools 67
v When an API or socket client program connected and disconnected.
The DPLOG also records other actions including metafile refreshes. The
two most common universal agent problem symptoms are:
v One or more managed systems do not come online.
v The managed systems are online but the workspaces are empty.
Use the UAGENT application workspaces as one of the first tools to
diagnose a universal agent problem. You might find the solutions for both
problems in the appropriate DPLOG. The ODBC data provider also
includes a DPLOG message indicating when monitoring started for every
attribute group listed in every ODBC metafile.
ACTION workspace
Whenever a Take Action command is issued or a Reflex Action fires, an
entry is added to the ACTION workspace. The Action table is keyed and
ActionID is the Key attribute. The Action table rows have a time-to-live
value of 30 minutes. Unlike the DPLOG which is data provider-specific,
the ACTION table is shared by all data providers. If you run multiple data
providers, the ACTION workspace under every UAGENT application
contains the same rows.
The Action_Result can indicate what happened to a particular Take Action
command. For example, if universal agent reflex actions fire faster than one
per second, the ACTION workspace temporarily stops recording the
results. Recording resumes after several minutes if the action rate slows
down.
pdcollect tool
Use the pdcollect tool to collect the most commonly used information from a
system. Technicians in IBM Software Support use this information to investigate a
problem.
The pdcollect tool is used to gather log files, configuration information, version
information, and other information to help solve a problem. You can also use the
tool to manage the size of trace data repositories.
The pdcollect tool is run from the tacmd pdcollect command. To use this tool, you
must install the User Interface Extension. When you install or upgrade the Tivoli
Enterprise Portal Server, the Tivoli Enterprise Services User Interface Extensions
software is automatically installed in the same directory. The portal server
extensions are required for some products that use the Tivoli Enterprise Portal,
such as IBM Tivoli Composite Application Manager products. For more
information about this command, see the IBM Tivoli Monitoring Command Reference
(http://pic.dhe.ibm.com/infocenter/tivihelp/v61r1/topic/com.ibm.itm.doc_6.3/
cmdref/itm_cmdref.htm).
ras1log tool
This is a tool that converts the time stamps contained in trace logs into readable
values. This tool can be found in the itm_install/bin directory on both Windows
and UNIX systems. The following lists how the help appears:
usage: ras1log [-l|u] logfile ...
-l for local time
-u for UTC time
The first example sends the result to the screen, the second sends the result to grep
to find all of the lines with the text 'GetEnv' in them, which are then printed on
the screen, and the third sends the result to a file named tems_log.
By default this tool converts the timestamps to UTC time. When using the -l
option, it writes local time instead.
You can record these events in the Tivoli Data Warehouse for later retrieval and
analysis.
For more information about common logging, see “Audit logging” in the IBM
Tivoli Monitoring Administrator's Guide. For errors that occur with the facility, see
Chapter 19, “Auditing facility troubleshooting,” on page 311.
ITMSuper
The ITMSUPER tool performs audits of the IBM Tivoli Monitoring environment
(topology, connectivity, application support consistency checks, situations
distribution, warehouse analysis, etc.).
This tool can be run in stand-alone mode by pointing to the Tivoli Enterprise
Monitoring Server on any platform. You can run the ITMSUPER tool from a
Windows system without having other ITMSUPER software installed. The
Chapter 4. Tools 69
ITMSUPER Tools are included in the IBM Support Assistant (ISA), a free local
software serviceability workbench that helps you resolve questions and problems
with IBM software products. See IBM Support Assistant (http://www-01.ibm.com/
software/support/isa).
After starting the script, run the SetPerm command, which is located under
install_dir/bin/ directory. This command sets root permission for certain
UNIX-based systems agent files.
The product was installed with When the hub Tivoli Enterprise Monitoring Server is installed and configured
non-root user and started by root with non-root user and is started by root user, there might be some files that are
user. Why am I having issues accessible to the root user but not the non-root user. The non-root user cannot
with permissions? access some files created when the monitoring server was started as root The
changes in permissions due to the root user affect the monitoring server. Several
attempts with root user might cause the monitoring server service to fail. To
overcome the permission issue, run securemain with non-root user, and start the
component with non-root user.
How can I set the trace option to Add the following in the pc.ini file where pc is the two-character product code
capture any abends (core files)? for the monitoring agent. For an example if it is KUX agent, add the following
line in install_dir/config/ux.ini file:
KBB_SIG1=-trace –dumpoff
In an environment of 50 servers
with at least one agent per server, Is it not necessary to change all the other UNIX-based systems agents for
a new agent (vt) was installed IP:PIPE. You must configure only the agent, which connects to the Tivoli
outside the firewall. The new Enterprise Monitoring Server through a firewall. However, you must configure
agent must be configured on the Tivoli Enterprise Monitoring Server for IP.PIPE communication.
Tivoli Enterprise Monitoring
Server for IP.PIPE communication. While configuring the agent, which communicate through the firewall, you get
Is it necessary to change all the the following options:
other UNIX-based systems agents v Does the agent connect through a firewall? [YES or NO] (Default is: NO)
for IP:PIPE? v IP.PIPE Port Number (Default is: 1918)
v Enter name of KDC_PARTITION (Default is: null)
Pressing the backspace key, If you receive one of these symptoms when using the backspace on UNIX
characters such as "^?" and "^H" computers, the backspace key was incorrectly configured.
are displayed on the screen.
Configure your terminal and "stty erase" to use the same key code for
The backspace key appears to be backspace. Consider using "^?" as the key code. Verify your configuration with
working correctly when entering the IBM Tivoli Monitoring distributed utility, Install: BackspaceCheckUtility.
text, but you later find characters
such as "^?" and "^H" in
configuration files and your
software malfunctions.
When running the install.sh script
on a Linux system, I get a When I run the command "getconf GNU_LIBPTHREAD_VERSION" on my
Memory fault (core dump) at system, the response I receive is "linuxthreads-0.10" or something similar. This is
different, random stages of the caused by the /etc/profile entry of "LD_ASSUME_KERNEL=2.4". If I unset this
installation, regardless of what variable or change the value of /etc/profile to "2.6", the getconf command
selections I make. returns "NPTL 2.3.4" or something like it. This enables me to run the install.sh
script without causing the memory fault.
OR
Why does a Linux or UNIX-based This is an expected condition. The following example depicts an AIX installation
installation to a non-default path to a non-default location. The following links are created when the SetPerm
create directories in the default command is run:
/opt/IBM/ITM path? /opt/IBM/ITM/tmaitm6
/opt/IBM/ITM/tmaitm6/links
/opt/IBM/ITM/tmaitm6/links/aix52
/opt/IBM/ITM/tmaitm6/links/aix52x6
/opt/IBM/ITM/tmaitm6/links/aix53
The SetPerm command creates those links by design. Some of the binary files
have hard-coded execution paths. This coding is required by the operating
system in order to start a program object in authorized mode [root owned with
uid].
The IBM Tivoli Monitoring Installation and Setup Guide documents installation
on a single target location. However, by using local testing and configuration
control, you can install to multiple target locations and run Tivoli Monitoring
from all of them. For example, you can run multiple remote monitoring servers
on a single server. The multiple monitoring servers require a non-default
configuration, such as using different base port numbers.
v If all installations on the system are at the same maintenance level, running
the SetPerm command and updating the hard-coded /opt/IBM/ITM/tmaitm6/
links directory structure does not cause any problems.
v If all installations on the system are not at the same maintenance level,
running the SetPerm command and updating the hard-coded
/opt/IBM/ITM/tmaitm6/links directory structure can cause problems. This
scenario needs more testing than the scenario where all installations are at the
same level.
The following procedure might resolve problems you encounter in the latter
scenario:
v Maintain an installation on this system with the most current maintenance.
v Run the SetPerm command from this installation each time after other
installations apply maintenance or add agents.
v Run the SetPerm command from this installation each time after other
installations run the SetPerm command or the secureMain commands.
Note: For some cases, the OS Agents for example, only one agent can be
installed because of the agent's interaction with the operating system.
Mismatched files are identified when you forget to upgrade the agent support files
during your upgrade or you forget to upgrade the TEPS support, but upgrade the
agent support files.
To remedy this situation, complete the support upgrade specified by the warning.
See “Resolving application support problems” on page 13 for more information.
Startup Center
Review the symptoms that can occur in the Startup Center for suggested causes
and solutions.
If the Startup Center fails to create the Tivoli Data Warehouse database and user,
follow the Warehouse Proxy Agent configuration instructions to create the Tivoli
Data Warehouse database and user. See "Configuring a Warehouse Proxy agent" in
the IBM Tivoli Monitoring Installation and Setup Guide.
If the Startup Center fails to reset the sysadmin password on the hub Tivoli
Enterprise Monitoring Server configuration panel, reset the password manually.
On operating systems such as UNIX, a new user cannot be created and a password
cannot be reset in the Startup Center when you use a non-root user to install the
Warehouse Proxy Agent and Tivoli Enterprise Portal Server. To remedy this
situation, create the user or reset the password manually.
If the Tivoli Monitoring Warehouse DSN is not created by the Startup Center,
create the DSN manually by using the Warehouse Proxy Agent configuration
instructions. See “Configure communications between the Tivoli Enterprise Portal
Server and the data warehouse > Configuring a Windows portal server (ODBC
connection)” in the IBM Tivoli Monitoring Installation and Setup Guide. For more
information, check the WAREHOUSE_ODBC.log and WAREHOUSE_ODBC.trc files under the
target system temporary_directory\DSNUtil (for example, C:\Temp\DSNUtil).
If you have an existing 32-bit Tivoli Data Warehouse database in the 64-bit DB2
instance, the Startup Center fails to test the DSN for database connectivity after
creating the Tivoli Monitoring Warehouse DSN. The WAREHOUSE database is not
upgraded from 32-bit to 64-bit automatically. For more information, check
WAREHOUSE_ODBC.log and WAREHOUSE_ODBC.trc under target system
temporary_directory\DSNUtil, for example, C:\Temp\DSNUtil.
When you run the discovery process for available machines, the Startup Center
might not identify the type of operating system for some systems. These operating
systems are listed as “Unknown Operating System”.
This issue does not prevent the use of the affected systems. If the operating system
type of a specific system cannot be discovered, you are given the opportunity to
categorize the system manually in a later step. When you assign systems to the
components, if a system categorized as “Unknown Operating System” is assigned
to a component, you can select the correct operating system from the list in the
window that is displayed. After you have specified the correct operating system,
the system is moved to the correct category in the list.
Whenever you find an operating system that is not discovered correctly, you
should ideally force nmap to generate a signature, so that you can submit it to
Insecure.Org for integration in the NMap fingerprint database.
Run the nmap -O -sSU -T4 -d target command, where target is the misidentified
system in question. The fingerprint is a series of lines where each start with “OS”.
Submit the information at http://insecure.org/cgi-bin/submit.cgi?corr-os.
If you run the Startup Center from eclipse.exe, the distributed installation process
might not discover systems within the IP range that you specify. Instead, run the
Startup Center from launchStartup.bat. Note that this behavior is limited to
distributed installations of Windows systems, not of Linux or UNIX systems, when
running the Startup Center from launchStartup.sh.
When a Global Access List hub monitoring server is installed with a previous
version of a remote monitoring server, you will see duplicate insert errors (SQL1
return code 80) after an agent switches away from the remote monitoring server
and then switches back. These messages do not indicate an environment execution
error.
Any monitoring agent released with IBM Tivoli Monitoring V6.2.2 or before, other
than agents built with the latest Agent Builder tool, should not be installed on top
of the IBM Tivoli Monitoring system monitor agents.
When using remote deployment to upgrade the Tivoli Data Warehouse agents
(Warehouse Proxy Agent and Summarization and Pruning Agent), you must use a
specific workaround to ensure that the upgrade is successful.
v Add the following line to the KHDCMA.INI file for the Warehouse Proxy
Agent or the KSYCMA.INI file for the Summarization and Pruning Agent, and
then reconfigure and restart the agent:
CTIRA_SYSTEM_NAME=%computername% .TYPE=REG_EXPAND_SZ
v Add the following variable to the hd.ini file for the Warehouse
Proxy Agent or the sy.ini file for the Summarization and Pruning Agent, and
then restart the monitoring agent:
CTIRA_SYSTEM_NAME=$RUNNINGHOSTNAME$
This error is caused by the deletion of the gskit directory, whether intentionally or
by accident, without clearing the registry information. If gskit was previously
installed by another product and has a dependency on it, for example DB2 9.1,
then let that product reinstall it, or if there are no other products that depend on
the version of that gskit, then you can clear the GSK7 entry in the registry that can
be found under My Computer\HKEY_LOCAL_MACHINE\SOFTWARE\IBM\GSK7. Then rerun
the IBM Tivoli Monitoring installation to allow the gskit to be reinstalled.
This error is not necessarily a fatal error. It simply means the application did not
provide an upgrade seeding file. There are generally two types of seeding files:
install and upgrade. The installer determines which one to apply by checking to
see if there are already situations belonging to the application on the target hub. If
no situation is found, then the installation seeding file is chosen, otherwise the
upgrade seeding file is used if provided. In a hot standby setup, as soon as one
hub is seeded, the other hub can copy the situations immediately. So when seeding
is applied to the second hub, the installer detects existing situations and looks for
the upgrade seeding file instead. Even though some applications do not provide
upgrade seeding files, because the hubs automatically synchronize seeded data, it
is generally not a serious issue. Seeding can still be forced on the second hub by
using the -f option.
These exceptions can be ignored and have no impact on either eWAS or IBM Tivoli
Monitoring functionality.
Monitoring agents might fail to start, however, if all of the following conditions are
met:
v The managed system does not have a system GSKit installed or the system
GSKit is at a version not supported by IBM Tivoli Monitoring V6.2.1 or later.
v The agent is configured to use secure communications (IP.SPIPE) rather than
normal communication (IP.PIPE).
If agents on a managed system fail to start after an IBM Tivoli Monitoring V6.2
Multi-Instance Agent or IBM Tivoli Monitoring V6.2 Agent Support is installed,
any one of the following corrective actions can be taken:
v Run kinconfig.exe -G on the managed system.
v OR
v Reconfigure any of the IBM Tivoli Monitoring V6.2.1 (or later) monitoring agents
on the managed system by running kinconfig.exe -rKproductcode.
v OR
v Install another IBM Tivoli Monitoring V6.2.1 monitoring agent (or later).
Change to the directory where setup.jar exists, and then use java -jar
setup.jarto run the installer.
The application support for the version being deployed must be installed to the
portal server, or the agent configuration xml file (for example,
r2_dd_062100000.xml) must be manually copied to the same location in the portal
server (../classes/candle/kr2/resources/config) where the current-level
configuration xml file (for example, r2_dd_062200000.xml) resides.
Using the Application Support Installer with the Silent Installation Response file
option is not supported. The recommended mechanism for the installation is using
the GUI interface.
On Windows systems:
1. Run cmd.
2. Get the IBM Java location by running the following script:
CANDLEHOME\InstallITM\GetJavaHome.bat
To identify the right bundle for a particular system, the generated Software
Package Definition (SPD) file uses the naming convention: product_code
interp.spd. The interp tells you in which operating system, architecture, or kernel
the bundle can be installed.
To prevent this problem, stop the running agent, and use Tivoli Configuration
Manager (TCM) or Tivoli Provisioning Manager (TPM) to install the second agent.
Recovery from a hard kill of the installer is currently not a supported scenario
since the current installer does not have built-in rollback capability. Executing a
Chapter 5. Installation and configuration troubleshooting 83
hard stop of the installer will leave some or all IBM Tivoli Monitoring functions
(including uninstall) in an unpredictable or disabled state.
However, you should be able to continue with the uninstallation after ensuring
that there is indeed no installation being run on the system.
If you click YES, there is a risk of losing your customized tables. To ensure that
you do not lose data, complete the following steps:
Procedure
1. Click NO and exit the upgrade installation.
2. Restart the remote monitoring server computer.
3. Stop all Tivoli Monitoring components.
4. Rerun the upgrade installation now with the remote monitoring server in the
stopped state.
Results
Remote configuration and installation of a database agent requires that IBM Global
Security Kit (GSKit) be installed in directory C:\Program Files\ibm\gsk7, or that
the GSKit directory be defined in the Windows System environment variable
Choose one of the following resolutions that best fits your environment:
v Install the GSKit product by executing the InsGSKit.exe program in the target
directory C:\Program Files\ibm\ directory.
v Assign the System Environment variable named ICCRTE_DIR to the directory
path of the currently installed GSKit product (for example, C:\ibm\gsk7).
v When the error is reported, manually configure the Monitoring Agent for DB2
Service Startup Parameters to use the correct user name and password to
interact with the DB2 9.1 product. Ensure that the InteractsWithDesktop Service
is not enabled for this DB2 Agent Service.
Create the repository at the C:\IBM\ITM\CMS level of the directory structure, not at
the C:\IBM\ITM\CMS\depot level. Then set DEPOTHOME to DEPOTHOME=\\hubtems\
centralrepository\depot.
The agent was not installed for one of the following reasons:
v There is another installation in progress that cannot complete until the computer
is restarted.
–OR–
v You are attempting to install a component that is already installed.
Refer to the lcfd.log on the endpoint and the agent installation log as listed in
Table 8 to determine the exact cause of the problem.
Table 8. lcfd log file
Windows UNIX-based systems
install_dir/Install/Abort IBM Tivoli Monitoring install_dir/logs/candle_installation.log
timeStamp.log
Contact IBM Software Support if you cannot install the agent. See Chapter 2, “Logs
and data collection for troubleshooting,” on page 5 for information on what types
of data to collect before contacting Support. See the IBM Support Portal
(http://www.ibm.com/support/entry/portal/software).
Also, monitoring agents that have been configured for autonomous operation
cannot be installed on the same system as those that are connected to a monitoring
server.
These warnings are caused by an older installer missing some tags required by the
chkconfig utility, used to manage system startup files. These warnings do not
adversely affect the installation, and can safely be ignored.
Self-describing capability
Review the self-describing agent and application support problems to learn if your
installation or configuration problem is related.
A reported failure means that the agent is installed without using the
self-describing mode. You can relaunch the agent installation to attempt support
installation again. Before relaunching the installation, ensure that the failure will
not reappear by checking the previous installation log for the reasons for the
failure. Correct the obtained failure reasons (for example, fix the contents of the
manifest, unlock required files or folders). The alternative is to leave the agent
installed in non-self-describing mode. When relaunching, select only the agents
that must be installed with installation self-describing mode support.
When the installation errors have been manually corrected on the monitoring
server, you can retry a new self-describing monitoring agent installation on the
affected hub or remote monitoring server installations.
To clear up these errors, you must be able to identify the source of the error
condition. These error condition messages are provided by the monitoring server
workload manager (WLM) in RAS1 or MSG2 error messages, or audit facility
messages. You must examine these messages to identify the specific problems in
the monitoring server installation process and in the TAPPLPROPS table.
You can view and delete the monitoring server metadata or seed installation
records stored into the TAPPLPROPS table by using the tacmd listappinstallrecs
command and the tacmd deleteappinstallrecs command. You must first log in to
the monitoring server by using the tacmd login command. For more information
about these commands, see the IBM Tivoli Monitoring Command Reference.
To verify that the managed system group is present, run tacmd listsystemlist at
the command line. If the managed system group is not listed, recycle the
monitoring agent.
As an example, consider the following error messages that were logged even
though the installation was successful. Such messages can be disregarded.
(0002-E9A166E3:kfasdwlm.c,1961,"KFASDM_RequestMgr") Self-Describing Install
Completed Successfully for PRODUCT "CP", VER "05100173", ID "TMS", IDVER
my "05100173".
(003-E9A166E3:kfasdsrm.c,1531,"KFASDM_DeleteTapplpropLocal") Open request for
delete local TAPPLPROPS failed.
status <79> product <CP> product version <05100173>
(0004-E9A166E3:kfasdwlm.c,1976,"KFASDM_RequestMgr") KFASDM_DeleteTapplpropLocal
returned sqlStatus 79. product <CP> product version <05100173>
The most recent version of the monitoring server application support is not
installed through the self-describing mechanism.
To resolve this issue, disable self-describing mode on the agent that has the older
application support and add the most recent version of application support to the
z/OS monitoring server or use the self-describing mechanism to install the new
monitoring server support across your environment. See “Adding application
support to a monitoring server on z/OS” in the Configuring the Tivoli Enterprise
Monitoring Server on z/OS guide.
The return code (3), indicates the software could not create a file due to either an
issue with file permissions or a corruption on the hard drive. First check the file
permissions of the destination file. Running the Windows chkdsk and
defragmentation programs might resolve issues with corruption on the physical
hard drive.
Note: In this case, a 'RC_INFO: Pending rename operations found, must reboot
before installation can continue' message is found in the Abort IBM Tivoli
Monitoring for Databases.log file.
This behavior can happen if a program that locks files is running in the
background, such as Norton Antivirus.
This is an InstallShield limitation. You cannot install the product from the specified
network drive. Try installing from another network drive. Install the product from
a local drive if you continue to receive the error.
A correct message would tell you that required configuration properties were not
provided when using the -p|--property|--properties command line option. After
you have provided the required configuration properties using the
-p|--property|--properties command line option, the agent instance is properly
deployed.
where PC is the product code for the self-describing agent in question, such as nt
for the Windows OS agent. The following example of the output from tacmd
listappinstallrecs -t nt shows how the results of a successful installation might
look:
HUB/RTEMS PRODUCT VERSION GRPID ID IDVER SEEDSTATE STATE STATUS
HUB_TEMS NT 06230000 5655 TMS 06230000 Y IC 0
HUB_TEMS NT 06230000 5655 TPS 06230000 IC 0
HUB_TEMS NT 06230000 5655 TPW 06230000 IC 0
REMOTE_TEMS_ZOS NT 06230000 5655 TMS 06230000 Y IC 0
The command output has entries for the HUB TEMS and any REMOTE_TEMS that the
agent can be connected to. The entries for PRODUCT NT with ID TMS have a value of
IC in the STATE column, Y in the SEEDSTATE column, and 0 in the STATUS column.
An unsuccessful installation and seeding might generate results like those in the
following example:
HUB/RTEMS PRODUCT VERSION GRPID ID IDVER SEEDSTATE STATE STATUS
HUB_TEMS NT 06230000 5655 TMS 06230000 0
The record for PRODUCT NT and ID TMS might not exist or, if it does exist, the STATE
and SEEDSTATE columns are empty. In this example, the results show that the
product NT was manually installed at the hub (STATE column is empty and there is
an entry only for ID TMS).
As a result, the Windows OS product is never seeded and the portal server might
not have product support for Windows OS.
See “Self-describing agent installation” in the IBM Tivoli Monitoring Installation and
Setup Guide and “Self-describing monitoring agents” in the IBM Tivoli Monitoring
Administrator's Guide.
If any entries under /usr/lib64 are missing under /usr/lib, locate and
install the 32-bit version of the compat-libstdc++-33 package.
To enable remote deploy on some workload partitions, you must change the
install.sh file, located in the Tivoli Monitoring depot:
CANDLE_HOME/tables/TEMS/depot/PACKAGES/unix/kci/<VERSION>/install.sh
if [ -n "$fieldSep" ] # BigK.
You receive the following message after running the install.sh file: This system
is using a version of the Korn Shell (ksh) which will allow the
installation of specific older releases of ITM Monitoring application
agents to regress the installation, configuration and execution tools for
this installation. Refer to the Flash note titled "Newer ksh shell may
allow regression of ITM installation" for more information, including steps
to take to avoid damage to your IBM Tivoli Monitoring installation.
The same error when loading the shared object file for the Linux OS agent
shows a path of /data/itm/li6263/lz/bin/klzagent/ (where li6263 is the
platform).
Cause The Linux missing C-runtime library error affects Tivoli Monitoring
products or components running on Linux that make calls to C runtime
functions.
Solution
Ensure that the appropriate 32-bit or 64-bit Linux libstdc++.so.5 library is
installed:
ls /usr/lib/libstdc++.so.5
Installation on UNIX and Linux systems uses install.sh command; running this
command fails with a "runGSkit failure: Return error code: 99" and a "install.sh
failure: Return error code: 4". Because it failed within runGSkit when it called
verifyInstall, review the InstallDirectory/logs/candle_installation.log file and
look for references to runGSkit. For example, output similar to the following might
be present:
runGSkit: ----- Running command: /opt/IBM/ITM/ls3263/gs/bin/private_verifyinstall
/opt/IBM/ITM/ls3263/gs/bin/gsk7ver: error while loading shared libraries:
libstdc++.so.5:
cannot open shared object file: No such file or directory
Error: Verify Failed
Expected Details of gskit in /opt/IBM/ITM/ls3263/gs
Got
runGSkit: return code from command is 99
runGSkit: ----- End of running command -----
runGSkit: error Return error code: 99
runGSkit: error GSKit check failure, script: /opt/IBM/ITM/ls3263/gs/bin/private_
verifyinstall
runGSkit: error ls3263 - GSK check error, verifyInstall test failed
In the example above, the libstdc++.so.5 file is not present. This comes from the
package compat-libstdc++-33-3.2.3-61.s390.rpm. When this package is installed, the
problem no longer occurs.
On some Solaris, Linux, and HP platforms, IBM Tivoli Monitoring has files with
different bit sizes: executable files are 64 bit, while the provided JRE is 32 bit. This
mismatch does not allow the JNI to work properly, so the current method cannot
be used and the previous method will be used.
If the nodename and the host name are not in sync then you must shutdown all
IBM Tivoli Monitoring processes before starting an addition product installation or
upgrade.
The portal server must be restarted to have this change go into effect.
Alternatively, this command can be invoked on Linux and AIX systems using the
itmcmd execute command. Example:
itmcmd execute cq "KfwSQLClient -f myqueries.sql"
Ensure that the DB2 instance is started by running the db2start command as the
instance user:
$ su - db2inst1
$ db2start
Set the encryption key parameter in the silent install file to exactly 32 characters as
in the following example:
INSTALL_ENCRYPTION_KEY=IBMTivoliOMEGAMONEncrytionKey62
Procedure
1. In a terminal window, run the following command to display the Java version:
java -version
2. Determine where the Java executable program is located by entering the
following command:
which java
3. Rename or uninstall the Java program. This effects any other applications
which depend on Java. Be sure that it is safe to do so. If you are unsure,
rename the Java executable program.
4. Run the following command again to ensure that the Java program is not
found in the path:
which java
5. Install the product.
Perform the following steps to resolve this problem before running the installation
again:
1. Run the following command before running any installation or configuration
command for the agent:
export JAVA_COMPILER=NONE
2. Install the s390x.rpm RPM (Red Hat Package Manager) files, in addition to the
s90.rpm files, located in the CD ISO images for Red Hat As 4.0 s390x:
v compat-libstdc++-295-2......s390x.rpm
v compat-libstdc++-33-3.......s390x.rpm
It requires the two s390x.rpm files, in addition to the s390.rpm files. You can
obtain the required RPM files from the CD for Red Hat As 4.0 s390x.
AIX
Review the AIX errors for problem descriptions and solutions during installation
and configuration on AIX systems.
If this error happens with a prerequisite check that is part of the CLI tacmd
command createNode, addSystem, or updateAgent, use the
IGNOREPREREQCHECK=Y option to by pass the prerequisite check failure and
proceed with the agent install.
Use the following steps to ensure the TCP/IP transport service is available:
Note: Failure at any of the following prevents the TCP/IP service from initializing
in the address space.
1. Ensure the INITAPI service is successful. See “The INITAPI call.”
2. Ensure the Name Resolution is successful. See “Name Resolution” on page 106.
3. Ensure the first SEND ran without error. See “The First SEND” on page 107.
Notice that the INITAPI failure is characterized by a return code of (-1) and an
ERRNO value, in this case X'3F3' or decimal 1011. ERRNO-s have names. These
names are found in TCPERRNO.H and decimal 1011 is EIBMBADTCPNAME. The
most common INTIAPI ERRNOs are EMVSINITIAL (156),
EIBMBADTCPNAME(1011), and no-name(10218).
Name Resolution: IBM Tivoli Monitoring V6.1 depends on IBM's HPNS EZASMI
getaddrinfo and EZASMI getnameinfo calls for resolver services. These calls are
used to find the symbolic name and dotted-decimal IP address of the default
network interface for the z/OS image. A failure in either EZASMI call results in
failure to initialize the TCP/IP service for the z/OS address space.
In this example, the interface 'SYSL' is found and source=hostname indicates that
the host name SYSL was successfully resolved to an IP address.
The messages above indicate that no (IPV6) interface is registered. This can also
result in TCP/IP service initialization failure for the IBM Tivoli Monitoring address
space. The absence of an interface can only be fixed by the z/OS TCP/IP
administrator.
When the first network I/O is successful, the response indicates link and transport
connectivity with the hub computer.
Repairing errors in the Server list: Errors in SNA initialization might be name
mismatches. Examine the LLB entries and the GLB entries for the Omegamon
Platform address space for typographical errors. The VTAM® network ID is victim
of frequent error. Typically the VTAM network ID (the first component of the SNA
socket address, ATOOEN01 in the example above) is the same for ALL entries.
While it CAN differ, typically, it does not. A difference between the LLB VTAM net
ID and the GLB VTAM net ID is often an error in member KDCSSITE of
RKANPAR.
If the bind failure is due to EADDRINUSE but the Broker service is started, the
error might not be fatal. Determine whether or not the bind of this IBM Tivoli
Monitoring address space was to the LLB in that address space. In some instances,
an address space can bind to the LLB of another address space. This can only
occur in the same system image. If the bind failure is fatal, then another process on
this system image has the 'well-known' port. Bind failures due to insufficient
authorization are fixed by granting RACF permission for the OMVS segment to the
Omegamon Platform started task name.
Do the following if the Tivoli Enterprise Monitoring Server hub is not available:
v Review the RAS1 log for the Tivoli Enterprise Monitoring Server to ensure it is
connected.
v Review network topology to ensure Firewall policy does not prohibit connection
initiation from the Tivoli Enterprise Monitoring Server hub.
v Review "Transport opened" on the Tivoli Enterprise Monitoring Server to ensure
at least one transport service is common between it and this Tivoli Enterprise
Monitoring address space.
You can increase the following storage-related parameters if IBM Software Support
personnel instructs you to do so:
v Web Services SOAP Server
v startup console messages
v communications trace
v storage detail logging and associated intervals
'DATA SET NOT FOUND' JCL error occurs when submitting the
DS#3xxxx monitoring server create runtime members job.
If you get a DATA SET NOT FOUND error when you attempt to create runtime
members on your z/OS Tivoli Enterprise Monitoring Server, check that the RTE
build job was successful and that the fields were set correctly.
The 'Enter required field' error occurs for the 'Global location
broker applid of Hub' or the 'Network ID of Hub' VTAM-related
fields
About this task
Note: Neither of these values adversely affect the connection for the remote
Tivoli Enterprise Monitoring Server.
2. From the Configure the Tivoli Enterprise Monitoring Server main menu,
select the Specify communication protocols option.
3. On the Specify communication protocols panel, specify the IP protocols of
choice and ensure. Specify SNA.PIPE as one of the protocols that the remote
Tivoli Enterprise Monitoring Server uses for connection.
4. Navigate forward to specify the communication protocols values for the
selected protocols.
5. From the Configure the Tivoli Enterprise Monitoring Server main menu,
select the Create runtime members option to generate the DS#3xxxx Create
runtime members job.
6. Submit the job and check for good condition codes.
7. From the Configure the Tivoli Enterprise Monitoring Server main menu,
select the Complete the configuration option.
8. Review the remaining tasks to finish the configuration of the product before
starting the Tivoli Enterprise Monitoring Server.
To confirm the problem, enter the following command in the logs directory on the
portal server: grep "PORT not allowed after EPSV ALL" itm_config*.trc.
If you do not have the Integrated Cryptographic Service Facility installed, you
need to add USE_EGG1_FLAG=1 to the Tivoli Enterprise Portal Server
configuration to enable connection with the Tivoli Enterprise Monitoring Server.
Complete the following steps so that the monitoring server can connect to the
portal server:
Procedure
1. During monitoring server configuration, select Configure the Tivoli Enterprise
Monitoring Server > Specify configuration values > Integrated Cryptographic
Service Facility (ICSF) installed?and specify N(No).
2. After the monitoring server configuration is complete and it is running, the
Tivoli Enterprise Portal Server configuration must be modified to use an older,
less robust encoding algorithm. Edit the kfwenf file in install_dir\CNPS, where
install_dir is C:\IBM\ITM by default in a text editor:
a. On a new line, type USE_EGG1_FLAG=1
b. Save the file and exit.
c. Stop the portal server if it is running and restart it.
Procedure
1. Back up all of the following datasets:
v Runtime
v Target
v INSTDATA
Remove the agent instances first and then uninstall the agent. If the agent was
already uninstalled, you can re-install the agent again, remove the instances, and
then uninstall the agent again.
Use the following steps to remove a partially installed IBM Tivoli Monitoring
installation:
1. Ensure that there is no entry in the Add and Remove Programs tool for the
component that you attempted to install. If there is an entry, use that entry to
uninstall the product. If there is no entry, proceed to the next step.
2. Open the Windows Explorer and navigate to the IBM Tivoli Monitoring
installation directory (C:\IBM\ITM by default).
The first step to removing a failed upgrade is to determine where the installation
failed: either before the files were copied or after the files were copied. For GUI
installations, the files are copied after you click Next on the Start Copying Files
window. If you performed a silent installation, look for a pair of entries separated
by a blank line:
FirstUIBefore exiting to file copy
FirstUIAfter entry after file copy
If both entries exist, the installation failed after the files were copied. See
“Installation failed after files were copied”
If the entry exists in the Add and Remove Programs tool, you can still use your
existing Candle OMEGAMON installation. Launch Manage Candle Services to start
all components.
Check the Add and Remove Programs tool to see if either your previously
installed Candle OMEGAMON or IBM Tivoli Monitoring is available.
If neither are available, see “Neither products are available in the Add and Remove
Programs tool.”
If one is available see “One product is available in the Add and Remove Programs
utility” on page 119.
If both are available, “Both products are available in the Add and Remove
Programs tool” on page 120.
Neither products are available in the Add and Remove Programs tool:
About this task
Use the following steps if neither Candle OMEGAMON or IBM Tivoli Monitoring
exists in the Add and Remove Programs tool:
1. Open the Windows Explorer and navigate to the IBM Tivoli Monitoring
installation directory. By default, the installation location is
C:\IBM\ITM\Install, C:\IBM\ITM\InstallITM, C:\Candle\Install, or
C:\Candle\InstallITM.
2. Launch the Manage Tivoli Enterprise Monitoring Services utility by
double-clicking the KinConfg.exe file located in either the Install or
InstallITM subdirectory. Launch the KinConfg.exe from the InstallITM
directory if possible.
3. If any agents, the portal server, or the monitoring server are listed in the
window, right-click each and click Advanced > Unconfigure. Repeat this step
for all components that are listed.
4. Open the Windows Control Panel.
5. Double-click Administrative Tools and then double-click Services.
If the Windows Add and Remove Programs utility has an entry for Candle
OMEGAMON or IBM Tivoli Monitoring, attempt to uninstall.
Use the following steps if an entry for either Candle OMEGAMON or IBM Tivoli
Monitoring exists in the Add and Remove Programs utility:
1. Attempt to uninstall both releases from the Add and Remove Programs entry.
2. If this is successful, proceed to “Verifying the uninstallation” on page 120.
3. If this is not successful and the entry has been removed from the Add and
Remove Programs tool, see “Neither products are available in the Add and
Remove Programs tool” on page 118.
4. If the entry is still present in the Add and Remove Programs tool, copy the
KINWIINS.VER file (from the \WINDOWS\VERFILES\KINWIINS.VER directory on the
installation CD) to the <install_dir\Install\Ver directory, where install_dir is
the IBM Tivoli Monitoring installation directory.
Delete the KINWIINSMSTR.VER file from this directory if it exists.
Both products are available in the Add and Remove Programs tool:
About this task
Use the following steps if both the Candle OMEGAMON or IBM Tivoli Monitoring
entries exist in the Add and Remove Programs tool:
Procedure
1. Uninstall IBM Tivoli Monitoring through the Add and Remove Programs tool.
2. Uninstall Candle OMEGAMON through the Add and Remove Programs tool.
3. Proceed to “Verifying the uninstallation.”
Use the following steps to verify that you successfully removed the failed
installation:
1. Verify that the installation home directory and all contents have been removed.
2. Open the Registry Editor by clicking Start > Run and typing regedt32. Click
OK.
3. Expand the key HKEY_LOCAL_MACHINE registry key.
4. Expand the SOFTWARE registry key.
5. Verify that the Candle registry key and all sub-keys have been removed from
HKEY_LOCAL_MACHINE\SOFTWARE.
You can manually uninstall the operating system agent by running the command
for your platform as listed in Table 11:
Table 11. Uninstall OS command
Windows UNIX-based systems
LCF_BINDIR\\..\\TME\\ITMUpgrade LCF_BINDIR/../TME/ITMUpgrade/
\\ITMUpgradeManager\\setup. ITMUpgradeManager/uninstall.sh
Contact IBM Software Support if you cannot uninstall the agent. See Chapter 2,
“Logs and data collection for troubleshooting,” on page 5 for information on what
types of data to collect before contacting Support. See the IBM Support Portal
(http://www.ibm.com/support/entry/portal/software).
The Microsoft MS SQL Server database was not running and the portal server is
installed and configured with a DB2 database and not an MS SQL Server database.
It is likely that the system you are using at one time had an MS SQL Server
database installed that was not properly uninstalled. It does not matter whether
the database is running or not; if the data source exists you will be asked the
question, and if you answer yes there will be an attempt to remove the database.
When the Tivoli Enterprise Portal detects a connection error, it can repair the error
and your client session can resume unaffected. Use the connectivity topics to
diagnose and recover from connectivity problems.
If you are running the Tivoli Enterprise Monitoring Server on z/OS, see
“Troubleshooting z/OS-based installations” on page 104 for information about
configuration problems affect connectivity.
The following table provides resolutions for problems logging in to the Tivoli
Enterprise Portal Server.
Table 12. Cannot log in to the Tivoli Enterprise Portal Server
Problem Corrective action and solution
User authorization failed v Ensure that the user ID and password are correct. (The user ID must use
10 or fewer ASCII characters and contain no spaces. The name is limited
-OR-
to 8 characters if user authentication is at the hub monitoring server and
KFWITM215E: Unable to process uses RACF® security for z/OS.)
logon request
v Verify that the monitoring server has started.
v Define the user in the portal server.
v Configure the TEPS or TEPS2 data sources.
v If security validation is active on the hub monitoring server, make sure
the user ID is defined to the security system.
For more information on security validation see the IBM Tivoli Monitoring
Installation and Setup Guide (http://pic.dhe.ibm.com/infocenter/tivihelp/
v61r1/topic/com.ibm.itm.doc_6.3/install/itm_install.htm) or IBM Tivoli
OMEGAMON XE and Tivoli Management Services on z/OS: Common
Planning and Configuration Guide (http://pic.dhe.ibm.com/infocenter/
tivihelp/v61r1/topic/com.ibm.omegamon_share.doc_6.3/
zcommonconfig/zcommonconfig.htm).
KFWITM010I: Tivoli Enterprise v Wait for the portal server to establish connection.
Portal Server not ready. To determine whether or not the portal server is ready for portal client
logon, search the portal server trace log for this text string: Waiting for
-OR-
requests. If that string is not found, the portal server has not completed
KFWITM402E: Communication with the initialization. Portal server initialization can take as long as 20 minutes.
Tivoli Enterprise Server could not
be established. To view the trace log, open Manage Tivoli Monitoring Services, right-click
the portal server, and select Advanced > View trace log
v Recycle the portal server.
For more information see “The portal server does not start or stops
responding” on page 17.
If the status bar displays the If you are an administrator, restart the monitoring server. Otherwise, notify
Validating User Credentials message an administrator and wait for the monitoring server to be restarted.
continuously, the monitoring server
stopped.
-OR-
-OR-
If the LDAP connection is broken and the normal procedure to switch off
LDAP-based authentication does not work, the following steps need to be
performed:
1. For AIX and Linux systems, stop the portal server with the ./itmcmd
agent stop cq command invoked from the installation directory.
2. Run the ./disableLDAPRepository.sh script from candle_home/arch/iw/
scripts, where arch is the system architecture, for example "li6263" or
"aix533."
3. Reconfigure the portal server and disable LDAP authentication using the
./itmcmd config -A cq command invoked from the installation
directory.
4. Start the portal server with the ./itmcmd agent start cq command
invoked from installation directory. The portal server authentication
through the monitoring server is now enabled.
5. If the monitoring server was also configured to use LDAP and the
reason for this procedure being applied is LDAP being out of service,
ensure you also change the monitoring server configuration to not
authenticate through LDAP, following steps from the monitoring server
configuration help.
1. For Windows systems, stop the portal server service using the Manage
Tivoli Enterprise Monitoring Services application. \
2. Run the disableLDAPRepository.bat script from candle_home\CNPSJ\
scripts.
3. Reconfigure the portal server using the Manage Tivoli Enterprise
Monitoring Services application and disable the "Validate User with
LDAP" option.
4. Start the portal server service using the Manage Tivoli Enterprise
Monitoring Services application. The portal server authentication
through the monitoring server is now enabled.
5. If the monitoring server was also configured to use LDAP and the
reason for this procedure being applied is LDAP being out of service,
ensure you also change the monitoring server configuration to not
authenticate through LDAP, following the steps from the monitoring
server configuration help.
KFWITM001W Unable to 1. Check the kfw1ras.log for details if an attempt to log in fails with message
connect to Tivoli KFWITM001W. The kfw1ras.log can list any of the following messages that
Enterprise Portal Server indicate a reason for the failure:
KFWITM215E Unable to v SQL1224N A database agent could not be started to service a request,
process logon request or was terminated as a result of a database system shutdown or a
force command.
v SQL1226N The maximum number of client connections are already
started. SQLSTATE=57030
Both messages SQL1224N and SQL1226N occur when the portal server
attempts to validate the user ID entered in the browser.
2. Restart the database.
3. Attempt the log in again.
–OR–
ping -s hostname
The ping command displays the IP address of the computer it pings. Ensure that
the IP address is the same as the one to which the remote portal client is
attempting to connect. For example, if your portal server is located on a host with
the host name tepshost, and the host's /etc/hosts file includes an entry similar to
the following:
The localhost must be an IPV4 interface and not IPV6. Running ping tepshost
displays an IP address of 127.0.0.1, which is the address of the local loopback
network interface and the reason a remote computer cannot connect to it. In this
case, you must update the /etc/hosts file to give tepshost its own entry, as in the
following example:
127.0.0.1 localhost.localdomain localhost
192.168.0.9 tepshost
Note: Do not use localhost for 127.0.0.1 and simultaneously for ::1 (which is the
IPv6 address).
Tivoli Enterprise Portal client 1. Select Start > Programs > IBM Tivoli Monitoring > Manage Tivoli Enterprise
cannot find the Tivoli Enterprise Monitoring Services.
Portal Server
2. Check that the portal server service (KfwServices.exe) is running and, if not,
restart it.
3. If the portal server is started, see the KFWRAS1.LOG for any errors reported by
the portal server application.
After the portal server starts, an “event ID 1: KFW0001 - Tivoli Enterprise
Portal Server startup complete” information entry is logged.
If you see an error entry, double-click the item to see the description.
Review the issues that you might encounter when establishing a connection to the
dashboard data provider and viewing the infrastructure management dashboards
or custom dashboards.
The portal server is not ready for requests from the portal client until its process,
kfwservices.exe is fully started. Keep the Logon window open and click OK after
waiting a moment or two.
When the Tivoli Enterprise Portal Server is available again, your client session is
automatically reconnected and you can resume normal interactions.
v If you are running the portal client in browser mode and the portal server is
available but the client does not reconnect after several minutes, exit the
browser, restart the browser, and log on to the portal server again.
v If you are running the browser client and, from the same window or a tabbed
window, log on to a different portal server, the disconnections might continue to
occur and you must exit one of the work sessions. Complete one of the
following steps to avoid intermittent disconnection:
– Start another instance of the browser in the same way that you started the
first instance, such as from the task bar.
– Use two different browsers, such as Firefox and Internet Explorer.
– Use two different portal client types, such as the browser client for one portal
server and desktop client for a different portal server.
Check the following log files for messages that indicate why the Tivoli Enterprise
Portal Server did not start:
Before completing the steps below, verify with your database administrator that
the following conditions are not the cause of the problem:
v The database manager has not been started on the database server.
v The database manager was stopped.
v The database agent was forced off by the system administrator.
v The database manager has already allocated the maximum number of agents.
v The database agent was terminated due to an abnormal termination of a key
database manager process.
If the problem is not due to any of the above, it is most likely that the application
is using multiple contexts with the local protocol. In this case, the number of
connections is limited by the number of shared memory segments to which a
single process can be attached. For example, on AIX, the limit is ten shared
memory segments per process.
Procedure
1. On the computer with the database that you want to connect to, configure the
database manager to use TCP/IP on AIX.
2. On the server system, log in as the DB2 instance owner.
3. Set DB2COMM to TPC/IP, for example:
db2set DB2COMM=tcpip
4. Edit the /etc/services file to include both a DB2 connection service port and a
DB2 interrupt connection if they do not already exist, such as,
db2cDB2 50000/tcp # DB2 connection service port
db2iDB2 50001/tcp # DB2 interrupt connection
# service port
5. Update the database manager configuration, such as, db2 update dbm cfg
using svcename db2cDB2. The argument after svcename must match the name
of the DB2 connection port service that you placed in /etc/services.
6. Start and stop DB2:
What to do next
For many shell environments, the user ID has a .profile file that is run during the
initial processing to ensure a consistent working environment and must satisfy
certain requirements.
If any of these requirements are violated, then the results can be failure to start or
even failure for normal server processes to start. The .profile should be simple
and clear. This might require creating a special user ID for this purpose to avoid
impacting other users.
The VMware company is aware of this issue, and has written several articles that
address this problem. Search on “linux guest clock” in the VMware Knowledge
Base (http://kb.vmware.com/selfservice/microsites/microsite.do). See also IBM
Service Management Connect (http://www.vmware.com/files/pdf/Timekeeping-
In-VirtualMachines.pdf)
Note: Running Tivoli Enterprise Monitoring Agents at V6.1 and V6.2 on the same
computer requires Java 1.4.2 and Java 1.5 on that computer. However, having
multiple versions of Java installed on a Windows platform is not a best practice,
especially if the browser client is being used on that computer. Refer to your
monitoring agent user's guide or consult your administrator before implementing
multiple versions of Java.
delete the WebRenderer directory under the home path. For Linux systems, this
path is $HOME/.webrendererswing, and for Windows systems, this path is
%HOMEPATH%/.webrendererswing.
IBM Java 7 is the preferred default version of Java being shipped with IBM Tivoli
MonitoringV6.3 or later. It is possible to get a warning message when you start the
browser client if there are application support files that contain unsigned content.
These support files typically contain text for areas of the portal client where
application-level information is displayed, such as the Physical Navigator view and
predefined workspaces.
IBM Java 7 detects the use of downloaded files with unsigned content and, if
found, provides a warning message. In the context of the portal client, you must
answer No to enable the content. Otherwise, the blocked content disables certain
Portal desktop client called from Java Web Start does not
work properly after adding agent support
Review the symptom and possible solution if the Java Web Start-installed Tivoli
Enterprise Portal client is getting errors after installing agent application support
on the Tivoli Enterprise Portal Server.
Symptom
After starting the portal desktop client, you get a Java exception and an
“unable to load resource” message with an accompanying file name.
Solution
1. Exit the portal desktop client if you are logged on.
2. Start Manage Tivoli Enterprise Monitoring Services as described in
“Starting Manage Tivoli Enterprise Monitoring Services” on page 3.
3. Right-click Tivoli Enterprise Portal Server and take one of the following
steps.
Click Reconfigure, and click OK twice to accept the existing
configuration.
Click Configure and accept the defaults settings when
prompted for configuration choices.
4. After finalizing the configuration, restart the portal server.
5. Restart the portal desktop client through the Java Web Start
If you review the log for the Tivoli Enterprise Portal desktop client on the Linux
system, install_dir/logs/kcjras1.log, you see the following error:
EXCEPTION: Attempting to load home workspace:
java.lang.IllegalArgumentException: Width (0) and height (0)
Take one of the following steps to edit the cnp.sh startup file:
Procedure
v Add the Java system property, “-Dawt.toolkit=sun.awt.motif.MToolkit”:
1. Locate the install_dir/architecture/cj/bin/cnp.sh file on the Linux
system.
2. Change this line: ${TEP_JAVA_HOME}/bin/java -Xms64m -Xmx256m
-showversion -noverify -classpath ${CPATH} -Dkjr.trace.mode=LOCAL
... to include the system property, ${TEP_JAVA_HOME}/bin/java -Xms64m
-Xmx256m -showversion -noverify -classpath ${CPATH}
-Dawt.toolkit=sun.awt.motif.MToolkit -Dkjr.trace.mode=LOCAL ...
v Use a 1.6 JRE to run the Tivoli Enterprise Portal desktop by doing the following:
1. Locate the install_dir/architecture/cj/bin/cnp.sh file on the Linux
system.
By default, both the old and the new passwords continue to work for
approximately one hour after the password change. After one hour, the old
password stops working. Windows 2003 Service Pack 1 introduced this behavior
into Active Directory. Please refer to Microsoft KB article 906305 for information on
what occurs and for instructions on disabling the behavior if necessary.
If you log onto a Windows system with an administrator name with non-latin1
characters, you cannot log onto the portal server by either the Tivoli Enterprise
Portal desktop client or the Tivoli Enterprise Portal browser client. Set the
cnp.browser.installdir Tivoli Enterprise Portal parameter to a path that does not
contain any non-latin1 characters by completing the following steps:
1. Click Start > Programs > IBM Tivoli Monitoring > Manage Tivoli Monitoring
Services.
2. In the Manage Tivoli Monitoring Services window, right-click Tivoli Enterprise
Portal > Browser > Tivoli Enterprise Portal > Desktop_InstanceName > and
click Reconfigure.
3. In the client configuration window that opens, double-click the
cnp.browser.installdir parameter.
You receive the error message: invalid id/password. You must set the local security
policy on an Active Directory server. If you installed IBM Tivoli Monitoring on a
system where Microsoft Windows Active Directory is installed, you must update
the local security policy on that system to include the sysadmin user so that you
can log on to the Tivoli Enterprise Portal. This configuration task is necessary only
on Active Directory systems and must be performed whether or not you plan to
use the Monitoring Agent for Active Directory. Follow these steps to configure the
local security policy:
v Enter secpol.msc at a command prompt. The Local Security Settings window is
displayed.
v In the navigation pane on the left, select Local Policies -> User Rights
Assignments.
v In the list of policies, right-click Log on locally and select Security.
v Click Add. The Select Users or Groups window is displayed.
v Select sysadmin from the list of user names.
v Click Add.
v Click OK.
v On the Local Security Policy Setting window, verify that the check box in the
Effective Policy Setting column is selected for the sysadmin user.
v Click OK to save the new setting and exit.
Workspaces
Workspaces display queried data from the monitoring agents that are installed on
your monitoring network and from the Tivoli Enterprise Monitoring Servers and
Tivoli Enterprise Portal Server. If you observe unusual behavior, get errors, or get
no workspace at all, review the possible causes and solutions.
1. Open a shell prompt and locate the plugins directory associated with
your Firefox browser (such as /usr/lib/mozilla/plugins). If IBM Java
7 is installed and the next-generation Java plug-in is registered with
your Firefox browser, the following shared object file link should be
present in the Firefox /plugins directory: libnpjp2.so. If a link to this
file is not in the Firefox /plugins directory, continue with the following
steps to register the Java plug-in.
2. If the /plugins directory has no libnpjp2.so file link, but does have a
libjavaplugin_oji.so link, delete the link with the following command
(might require root authority to execute):
rm -f libjavaplugin_oji.so
Ensure that you configure all groups that supply data to the view. In the Historical
Configuration view, ensure that data collection is started for all groups that supply
data to the view. Views containing multi-row attributes show this message if no
row data are collected.
Clicking on the Timespan icon for one view brings up the data
for another view
The timespan panel provides this check box at the bottom of the panel: Apply to
all views associated with this view's query. If both views in question share the
same query, and this check box is selected, a change in one view's time span also
affects the other as expected. However, when the check box is unselected, this
behavior is still exhibited.
If you want one of the views to not have the time span, you must now go back in
and change it manually.
Procedure
1. Stop the Tivoli Enterprise Portal Server.
Situations
Review the Tivoli Enterprise Portal situation troubleshooting scenarios for
situations to find a solution to situation and event related problems.
Do the following to determine why situations are not firing in the Tivoli Enterprise
Portal :
v Confirm the situation is firing in the event console.
v Ensure that the situation was distributed.
v Verify whether the situation is associated with an item in the Tivoli Enterprise
Portal Navigator view.
v Ensure that the situation condition is true.
v Check the operations log of the agent.
See also “Environment variables” in the IBM Tivoli Monitoring Installation and Setup
Guide; and “Adding an Until modifier” (Reset Interval Expires option) in the Tivoli
Enterprise Portal User's Guide.
Workaround: If you see this error, check data sets whose names end in RKDS* to
determine if they are out of space or have run out of extents. For example,
&rhilev.&rte.&vsamfsv.RKSSSITF, where &rhilev is the VSAM runtime high-level
qualifier , &rte is the RTE name, and &vsamvsf is the monitoring server EIB VSAM
low-level qualifier." Refer to the TEMS started task to see a complete list of VSAM
EIB files.
Do the following to ensure that data can return to the Tivoli Enterprise Portal:
v Ensure that the monitoring agent is online.
v Verify that all the application-related files were installed with the Tivoli
Enterprise Portal Server.
v Check the kfwras1.log for errors.
v Set the following trace option in the KFWENV file:
(UNIT:ctsql INPUT)
The default presentation assumes that each monitoring agent type comes from a
different IP address and host name. When that is not true, the monitoring agent
displays at an unexpected or random location.
One example is when multiple agents are installed on a single server. Another
example is when a high availability option like Microsoft Windows Clustering is
used and an IP address is shared.
Additionally, Navigator items are ordered internally on the Tivoli Enterprise Portal
and therefore might not reflect a logical ordering in any particular language.
Changes to the agent environment variable CTIRA_HOSTNAME affect the display name
in the workspace but not within the Navigator view. Here is an explanation of how
to configure the Tivoli Enterprise Portal Server for desired navigator item display.
The portal server has a configuration environment variable that changes the
Navigator to depend on the host name instead of the IP address:
KFW_TOPOLOGY_CLUSTER_LIST. This variable is added to install_dir\cnps\kfwenv
file in Windows or install_dir/config/cq.ini in Linux and UNIX. Its purpose is
to force dependence on the host name for Navigator positioning instead of on the
host address. A sample setting looks like this:
KFW_TOPOLOGY_CLUSTER_LIST=AFF_xxx AFF_yyy
Where the setting lists the agent affinities to which this host name logic should
apply. Here are some affinities for monitoring agents distributed with IBM Tivoli
Monitoring:
AFF_NT_SYSTEM "Windows OS"
AFF_ALL_UNIX "UNIX OS"
AFF_LINUX_SYSTEM "Linux OS"
As an example, if you needed to use the host name for Navigator positioning of
your Linux OS and UNIX OS monitoring agents, it would look like this:
KFW_TOPOLOGY_CLUSTER_LIST=AFF_ALL_UNIX AFF_UNIX_LOG_ALERT AFF_LINUX_SYSTEM
There are many agents, so listing the affinity for each would be cumbersome. Here
is a general method of figuring out the correct affinity name:
1. If you have an agent connected and showing as online, click on the navigator
tree top node, then right click and select Managed System Status.
2. Make a temporary change to this workspace to show the agent affinity by
right-clicking on any row and selecting Properties.
In many cases, after you have completed the steps above, you have finished this
task. However, there are a few cases where additional configuration is necessary:
v The host address is normally used for system identification, however, with
KFW_TOPOLOGY_CLUSTER_LIST set, the first part of the Origin Node is used
(“Primary:” is ignored). That first part defaults to the TCP/IP host name, but it
is replaced if CTIRA_HOSTNAME is set. This gives you control over deciding where
an agent is positioned. Configuring an agent to have a specific CTIRA_HOSTNAME
value manages cases where the IP address changes, and then
KFW_TOPOLOGY_CLUSTER_LIST forces the navigation display to use that apparent
host name instead of the TCP/IP host name.
v Some agents require a different mechanism to change the apparent host name. In
the case of the WebSphere MQ Monitoring Agent, for example
(AFF_MVS_MQM), update the mq.cfg file and add:
SET AGENT NAME(hostname)
v Long host names can create confusion, such as when a host name is fully
qualified. The default portal server processing only uses the first part of the fully
qualified name, so two names like abc.xyz.large.com and abc.def.large.com
would both appear at the node labeled abc. That can be controlled by adding
the following to the portal server environment file (install_dir\cnps\kfwenv or
install_dir/config/cq.ini):
KFW_TOPOLOGY_KEEP_DOT_IN_NODE_NAMES=Y
v Remember, the managed system names are limited to 32 characters, therefore,
the hostname:product can be truncated and cause accidental duplications. That
would be another case where you would need to set the CTIRA_HOSTNAME.
The Tivoli Enterprise Portal client uses the IBM Java Plug-in, which is
automatically installed on your computer with the portal client. Adjust the IBM
Java Plug-in properties if performance is slow or your workstation receives
Procedure
1. Open the Windows Control Panel.
2. Double-click IBM Control Panel for Java(TM) to display the Java(TM) Control
Panel.
3. From the Java(TM) tab:
a. Click View.
b. Double-click on the Java Runtime Parameters field and enter: -Xms128m
-Xmx256m .
4. Click OK.
5. From the General tab complete the following steps to clear the browser cache:
a. Click Delete Files...
b. Check the box labeled Downloaded Applets.
c. Click OK.
6. Click OK in the Java(TM) Control Panel.
Note: On 64-bit systems when a 32-bit SUN JRE is already installed, the 32-bit
SUN JRE control panel appears independently when you access Start->Control
Panel->Add or Remove Programs->Java. After installing a 64-bit SUN JRE over the
32-bit SUN JRE, the 32-bit SUN JRE control panel disappears from the location,
and the 64-bit SUN JRE control panel appears when you access Start->Control
Panel->Add or Remove Programs->Java instead of the 32-bit SUN JRE control
panel. If you want to access the 32-bit SUN JRE control panel, use the
32bit_jre_install_dir/bin/javacpl.exe file.
Category and Message field of the universal message does not accept
DBCS
About this task
Agents display offline in the portal client but they are still started
The Tivoli Enterprise Portal Navigator Physical view shows that some monitoring
agents are offline, but the situations continue to be sampled and agent logs show
them to be running.
This error can occur if the agent names in the group identified contained
embedded spaces. Agent names cannot contain embedded spaces. Edit the agent
names to remove the spaces. The CTIRA_HOSTNAME environment variable on the
monitoring agents must include a specific definition for correct agent host names.
If you do not find the resolution to your problem, review the topics in Chapter 6,
“Connectivity troubleshooting,” on page 123.
Users who run the IBM HTTP Server do not have permission to the
content directory
During configuration and startup of the Tivoli Enterprise Portal Server, the system
attempts to confirm that the user running the IBM HTTP Server has permission to
access the IBM HTTP Server content directory. If not, one of the following
messages are displayed:
KCIIN2723W User who runs IHS (IBM HTTP Server) does not have proper
permissions to IHS content directory. Do you want to continue?
or
KCIIN2724W User who runs IHS (IBM HTTP Server) does not have proper
permissions to IHS content directory.
This error occurs when the IBM HTTP Server is running under a user that does
not match the user specified in the CANDLE_HOME/ARCH/iu/ihs/httpd.conf file. For
the portal server to run correctly, the user who is running the IBM HTTP Server
must have access to the IBM HTTP Server content directory.
The portal server and Warehouse Proxy Agent fail to connect to the
database on a 64-bit Windows system
The ODBC control panel available in the Windows Start menu is for 64-bit ODBC
configuration. Since the portal server and the Warehouse Proxy Agent are 32-bit
applications the 32-bit ODBC control panel must be used. Launch the 32-bit ODBC
window from C:\Windows\SysWOW64\odbccp32.cpl and manually create the
"ITM Warehouse"/"TEPS" ODBC source by going to the System DSN tab and
clicking Add.
If you do not want to use Secure Socket Layer communication between Tivoli
Monitoring components and the Tivoli Enterprise Portal Server, use the following
steps to disable it:
1. In Manage Tivoli Enterprise Monitoring Services, right-click Tivoli Enterprise
Portal Server.
2. Click Advanced > Edit ENV file.
3. Find the following line:
kfw_interface_cnps_ssl=Y
4. Change the Y to N.
5. Save the file and exit.
6. Click Yes when you are asked if you want to recycle the service.
When using DB2 8.1 or 8.2, you must install the correct fix pack versions of DB2 in
order to create a Tivoli Enterprise Portal Server database. These fix pack versions
are:
v DB2 V8.1 with Fix Pack 10 or higher fix packs
v DB2 V8.2 with Fix Pack 3 or higher fix packs
Also, on AIX systems, a failure occurs when you attempt to install a Tivoli
Enterprise Portal Server with a DB2 database. Using the db2 installation user ID
(default is db2inst1), do the following:
1. Stop the DB2 server if not already stopped using the following command:
cd /db2inst1/sqllib/adm
db2stop
2. Issue the following configuration changes:
export EXTSHM=ON
db2set DB2ENVLIST=EXTSHM
db2set -all
3. Using your preferred editor add the following lines to the /db2inst1/sqllib/
db2profile file:
EXTSHM=ON
export EXTSHM
4. Restart the DB2 server using the following command:
cd /db2inst1/sqllib/adm
db2start
5. Restart the Tivoli Enterprise Portal Server using the following command:
cd /opt/IBM/ITM/bin
./itmcmd agent start cq
You receive a KFW error when a query is sent to more than 200
managed systems
You receive the following error when a query is sent to more than 200 managed
systems:
KFWITM217E Request error: Request to xxx nodes exceeds the limit of 200.
Please specify a smaller distribution or increase the maximum.
There is a default limit of 200 nodes for any single query for a workspace view. If
the following conditions exist in the query for a workspace view, you must
increase the KFW_REPORT_NODE_LIMIT environment variable for the Tivoli Enterprise
Portal server environment variable as described below:
v The query is assigned to a managed system list that contains more than 200
managed systems.
OR
v More than 200 managed systems are explicitly assigned to a query in any
workspace view.
Under these conditions, you must increase the following Tivoli Enterprise Portal
server environment variable.
KFW_REPORT_NODE_LIMIT=xxx
where xxx is an integer that is equal to or greater than one of the following:
v The number of managed systems defined in a managed system list.
OR
v Explicitly assigned to a query over 200 in a Tivoli Enterprise Portal workspace
view.
After you change the KFW_REPORT_NODE_LIMIT variable, you might receive the
following error:
KFWITM217E Request error: SQL1_CreateAccessPlan failed, rc=1.
Typically this problem is caused when too many explicitly defined managed
systems are assigned to a query for a workspace view. The best practice for
resolving this problem is as follows:
1. Create a managed system list that specifies the explicitly defined managed
systems.
2. Remove the explicit assignments from the query.
3. Assign the managed system list to the query.
Alternatively, you can reduce the number of managed systems that you explicitly
define in the query.
When non-hub situations are associated at the Tivoli Enterprise Portal Server, they
turn TRUE, meaning they are visible in the situation event console. Through the
Situation editor, if you assign all the agents and managed systems lists from a
situation, that situation event continues to appear in the situation event console.
To avoid this situation, use the root account or an account with granted required
permissions (itmuser group).
You can increase the maximum heap size of the TEPS/e JRE to improve server
response time and better accommodate a larger monitored environment.
The memory required by the TEPS/e depends on the size of the monitoring
environment and on the Java heap size parameters. The default maximum Java
heap size is 1500 MB, which is appropriate for 32-bit systems. On 64-bit systems, if
you encounter “out of memory” exceptions, you can increase the maximum heap
size of the TEPS/e JRE to values greater than 1500 MB to allow more memory
usage and accommodate a larger monitored environment.
Complete the following steps on the computer where the portal server is installed
to adjust the TEPS/e maximum heap size.
Procedure
1. At the command prompt, change to the bin directory of your Tivoli Monitoring
installation:
v install_dir\cnpsj\profiles\ITMProfile\bin
v install_dir/platform/iw/profiles/ITMProfile/bin
2. Enter the following command to start the eWAS console:
v wsadmin.bat
v wsadmin.sh
3. At the wsadmin prompt, enter the following command: $AdminConfig modify
[$AdminConfig list JavaVirtualMachine] "{maximumHeapSize 2000}"
4. Enter the following command to save your changes: $AdminConfig save
5. Enter the following command to exit: quit
What to do next
Restart the portal server. If you continue to encounter “out of memory” exceptions,
try increasing the maximum heap size again.
Messages related to the index file are displayed when the agent fails
back to a remote monitoring server
These messages indicate that the remote monitoring server was stopped forcefully
(for example, when it crashes), but that the database is not corrupted. The
messages help to ensure that even though the remote monitoring server stopped
unexpectedly, no loss of data occurred, and that the database has been restored
successfully.
To avoid this issue, install V6.2.3 application support files on the remote
monitoring server. After you restart the remote monitoring server, the SOAP
requests will work.
The total number of entries registered in the hub monitoring server Global
Location Broker from the Warehouse Proxy Agent are a combination of the
following amounts:
The following example of Global Location Broker entries for the Warehouse Proxy
Agent is from a remote monitoring server RAS1 log when the remote monitoring
server has the configuration parameters KDC_DEBUG=Y and KDE_DEBUG=Y set
in its environment:
+4A8F0367.007D object: 85f536a00000.02.0a.09.fe.31.00.00.00
+4A8F0367.007D type: 85f532330000.02.0a.09.fe.31.00.00.00
+4A8F0367.007D interface: 865fc14a0000.02.0a.09.fe.31.00.00.00
+4A8F0367.007D annotation: Candle_Warehouse_Proxy
+4A8F0367.007D flags: 0x2 addr-len: 16
+4A8F0367.007D saddr: ip:#9.77.148.246[205]
The Linux runtime libraries determine how memory is allocated when Tivoli
Monitoring processes request RAM. Tivoli Monitoring does not have control over
these algorithms. It has been observed that 64-bit Tivoli Monitoring products might
have higher virtual memory usage on Linux systems with large amounts of RAM
than on systems with smaller amounts of RAM. This is a Linux system
determination.
You must first check to see if any override is associated with that particular
managed system before removing it from the situation. If an override is found,
remove it before the situation's distribution is modified.
According to standard, grouping functions can only return the grouping function
results and any columns used in the grouping. This predicate is looking for a
single row, but a grouping function is an aggregate of the grouped rows.
A core or dump file might be created during the program crash. Creation of a core
or dump file usually depends on if the system has been configured to save crash
information. However, even if the system is configured to save crash data, this
particular crash might not produce a core or dump file.
The expected seeding completion messages will follow the crash message. The
normal seeding messages are checked by the Tivoli Monitoring configuration tool
for successful completion of all seeding functions. The crash message always
appears in the monitoring server seeding output even if a core or dump file is not
produced.
Monitoring server seeding output files are stored in different files on UNIX and
Windows systems.
UNIX examples:
$ITM_HOME/logs/Node_ci_query_Process_ID.log
$ITM_HOME/logs/Node_ci_query_Process_ID.log
where :
Node The system host name
Process_ID
The program process ID
Windows example:
C:\IBM\ITM\CNPS\logs\seedApp.log
where:
App The 3-character product code, such as knt for the Monitoring Agent for
Windows OS
The exact cause of the monitoring server seeding program crash has not been
determined. The program has finished all seeding functions and is exiting. The
crash has only occurred when only a few seeding changes are required. Seeding
functions making many updates to the monitoring server tables have never
resulted in this type of program crash. It is very possible that there is something
unique about the system where this crash has been seen. The crash has only been
observed on one internal test system, which was a Linux for AMD (Opteron)
system.
Some agents are displayed in the Service Console list that are
not accessible from that user interface
For instance, if you click IBM Tivoli M5 Agent Service Interface, a popup is
displayed for the User Name and Password. There is no valid user name and
password combination that will work, and you must click Cancel to get out of this
page. Other agents do not have this problem.
The default time for starting and stopping a Tivoli Enterprise Monitoring Server
service is ten minutes. In the following two situations, this time can be ten times as
long:
1. If you have a large or complicated infrastructure connected to the Tivoli
Enterprise Monitoring Server.
2. If you have a remote Tivoli Enterprise Monitoring Server, but the hub Tivoli
Enterprise Monitoring Server is offline.
The "nofiles" parameter is the number of file descriptors available to a process. For
the monitoring server process (kdsmain), the "nofiles" parameter should be set
larger than the maximum number of agents that will be connecting to the
monitoring server. If the monitoring server is unable to get file descriptors when
needed, unexpected behavior can occur, including program failures. Consider
increasing the value to 1000 file descriptors or more.
There are other user limit parameters that control how much data, stack and
memory are available to a process. For large environments, consider increasing
these memory-related user limit parameters for the monitoring server (kdsmain)
process.
Configuring the user limit parameters usually requires root access, and involves
changing system startup files which are operating system specific. Consult the
operating system manuals for information on how to configure the user limit
parameters.
The value of the MINIMUM parameter within the KDSSYSIN member of the
RKANPARU library might need to be increased if the STGDEBUG(X) or
STGDEBUG(Y) parameter is also supplied within KDSSYSIN. If the address space
controlled by this KDSSYSIN member enters a "storage quiesce" state (indicated by
This message normally indicates that the OS agent is not online. If the agent is, in
fact, online, cancel current operations to this node:
# tacmd cleardeploystatus -h hostname
Then issue the operation directly by using the Managed System Name parameter
(instead of the deploy group):
# tacmd updateAgent -t product_code -n managed_OS
A new login solves the problem and enables the tacmd listsystems command.
Log indicates hub monitoring servers are down when they are
up
The statusPut process fails periodically, resulting in an incorrect hub Tivoli
Enterprise Monitoring Server status. This condition is harmless and does not cause
any operational change by the software. The following is an example of the log:
Confirm that the prerequisite software has been installed. The C libraries are
critical for the Tivoli Enterprise Monitoring Server performance at start and stop
times and are important for communication between Tivoli Enterprise Monitoring
Server and Tivoli Enterprise Portal Server.
The installation should check the prerequisites and show information in one of the
logs, such as the candle installation log or the Tivoli Enterprise Monitoring Server
log. If one of the prerequisites is missing the installation will not continue
automatically.
When the monitoring server is stopped, the internal web server swaps to another
IBM Tivoli Monitoring process such as an OS Agent. When the monitoring server
starts again, the monitoring server and SOAP are running, but the internal web
server is not in the same process. You can determine which process is running the
internal web server by starting a browser session to the service console
http://server:1920. Ensure the browser View/Status is checked, and then move the
cursor over each link. The port involved will be seen in the status line like this:
Service Point: cnp
-> IBM Tivoli Monitoring Service Console
-> IBM Tivoli Enterprise Portal Web Client
Service Point: nmp180_hd
-> IBM Tivoli Monitoring Service Console
In this case the cursor was on the service console link under "Service Point:
nmp180_hd" and that was the process running the internal web server.
If a firewall rule is in place between the tacmd login process and the server
running the hub monitoring server and SOAP, the tacmd login command might
fail. The tacmd logic attempts to use the base port connected with the "IBM Tivoli
Monitoring Web Services" link. If that is unavailable, it uses the 1920 port. That
will fail if the 1920 process is not the same as the monitoring server process.
If this condition occurs, it can be resolved by stopping all IBM Tivoli Monitoring
processes on that server, starting the hub monitoring server, and then starting up
the rest of the IBM Tivoli Monitoring processes. When things are running again,
the tacmd login begins to operate as expected.
When the problem occurs, make sure the hub monitoring server started under the
non-root login is not running. Then recycle whichever process shows up first on
the Service Index page.
Set the monitoring server tracing and LDAP client-side tracing on the hub
monitoring server:
KBB_RAS1=ERROR (UNIT:kdslg ALL) (UNIT:kdsvl ALL) (UNIT:kgllg ALL) (UNIT:kglld ALL)
Run the following commands on the system hosting the UNIX or Linux hub
monitoring server:
1. export LDAP_DEBUG=65535
2. export LDAP_DEBUG_FILE=/opt/IBM/ITM/logs/ldaptrace.txt (or whatever
path/file you want)
3. rm /opt/IBM/ITM/logs/ldaptrace.txt (to remove the file before restarting the
Hub)
Restart the hub monitoring server. Note that these LDAP-related trace settings
remain active until the monitoring server is restarted from a shell session where
the variables are not exported. If the monitoring server is restarted from the same
session where these variables are still exported, then the settings will be active
again after restart.
Reproduce the problem. As you reproduce it, any LDAP requests add trace content
to that ldaptrace.txt file.
Retrieve that ldaptrace.txt file, and the hub monitoring server log files. The hub
monitoring server logs will show any possible failures that occur leading up to the
calls to the SOAP server. The ldaptrace.txt file will show any activity and
possible failures occurring when it actually binds to the SOAP server and attempts
to lookup users.
Windows Systems
Windows systems use the same KBB_RAS1 trace settings as UNIX and Linux
systems, but enabling the additional LDAP trace requires a different procedure.
From the Manage Tivoli Enterprise Monitoring Services (MTEMS) GUI, complete
the following steps:
1. Stop the monitoring server.
2. Right-click the entry for the monitoring server, and select Advanced... -> Edit
Variables....
3. In the Override Local Variable Settings window that appears, click Add. Enter
LDAP_DEBUG for the Variable, and 65535 for the value, and click OK.
4. Click Add again and enter LDAP_DEBUG_FILEfor the Variable, and any path or
file that you wish (for example, C:\temp\ldaptrace.txt). Then click OK.
5. Click OK to save the changes.
6. Start the monitoring server. New login and LDAP-related monitoring server
activity is now logged in the LDAP_DEBUG_FILE.
When you are finished reproducing the problem and want to stop tracing, go back
to the Manage Tivoli Enterprise Monitoring Services (MTEMS) GUI and complete
the following steps:
For example, after failover to the standby server, the tacmd login -s myserver -u
sysadmin -t 1440 command returns the following error:
Validating user...
KUIC00006E: The specified user name or password is incorrect.
You are not logged in because you specified an incorrect user name or password
or you do not have permission to log in.
Verify the correct user name and password and that you have permission to log in.
Procedure
1. On the monitoring server computer that you cannot log in to, open the
kshxhubs.xml file in a text editor.
2. Change the setting for <CMS_Name> to specify the IP address. For example,
from <CMS_Name>ip.pipe:hub_myserver[1918]</CMS_Name> to
<CMS_Name>ip.pipe:9.11.10.188</CMS_Name>.
3. Save and close the kshxhubs.xml file.
4. Restart the monitoring server.
Results
The tacmd login command to the server completes successfully if you enter a valid
user ID and password.
The mirror hub does not keep accurate node status of any endpoint, whether hub
or remote monitoring server. As the tacmd listappinstallrecs command presents
install records for any monitoring server, if you issue at a mirror, the acting hub
might or might not be omitted (as well as any remote monitoring servers in the
installation).
For more information about configuring the monitoring server on z/OS, see
Configuring the Tivoli Enterprise Monitoring Server on z/OS (http://pic.dhe.ibm.com/
infocenter/tivihelp/v61r1/topic/com.ibm.omegamon_share.doc_6.3/ztemsconfig/
ztemsconfig.htm).
Use the following steps to ensure that the Configuration tool references the correct
SMP/E target high-level qualifiers:
1. From the Configuration tool main menu, select Install products or
maintenance > Set up product or maintenance environment > Specify
environment information.
The problem is that the KCITVARS ISPF table is out of order. The procedure to sort
the table is:
1. Back-up INSTDATA.
2. In TKANCUS, create a CLIST called XSORT. The contents will be as follows:
======================================================
PROC 0
SET SORTLIST = &STR(CIGPRF,C,A,CIGPRD,C,A,CIGVAR,C,A)
ISPEXEC TBOPEN KCITVARS WRITE SHARE ISPEXEC TBSORT KCITVARS FIELDS(&SORTLIST)
ISPEXEC TBCLOSE KCITVARS PAD(30) EXIT CODE(0)
======================================================
3. Invoke ICAT.
4. On the ICAT main menu, select Configure products->Services and
utilities->Execute a CLIST in the TKANCUS library option.
5. On the next panel, specify XSORT as the Name. Ensure that the panel displays
a "Completion Code = 0" message after this CLIST is invoked.
6. Navigate to the RTE main menu and regenerate the batch parameter member
for RTE=SYP1. Ensure that all the applications are now generated.
Note: If you revert your Registry Services for IBM Jazz™ for Service Management
environment or the associated Registry Services database (FRSDB) to a prior state,
you must stop and restart any and all running automation servers. Otherwise, any
currently running automation server will not be known as a registered service
provider, resulting in Registry Services rejecting any HTTP requests made by the
automation server.
For the debugging variable trace settings, see “Setting the trace option for the
automation server” on page 50. For a complete list of the automation server
environment, see “Tivoli Enterprise Monitoring Automation Server environment
variables” in the IBM Tivoli Monitoring Installation and Setup Guide.
Other factors such as network delays and unresponsive routers can also slow
shutdown.
Problem
The automation server fails to stop or does not stop in a timely fashion.
UNIX command to stop the automation server and the response after
timeout occurs:
./itmcmd agent stop as
Processing. Please wait...
Stopping Tivoli Enterprise Monitoring Automation Server ...
Product as was not stopped. You can use /data/630/d2275a/bin/itmcmd agent
Note: Also at shutdown, the automation server's ras1 log shows a list of
component locks being released. At times, invalid numbers can be noted with the
number of locks that are being released. You can ignore these invalid number error
messages unless there are issues with the component shutdown.
Review the trace and log information and problem scenarios for guidance with
diagnosing Authorization Policy Server issues.
Set dashboard data provider tracing when you have trouble with data retrieval to
the server dashboards and have been requested by IBM Support. Policy client
tracing can be helpful when you are troubleshooting problems involving run-time
authorization checks or policy distribution.
The TEPS/e administration console is disabled by default for security reasons and
to save system resources. The Tivoli Enterprise Portal Server must be running
before you enable the console.
Take these steps to enable and start the TEPS/e administration console, and set a
trace for the dashboard data provider or policy client:
Procedure
1. Enable the TEPS/e administration console :
v In the Manage Tivoli Enterprise Monitoring Services window,
highlight Tivoli Enterprise Portal Server and select Advanced > TEPS/e
Administration > Enable TEPS/e Administration.
v From the command line, change to the scripts directory (Intel
Linux: install_dir/li6263/iw/scripts; zLinux:install_dir/ls3266/iw/
scripts; AIX®:install_dir/aix533/iw/scripts) and enter the following
command, where true starts the console and false stops the console:
What to do next
You can use tacmd pdcollect to collect the necessary data provider logs from the
portal server. For more information about the tacmd commands, see the IBM Tivoli
Monitoring Command Reference (http://pic.dhe.ibm.com/infocenter/tivihelp/v61r1/
topic/com.ibm.itm.doc_6.3/cmdref/itm_cmdref.htm).
Configure Authorization Policy Server trace settings through the JazzSM console if
IBM Support has requested traces.
Procedure
1. Enter the following URL in your browser to start the JazzSM console:
https://hostname:admin_host_port/context_root/logon.jsp
where
hostname is the fully qualified host name of the JazzSM server.
What to do next
You can run one of the following scripts to create a PolicyServerLogs.zip file in a
child output directory that contains the log information from the server dashboards
and Authorization Policy Server:
JazzSM_installdir\AuthPolicyServer\PolicyServer\tools\
pdCollect.bat
JazzSM_installdir/AuthPolicyServer/PolicyServer/tools/
pdCollect.sh
pdcollect tool
Use the pdcollect tool to gather log files, IBM Installation Manager logs,
configuration information, version information, properties and other information to
help diagnose problems with authorization policies.
The pdcollect tool is used at the command line by running the pdcollect script.
The pdcollect tool is installed with the Tivoli Authorization Policy Server or the
tivcmd Command-Line Interface for Authorization Policy (See “Installing and
configuring the Tivoli Authorization Policy Server and tivcmd Command-Line
Interface for Authorization Policy” in the IBM Tivoli Monitoring Installation and
Setup Guide.)
The following example shows the beginning of the audit record that is
generated by the Policy Distribution Client when the dashboard data
provider cannot communicate with the Authorization Policy Server. The
Review the scenarios to learn more about using tivcmd commands to investigate
problems with server dashboards.
Managed system not shown
When determining why a managed system is not shown in the server
dashboards for a particular user, you must have the following information:
the user ID, the managed system name, and the managed system groups
that the managed system is a member of.
You can run the following command to display the list of roles that the
user is a member of, either explicitly or through user group role
membership:
tivcmd listroles -u userid
If you suspect that the user is not a member of a role that has permission
to view the managed system, you can enter this command to display the
complete membership for the role:
tivcmd listroles -n role_name -m
Either the user can be added to the role, or the user can be added to a
group that is already a member of the role.
If the user is already a member of the expected viewing role, ensure that
the correct permissions are applied to that role using this command:
tivcmd listroles -n role_name -p
If the permissions are correct for the role, check to see if the user is a
member of a group or role that has been explicitly denied the permission
to view the specified managed system. Note that access denials,
implemented with tivcmd exclude, always take priority even if a tivcmd
The command returns all permissions that specify the given managed
system for each associated role. If the user is a member of one of those
roles and an excluded “view” operation is present, the authorization is
either correct or the user was incorrectly added to the role explicitly or
through a user group. In that case, one of the following resolutions might
solve the problem:
v Remove the user with the tivcmd removefromrole command if the
membership is explicit, or through the User/Group management system
if the membership is implicit.
v Use the tivcmd revoke command to remove the denial rule for the
resource if you have determined that the rule is no longer needed.
If the problem is that no role currently exists with permission to the given
managed system or to the managed system group that it is a member of,
the solution might be to create a new role, with the required permissions,
and add the user to the role.
Lastly, it might be the case that the role or permissions for viewing the
managed system exist and the user has been correctly added to the role,
but the policy updates were not distributed to the Tivoli Enterprise Portal
Server so that it could make use of the updates. In that case, you might
need to take the following steps:
1. Review and possibly shorten the policy Polling Interval configured
on the portal server if the policy retrievals are not occurring quickly
enough. The portal server has a default interval of 30 minutes for
checking the Authorization Policy Server for changes. (For details on
reconfiguring the portal server authorization policy settings, see
“Enabling authorization policies in the portal server” in the IBM Tivoli
Monitoring Administrator's Guide.)
2. Examine the portal server audit log in the Tivoli Enterprise Portal to
see whether a policy distribution error occurred. For more information,
see “Audit logging” in the IBM Tivoli Monitoring Administrator's Guide.
For more details on these types of errors, see “Policy distribution issues”
on page 197.
Managed system group not shown
To determine why a managed system group is missing from a server
dashboard, follow the same basic troubleshooting steps as you would for a
missing managed system. First, run the following command to list the roles
a user is a member of, either explicitly or through role membership.
tivcmd listroles -u user_ID
The command returns all permissions that specify the given managed
system group for each associated role. The user should be a member of one
of those roles, either explicitly or implicitly through a group. If that is not
the case, you can either add the user to a role that has access to the
Lastly, it might be the case that no role exists with permission to the given
managed system group, in which case the solution might be to create a
new role, grant the required permission, and add the user to the role.
Cannot see events for a managed system
For a user to see events for a managed system, they must be a member of
a role that has access on the “event” object type.
For a user to see event details for a managed system, they must be a
member of a role that has access on both the “attributegroup” object type
and “event” object type.
You can check the permissions of the role that the user belongs to by
running the following command:
tivcmd listroles -u user_ID -p
For a description of all the tivcmd commands, see the IBM Tivoli Monitoring
Command Reference (http://pic.dhe.ibm.com/infocenter/tivihelp/v61r1/topic/
com.ibm.itm.doc_6.3/cmdref/itm_cmdref.htm).
You can see the User ID that is configured for the authorization policy
download by using the portal server reconfigure function:
Click Start > Programs > IBM Tivoli Monitoring > Manage
Tivoli Enterprise Monitoring Services. Right-click the Tivoli Enterprise
Portal Server, select Reconfigure, and click OK to accept the current
configuration and open the Authorization Policy Server Configuration
dialog box.
Change to the install_dir/bin directory and run ./itmcmd
manage [-h install_dir]. Right-click the Tivoli Enterprise Portal Server
and select Configure.
You can also see the user ID in the KDQ_GS_POLICY_SERVER_USERID
environment variable in the portal server environment file:
Click Start > Programs > IBM Tivoli Monitoring > Manage
Tivoli Enterprise Monitoring Services. Right-click the Tivoli Enterprise
Portal Server and select Advanced > Edit ENV File to open the kfwenv
file.
Change to the install_dir/config directory and open the
cq.ini file in a text editor.
To check if the user is a member of the PolicyDistributor role, use the
following tivcmd Command-Line Interface command. A list of members
assigned to this role is displayed:
tivcmd listroles -n PolicyDistributor -m
A good first diagnostic step is to look at the portal server audit log, which
contains messages about Authorization Policy Server and policy
distribution problems.
[1/3/13 13:04:26:457 EST] 000000d4 PolicyServer I
com.ibm.tivoli.monitoring.provider.security.PolicyServer getEntitlements()
Calling getUserResources with allowedOrDenied<true> domain
<itm.HUB_amsntx28>user<cn=svtldap1,ou=users,ou=SWG,o=IBM,c=US>
user groups<[cn=svtgrp1,cn=svtldap,ou=SWG,o=IBM,c=US]> op<view>
objType<attributegroup> obj<any>
[1/3/13 13:04:26:458 EST] 000000d4 client E
com.ibm.tivoli.rest.authz.client.PolicyClientRefreshWrapperImpl
performInitialization Policy store parent root directory has not been set;
the client cannot initialize.
[1/3/13 13:04:26:465 EST] 000000d4 client E
com.ibm.tivoli.rest.authz.client.PolicyClientRefreshWrapperImpl
getUserResources KDQPC0017E The policy client is not initialized.
The policy client request cannot be processed. The failed policy store
path is [ null ] and the failed policy role path is [ null ] .
[1/3/13 13:04:26:466 EST] 000000d4 PolicyServer E
com.ibm.tivoli.monitoring.provider.security.PolicyServer
getEntitlements() ERROR:
com.ibm.tivoli.rest.authz.PolicyAuthorizationException
[1/3/13 13:04:26:479 EST] 000000d4 MsysCollectio E
com.ibm.tivoli.monitoring.provider.msys.MsysCollection
getSecurityFilter() getEntitlements using allowed returned empty list
[1/3/13 13:04:26:480 EST] 000000d4 rest E
com.ibm.tivoli.rest.RestProvidersURI getDatasourceDatasetColumns
unexpected exception:
com.ibm.tivoli.monitoring.provider.navmodel.ITMRuntimeException:
KFWITM621E The requested resource is currently unavailable,
does not exist or authorization is denied at
com.ibm.tivoli.monitoring.provider.msys.MsysCollection.
getSecurityFilter(MsysCollection.java:298)
The output indicates that no user has been added to the PolicyDistributor
role, which explains why the dashboard data provider could not download
policies and why the KFWITM621E message occurred. Note that every
time you install a new Authorization Policy Server, you must issue the
following command once:
tivcmd addtorole --rolename PolicyDistributor -u "user_configured_for_DP"
Latest policy updates not appearing at the portal server
In this scenario, assume that the user configured for the Authorization
Policy Server connection was added to the PolicyDistributor role, and
that policy distribution to the portal server has already occurred
successfully one or more times. You notice that the latest policy updates
are not available at the portal server. The symptom might be, for example,
that a role was recently granted access to a managed system, but the server
dashboard continues to prevent a user belonging to that role from viewing
the managed system.
When you investigate a problem of this type, it is important to remember
that policy updates implemented at the Authorization Policy Server do not
appear immediately at the portal server. Two polling intervals influence
how quickly the updates are distributed:
v Updates are periodically batched together at the Authorization Policy
Server to prepare for subsequent distribution. This process is controlled
by the Authorization Policy Server's DIST_POLL_INTERVAL resource
environment property. (For more information, see “Changing the
Authorization Policy Server configuration properties after installation
and configuration” in the IBM Tivoli Monitoring Administrator's Guide.)
v The portal server makes periodic requests to the Authorization Policy
Server for the latest updates, as defined by the Polling Interval
parameter in the portal server configuration settings. (For more
information, see “Enabling authorization policies in the portal server” in
the IBM Tivoli Monitoring Administrator's Guide.)
It could take as long as the sum of these two polling intervals for a change
made at the Authorization Policy Server to appear at the portal server.
Another potential reason for missing updates at the portal server is that a
recent change took effect at the Authorization Policy Server which has
impacted policy distribution. Here are some possible changes:
v The password for the Authorization Policy Server user specified in the
portal server configuration has expired or was modified and needs to be
updated at the portal server.
Adjust the trace settings only as directed by your administrator or IBM Support to
help diagnose the cause of problems with the server dashboards.
You can set a trace from a server dashboard in the Dashboard Application Services
Hub. While displaying the dashboard that you want to change the trace level for,
click Actions > Trace level and select one of the following levels:
v Verbose to have all activity logged. Verbose trace level includes Moderate, Light,
and Minimal trace logging.
v Moderate to have variable changes logged, such as what parameters were
passed in and what calculations were made. Moderate trace level includes Light
and Minimal trace logging.
v Light to log error and variable activity. You might want to set the trace to this
level if you have a problem such as no data being returned but the dashboard
continues to function. Light trace level includes Minimal trace logging.
v Minimal is the default setting and records only unrecoverable errors. You can
set the trace level back to minimal after collecting a specific activity sequence.
Even if a different trace level was set before logout, the trace is always reset to
the lowest level the next time you log in.
The trace is adjusted to the level chosen for this and all subsequent dashboards
selected. To keep communications traffic to a minimum, the log messages are
transferred in batches to the Dashboard Application Services Hub. A final transfer
is made after you log out, whether manually or after a timeout period. (If the
browser fails, no final logging is sent.)
The logs are saved on the Dashboard Application Services Hub computer and
named userid.log.0 where userid is the ID used to log in to the Dashboard
Application Services Hub and “0” is the first log. Three log files of 750 KB total are
used to record trace data in a cyclical manner: After the userid.log.0 reaches 250
KB, log entries are saved to userid.log.1; after userid.log.1 reaches 250 KB, log
entries go to userid.log.2 until it reaches the maximum, at which time
userid.log.0 is cleared and new entries are saved there.
A user with neither event access nor attribute group access for any
managed system groups, sees an empty carousel on the Managed
System Groups Overview dashboard. You can use the tivcmd
listroles command to view what roles the user is assigned to and
what permissions are assigned to that role:
tivcmd listroles -u uid=annette,cn=itm,o=tivoli -p
Sample output:
NTEventOnly
Permissions:
Domain: any
Resource Type: managedsystemgroup
Resource: *NT_SYSTEM
Object Type: event
Granted Operations: {view}
If you determine that the role should have view operation for the
event object type, use the tivcmd grant command to add this
capability:
tivcmd grant -n NTDataOnly -t managedsystemgroup -r
*NT_SYSTEM --objecttype event --operations view
Enter tivcmd listroles command to view the newly added
capability:
tivcmd listroles -u uid=jim,cn=itm,o=tivoli -p
Sample output:
NTEventOnly
Permissions:
Domain: any
Resource Type: managedsystemgroup
Resource: *NT_SYSTEM
Object Type: attributegroup
Granted Operations: {view}
Permissions controlled by the Tivoli Enterprise Portal Server
Edit your user ID to add Events - View permission and, if
applicable, the managed system type included in Allowed
Applications. For details, see “Using Tivoli Enterprise Portal user
authorization” in the IBM Tivoli Monitoring Administrator's Guide.
Compare the resource names to the names displayed by running the tacmd
listSystems and tacmd listSystemList Command-Line Interface
commands.
If you do not find the resolution to a problem you experience with a monitoring
agent, refer to the agent-specific user guide.
Select 1 to upgrade the libraries. You can quit the installation on the next
menu.
v Install the Tivoli Enterprise Services User Interface Extensions
component. You must force the operating system version during
installation. In this example, you must install Tivoli Enterprise Services
User Interface Extensions for the following systems: Linux Intel R2.6 (32
bit) and Linux x86_64 R2.6 (32 bit) After completion, all the Tivoli
Monitoring Shared Libraries should be at the 6.3 level.
# ./cinfo -t ax
***********************************************************
User: root Groups: root bin daemon sys adm disk wheel
Host name : hostname Installer Lvl:06.30.00.00
CandleHome: /opt/IBM/ITM
Version Format: VV.RM.FF.II (V: Version; R: Release; M: Modification;
F: Fix; I: Interim Fix)
***********************************************************
...Product inventory
Command-line interface
Review the command-line interface troubleshooting descriptions for help with
tacmd usage.
For certain action commands, the tacmd ExecuteAction command fails with the
following error message: KUICXA029E: The execution of the take action
ActionName failed in all the managed systems.This failure occurs when the
following ExecuteAction options are specified:
v -e | --stderr
v -o | --stdout
v -r | --returncode
v -l | --layout
v -p | --path
To understand if an action has a prefix, run the tacmd viewaction command and
view the action's Command details. For example, the UNIX AMS Stop Agent take
action command has a UX: prefix.
Historical data
If historical data is not being collected in the short-term history file or being
warehoused properly or you have other issues with data collection, review the
possible cause and solution that correspond to your symptoms
The raw data delivered by an monitoring agent is a string or number. The Tivoli
Enterprise Portal has format information to control the display. When an attribute
is used in a system command in the Situation editor's Action tab, the raw data
from the agent is substituted. For example, if a situation had an action command
to send an email, it could look like this in the Situation editor:
my_command Warning too many processes &{System.Load_Average_1_Min} options
If the average was actually 6.99 as displayed in the portal client, the command
would be executed as:
my_command Warning too many processes 699 options
IBM Tivoli Monitoring V6.2.2 FP6 IZ98638 does not support reflex
actions for OMEGAMON XE agents on z/OS
You might encounter a problem using reflex actions for your OMEGAMON XE
agents on z/OS. Note that IBM Tivoli Monitoring V6.2.2 FP6 IZ98638 does not
support reflex actions for OMEGAMON XE agents on z/OS.
Workspaces
Review the monitoring agent workspaces entries for a description of problems in
the Tivoli Enterprise Portal with accessing and displaying workspaces.
You might notice that the Agent Management Services (AMS) workspace remains
unavailable, even if the OS agent is started. Changes to the behavior of the login
process make the Proxy Agent Service (PAS) initialization asynchronous. The AMS
workspace remains unavailable until PAS initialization completes. This is an
expected behavior.
Links point to specific object names and are not automatically updated to return
the latest version of a workspace. A link from an external agent pointing into the
OS agent must be updated to point to the current release. If the current version of
the agent is not truly "versioned" using the VRF appendage to the object name, but
is a different object name to facilitate 64-bit data workspace exposure, the external
anchor should have two links associated with it: one pointing to the existing
version of the workspace and a second pointing to the new 64-bit enabled
workspace.
A default query should display data when it is assigned to the view on the
workspace. However, if this is a view that has links, a link must be selected in
order to see the data in the workspace.
To solve this problem, replace the view and save the workspace.
OS agents
Review the OS agent entries for a description of configuration and usage problems
and how to resolve them.
The host using the IP.PIPE network protocol is limited to 16 Tivoli Monitoring
processes.
You can identify the issue by noting the agent server process exiting unexpectedly
with the following key messages:
(4E85BA02.005E-1:kdcsuse.c,119,"KDCS_UseFamily") status=1c010005,
"cant bind socket", ncs/KDC1_STC_CANT_BIND_SOCK
(4E85BA02.005F-1:krabrreg.cpp,1289,"CTRA_reg_base__Load") Use family failed,
family=34, st=1c010005
(4E85BA02.0060-9:kde12li.c,189,"KDE1_ServerListen") Status 1DE0002C=
KDE1_STC_SERVERNOTBOUND
(4E85BA02.0061-9:kdcs1li.c,126,"Listen") KDE1_ServerListen(0x1DE0002C)
(4E85BA02.0062-1:kraaumsg.cpp,143,"CTRA_msg_server_exit_status")
CTRA Server: exit status, 0
These messages indicate that the server cannot bind a socket and that the server is
exiting. This behavior is a known limitation.
The Windows Registry Editor lists the event log name as a key in either of two
paths:
HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Services\Eventlog
If you add an incorrect JDBC driver JAR file when configuring the Summarization
and Pruning Agent, an error message is displayed after you click Test Connection.
The error message continues to display, even after you have replaced the file with
the correct one. This behavior is a known limitation. Close the configuration panel
and run the configuration again.
The Oracle JDBC driver version V10.2.0.3.0 causes an application to hang when it
tries to acquire a connection to the database. The most common symptoms
affecting the Summarization and Pruning agent or Warehouse Proxy Agent are:
v lack of Tivoli Enterprise Portal responsiveness
v agent upload failures for Warehouse Proxy Agent
v increased memory usage over time
v rejecting a shutdown command
To resolve this issue, upgrade the JDBC driver to a newer level (at least V10.2.0.5.0
or later).
This message indicates that the configuration attempt failed. For example, after
you specify "Run As User" for an agent, this value cannot be cleared back to not
having "Run As User" because the code that triggers updates is based on a value
being set for a property. In this case, you are trying to unset a property by not
passing a value. To avoid this issue, supply the same user name for the WPA agent
as you have for the OS agent (typically, root or Administrator).
Self-describing agent
When your Tivoli Enterprise Monitoring Agent at V6.2.3 or later is configured to
be self-describing, it has all the required application support files to update the
Tivoli Management Services servers. You do not have to perform manual support
installation steps and recycle each individual server component that will support
the agent.
Procedure
1. Verify that all appropriate self-describing environment variables are enabled.
2. Verify that the remote monitoring server is started and connected to the hub
monitoring server.
3. The agent must also be enabled for self-description by packaging application
support as part of the agent installation. The agent must use the IBM Tivoli
Monitoring V6.2.3 or later agent framework or be installed on a system where
the V6.2.3 or later agent framework is already installed. You can tell if an agent
is enabled for self-description before installation or after installation. See
“Determining if agents are enabled for self-description” in the IBM Tivoli
Monitoring Administrator's Guide.
4. Run the following tacmd commands and review the results:
tacmd listappinstallrecs
The STATE column displays the state of the self-describing installation
for each product package on each monitoring server. A STATE value of
ME indicates a terminal error and that the installation is not
automatically tried again.
tacmd listSdaInstallOptions
Reports what versions of each product are allowed to perform
self-describing installations and what the default setting is. Check the
allowed versions.
tacmd listSdaStatus
Reports the STATE and STATUS of self-describing enablement at each
online monitoring server and reports whether self-describing is
suspended.
tacmd listSdaOptions
Check the self-describing product seed definitions.
For detailed steps, see “Self-describing agent installation” in the IBM Tivoli
Monitoring Administrator's Guide. For a detailed description of the tacmd
commands and options, see the IBM Tivoli Monitoring Command Reference
(http://pic.dhe.ibm.com/infocenter/tivihelp/v61r1/topic/
com.ibm.itm.doc_6.3/cmdref/itm_cmdref.htm).
5. Review the appropriate monitoring component audit log:
Tivoli Enterprise Monitoring Server
Look at the monitoring server audit log to determine whether
self-description is enabled at the monitoring server that the agent
connects to. The monitoring server audit log can inform you if
self-description is disabled because of the KMS_SDA setting at the local
monitoring server or hub monitoring server. There might also be other
configuration errors at the local monitoring server or hub monitoring
server that are causing a problem.
Tivoli Enterprise Portal Server
Look at the portal server audit log to verify that the agent's
self-describing files were processed successfully at the portal server.
This message indicates that a validation error occurred and that the agent is
not able to provide SDA support. See the Audit log and RAS1 for other
similar messages that indicate the specific SDA manifest file validation error.
After correcting the error, recycle the agent to participate in SDA processing.
KRAA0015 Self-Describing Agent function disabled for PRODUCT variable due to
local SDA file validation error.
This message indicates that agent SDA status is disabled due to manifest file
validation error. This condition can occur at agent startup time, or while the
agent is running and asked to provide SDA support. After the error
condition is corrected, the agent must be recycled to enable its SDA function.
KRAA0016 Ignoring TEMA_SDA Configuration! Agent SDA package not found for
PRODUCT variable.
This message indicates that the TEMA_SDA=Y setting is ignored because the
agent SDA package was not found. Set the variable only when an agent is
packaged with SDA support files.
KRAA0017 Self-Describing Agent function disabled, expected TEMA_SDA
configuration not found for PRODUCT variable.
Procedure
v If the situation event console is not available in the Tivoli Enterprise Portal
because you chose not to initially install any application support on the Tivoli
Enterprise Portal Server, and then application support was added through
self-describing agents, reconfigure the portal server.
What to do next
See also “Missing agent managed system group in a hot standby environment” on
page 88.
The following files are common files and may appear in multiple directories:
v khdexp.cfg
v KRAMESG
v KRAMESG.hdr
v OPLOG
v OPLOG.hdr
Only the agent that records into the specific directory will record history into the
file. The other agents record history information into their respective directories.
Unreadable tool tip information for Available EIF Receivers list of the
Situation editor
When you select the EIF tab of the Situation editor, you might notice an
unreadable tool tip in the Available EIF Receivers list. The problem is caused by
bad data in the TEXT column of the EVNTSERVER table. The problem has been
corrected. However, you might still experience this problem if the default event
destination (id=0) on the zOS TEMS has not been updated since the problem was
fixed.
To correct the problem, delete the default event destination entry (for example,
id=0) and recycle the hub TEMS. This will cause the Event Forwarder to rebuild
the default event destination entry in the EVNTSERVER table and the tool tip will
display correctly.
This behavior is true for ODBC, JDBC, and for DB2, using an OS user. This
behavior is not true for MSSQL or Oracle, not using an OS user, but rather a DB
user.
If you find that the Tivoli Data Warehouse database does not contain a
WAREHOUSEPRUNE table, and if in the portal server database the
KFWWHISTDATA table has been renamed KFWWHISTDATABAK, rename the
KFWHISTDATABAK table to KFWWHISTDA in the portal server, and then
reconfigure the warehouse. The WAREHOUSEPRUNE table will then be in the
Tivoli Data Warehouse database, and the issue will be resolved.
To ensure that these monitoring agents are started, manually start them from
Manage Tivoli Monitoring Services.
Examples of monitoring agents from before v6.2.2 Fix Pack 1 for which the issue
can appear:
v DB2 Agent
v Oracle Agent
v Microsoft SQL Server Agent
v Sybase Server Agent
v Microsoft Exchange Server Agent
v Lotus Domino Agent
v VMWare VI Agent
v Microsoft BizTalk Server Agent
v Microsoft Cluster Server Agent
v Microsoft Exchange Server Agent
v Microsoft SQL Server Agent
v mySAP Agent
v Siebel Agent
If you have a problem with the value in the xml file, you need to check the agent
log file. For example, if you misspell the value in the pc_eventmap.map file as
follows:
<itmEventMapping:agent
xmlns:itmEventMapping="http://www.ibm.com/tivoli/itm/agentEventMapping"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://www.ibm.com/tivoli/itm/agentEventMapping
agentEventMap.xsd">
<idUD/id>
<version>6.2.0</version>
<event_mapping>
<situation name="UDB_Buff_Used_Pct_Warn_pr">
<class name="ITM_KUDINFO00"/>
<slot slotName1="msg">
<mappedAttribute name="Incorrect slotName"/>
</slot>
</event_mapping>
SNMP alerts sent from the monitoring agent have some limitations when working
with subnodes. Currently, the monitoring agent does not support situation
attribute atomization. This means that for a situation that returns multiple rows
from a table, an SNMP alert is only sent for the first row returned. For example, a
situation called Low_Disk_Space will trigger if available_disk_space *LE 20% might
be true for more than 1 disk at a time. An enterprise situation would define the
disk name as the display item so a separate situation alert would be displayed for
each disk with less than 20% free space available.
Monitoring Agents for Windows OS that are v6.2.1 and higher can be installed
onto a monitoring server with no adverse side effects. IBM Tivoli Monitoring
Application Agents regardless of version can also be installed onto a monitoring
server with no adverse side effects.
Name: PrimeShift
Type: CRON
Data: * 8-16 * * 1-5
To correct the problem, use the tacmd editcalendarentry command to correct the
calendar overlaps so that the calendar entries look as shown above.
This happens because the SNMPPORT is not available to use. The default
SNMPPORT is 161. You should try to specify a different SNMPPORT to use when
deploying the agent. Here is an example command: tacmd createNode -h
smb://target_endpoint_hostname -p snmpport=4567 server=RTEM_hostname -u
user_id -w password -t ssm
226 IBM Tivoli Monitoring: Troubleshooting Guide
The Agent Service Interface is not globalized
This window is only displayed in English. There is not a workaround for this
issue.
In some cases, these names are not friendly and hovering over the item on the
navigation tree shows the attribute group to which it belongs. You can alter the
name of these collections by editing the historical collection and modifying only
the name.
Situations with attributes from more than 1 group not supported with
autonomous agent
When using autonomous agent in Manage Connected Mode, creating situations
with AND and OR logic, and using values from two different attribute groups, the
traps do not list the predicates in an expected way.
If you have defined a situation that attempts to combine attributes across more
than 1 attribute group, this is not currently supported by any autonomous agent
processing mode or private situations. The monitoring server performs the
evaluation of situations having combined attribute groups. This can be either from
embedded situations, or two or more attribute groups in the same predicate, for
example, WHERE USER=abc AND LOCALTIME=today.
If you specify this variable and the number of connection attempts to the
monitoring server exceeds CTIRA_MAX_RECONNECT_TRIES, the agent attempts
to shutdown. If the Agent Management Services Watchdog is running, it
immediately restarts the agent. If you want the agent to shutdown when
CTIRA_MAX_RECONNECT_TRIES is exceeded, this Watchdog process must be
disabled. Use the AMS Stop Management action to disable this watchdog process.
Ensure that the system sending SNMP traps to IPv6 destinations can resolve its
own IPv6 address. The system where the trap emitting-agent is running must have
a valid IPv6 address associated with its own local host name. If the DNS
configuration has not been updated for IPv6, then it is necessary to modify the
\etc\hosts file or /etc/resolv.conf file on the agent system in order to associate
the local host name with a valid local IPv6 address, for example, not a loopback or
link-local address.
The BAROC file for the Integration Agent is no longer supplied with IBM Tivoli
Monitoring. Generate the BAROC files as needed by using the BAROC file
generator tool found here:
http://catalog.lotus.com/topal?NavCode=1TW10TM43
The monitoring agent can run using a non-root user ID on UNIX and Linux
systems. This can be done by running the itmcmd agent start command while
logged in as a non-root user, and this can be done remotely by deploying the agent
using the Run As option on the GUI or using the _UNIX_STARTUP_.Username
option on the tacmd addSystem command line. If the agent is running using a
non-root user ID, and then the agent is upgraded, restarted remotely, restarted as a
result of a system reboot, or the itmcmd agent start is run using the root user ID,
then the monitoring agent subsequently runs as the root user. To confirm the user
ID that the monitoring agent is using, run the following command:
itm_install/bin/cinfo -r
If the installation is not permissioned properly, then you might be unable to restart
the agent as a non-root user ID after it has been run as the root user ID. To prevent
this problem, ensure that the secureMain lock command with the -g option has
been previously run. See the "Securing your IBM Tivoli Monitoring installation on
Linux or UNIX" appendix in the IBM Tivoli Monitoring Installation and Setup
Guide for further details.
If the agent is running as root, and that is not the desired user ID, then use the
following steps to restart the agent:
1. Log in as root.
2. Run the itmcmd agent stop command.
3. Log in (or 'su') to the user ID that you want the agent to run as.
4. Run the itmcmd agent start command.
If the agent was running as root because of a system reboot, then complete the
following steps so that the appropriate user ID is used the next time the system is
To specify different user IDs to start different instances of an agent, use the
following syntax:
<productCode>product_code</productCode>
<instance>
<name>instance_name1</name>
<user>user_name</user>
</instance>
<instance>
<name>instance_name2</name>
<user>user_name</user>
</instance>
Where:
product_code
2-character product code for the agent, for example, ux for the Unix OS
monitoring agent.
user_name
Name of the user to run the agent as.
instance_name1
Name of an instance.
instance_name2
Name of another instance.
Examples:
For the Unix OS monitoring agent, to run as itmuser:
<productCode>ux</productCode>
<default>
<user>itmuser</user>
</default>
For the DB2 monitoring agent instances to run as the instance owner IDs:
<productCode>ud</productCode>
<instance>
<name>db2inst1</name>
<user>db2inst1</user>
</instance>
<instance>
<name>db2inst2</name>
<user>db2inst2</user>
</instance>
For the Websphere MQ monitoring agent instances to all run as the mqm user
ID, and for the default instance to not be started:
<productCode>mq</productCode>
<default>
<user>mqm</user>
The expected result would be that, for the deleted event, the situation status is set
to 'D".
Events do not reflect the long names if the definition is deleted, and the Tivoli
Enterprise Monitoring Server logs also do not reflect the long name.
Logs are using the situation ID string instead of the display name
Situations have both a display name (up to 256 bytes in length, UTF8 enabled) and
an ID string (up to 32 bytes in length ). For example, a situation display name can
be "don't let the pigeon drive the bus !" or 100 characters of Japanese text. The
associated ID string will be of the form 000000000000008D723DE7DFED450F. If the
Tivoli Enterprise Portal Server Universal Message Console and the Tivoli
Enterprise Monitoring Server RKLVLOG cannot display the situation display name,
the ID string is displayed instead.
To ensure that your situations are displayed using the display name instead of the
display ID, make sure that your situation names do not exceed 31 characters in
length, and that they do not contain any special characters.
Unable to receive summarized data for the last hour in the Tivoli
Enterprise Portal
The Summarization and Pruning Agent does not run continuously. It is scheduled
to run at some frequency or on a fixed schedule. Data for the last hour likely will
not be available until this agent has just finished running and has had enough data
to compute the last hour summary, which a data sample exists for the following
hour.
The Summarization and Pruning Agent does not summarize hour X until at least
one sample for hour X+1 is available in the Tivoli Data Warehouse at the time the
summarization is started for that agent.
If a database insert fails while running the Summarization and Pruning Agent, and
the message indicates the insert has failed because the maximum index size has
been exceeded, complete the following steps to correct this issue:
1. Stop the Summarization and Pruning Agent.
2. Drop the index causing the failure.
3. Run the Schema Publication Tool in configured mode:
KSY_PRODUCT_SELECT = configured
4. Open the tdw_schema_index.sql file and find the index that was deleted in step
1.
5. Edit out all statements except the index you want to recreate.
6. Run the create index statement.
7. Start the Summarization and Pruning Agent.
The agent deployment operations can complete if you increase the TIMEOUT
value. Some operations can complete even after a timeout is returned.
Table 17. Resolutions for agent deployment operations that TIMEOUT
Problem Resolution
KDY0014E Increase the Tivoli Enterprise Monitoring Server timeout value to 1200
message seconds (TIMEOUT=1200). The default is 600 seconds.
On Windows: installation_dir\CMS\KBBENV.
You can also change the environment variable in the kui.env file on
Windows systems and the tacmd shell script on non-Windows systems.
Both of these files can be found in the CANDLEHOME/logs directory.
A failure occurs The Tivoli Enterprise Portal Server times out waiting for deployment
when deploying action to complete. The default timeout is 600 seconds. You can change
an agent from the the timeout setting to KFW_SQL1_ASYNC_NOTIFY_MAX_WAIT in
Tivoli Enterprise kfwenv:
Portal. KFW_SQL1_ASYNC_NOTIFY_MAX_WAIT=1000
KUICCN068E Increase the timeout value in seconds by adding "-o TIMEOUT=3600"
error when to the createnode command.
running tacmd
createnode.
If you have multiple instances of a monitoring agent, you must decide how to
name the monitoring agents. This name is intended to uniquely identify that
monitoring agent. The agent's default name is composed of three qualifiers:
v Optional instance name
v Machine network host name
v Agent product node type
An agent name truncation problem can occur when the network domain name is
included in the network host name portion of the agent name. For example,
instead of just the host name myhost1 being used, the resulting host name might
be myhost1.acme.north.prod.com. Inclusion of the network domain name causes
the agent name in the example above to expand to
SERVER1:myhost1.acme.north.prod.com:KXX. This resulting name is 39 characters
long. It is truncated to 32 characters resulting in the name
SERVER1:myhost1.acme.north.prod.
The agent name truncation is only a problem if there is more than one monitoring
agent on the same system. In this case, the agent name truncation can result in
collisions between agent products attempting to register using the same truncated
name value. When truncated agent names collide on the same system, this can lead
to Tivoli Enterprise Monitoring Server problems with corrupted EIB tables. The
agent name collision in the Tivoli Enterprise Monitoring Server might cause a
registered name to be associated with the wrong product.
In general, create names that are short but meaningful within your environment.
Use the following guidelines:
v Each name must be unique. One name cannot match another monitoring agent
name exactly.
v Each name must begin with an alpha character.
v Do not use blanks or special characters, including $, #, and @.
v Each name must be between 2 and 32 characters in length.
v Monitoring agent naming is case-sensitive on all operating systems.
If you do not find the files mentioned in the preceding steps, perform the
following workarounds:
1. Change CTIRA_HOSTNAME environment variable in the configuration file of the
monitoring agent.
v Find the KAGENT_CODEKENV file in the same path mentioned in the preceding
row.
v For z/OS agents, find the RKANPAR library.
v For i5/OS agents, find the QAUTOTMP/KMSPARM library in member KBBENV.
2. If you cannot find the CTIRA_HOSTNAME environment variable, you must add it to
the configuration file of the monitoring agent:
v On Windows: Use the Advanced > Edit Variables option.
v On UNIX and Linux: Add the variable to the config/product_code.ini and
to config/product_code.config files.
v On z/OS: Add the variable to the RKANPAR library, member
Kproduct_codeENV.
v On i5/OS: Add the variable to the QAUTOTMP/KMSPARM library in member
KBBENV.
3. Some monitoring agents (for example, the monitoring agent for MQ Series) do
not reference the CTIRA_HOSTNAME environment variable to generate component
names. Check the documentation for the monitoring agent that you are using
for information on name generation. If necessary, contact IBM Software
Support.
Other solutions besides ensuring that each managed system name is unique are the
following:
v Detect and stop the agent process that is running improperly. This can be done
by checking the Tivoli Enterprise Portal Server Managed System Status network
address of the managed system name that seems to toggle ON and OFF line. Go
to the system indicated in the network address and check for multiple running
monitoring agents.
v If the agents running on the same system are the same product, stop or kill the
unintended agent process.
This problem has been seen in monitoring agents that exist on the same system.
Set the CTIRA_HOSTNAME environment variable configuration file for the
monitoring agent as follows:
1. Open the configuration file for the monitoring agent, which is located in the
following path:
v On Windows: install_dir\tmaitm6\Kproduct_codeCMA.INI. For example, the
product code for the Monitoring Agent for Windows OS is NT file name for
is KNTCMA.INI.
v On UNIX and Linux: install_dir/config/product_code.ini. For example,
the file name for the Monitoring Agent for UNIX OS is ux.ini.
2. Find the line the begins with CTIRA_HOSTNAME=.
3. Type a new name for host name that is a unique, shorter name for the host
computer. The final concatenated name including the subsystem name, new
host name, and AGENT_CODE, cannot be longer than 32 characters.
Note: You must ensure that the resulting name is unique with respect to any
existing monitoring component that was previously registered with the Tivoli
Enterprise Monitoring Server.
4. Save the file.
5. Restart the agent.
That file is created or modified every time an instance is started or stopped. All
instances must use the same user ID.
The Tivoli Data Warehouse fails and you either lose data or have
memory problems
On distributed systems, the data is written to the local file system. When the
Warehouse Proxy Agent receives the data, it copies the data to the Tivoli Data
Warehouse. If the Tivoli Data Warehouse is not available, the collected data could
take up a significant amount of memory.
If maintenance is set-up so that the data is rolled-over, the data that would have
been copied to the Tivoli Data Warehouse is copied over after a set period of time.
If maintenance is not performed, then the agent stops writing to the Tivoli Data
Warehouse until initialization is performed. Because the agent has stopped writing,
the data is there "forever" until you re-initialize and start again writing from the
top of the first dataset.
The minimum of 3 datasets allows for continuous collection, the normal case is
that one dataset is empty, one or more are full, and one is active. When an active
dataset becomes full, the empty dataset is activated for continued writing. When
the persistent datastore detects that there are no empty datasets left, it finds the
one with the oldest data and maintains it. If the BACKUP or EXPORT options are
not specified, maintenance is done within the persistent datastore to initialize the
dataset so that its status changes from full to empty. If BACKUP or EXPORT are
specified, a job runs to save the data, then the dataset is initialized and marked as
empty. If the BACKUP or EXPORT was specified, but the maintenance jobs fail to
do their job, the recording would stop in the persistent datastore. In this case,
datasets are taken off-line until there are no more available datasets for reading or
writing.
Note: If you allocate persistent datastores that fit more than 24 hours worth. The
agent initializes and writes as much data as it can fit in the persistent datastores,
24 hours or more. The Tivoli Enterprise Portal, for short term history display, only
pulls up 24 hours worth of data. The warehouse can archive ALL the data in the
persistent datastores, regardless if it has been 24 hours or more worth of data.
If you create a Tivoli Enterprise Portal query that is over 24 hours, then the
warehouse fulfills that request regardless if the data is in the online persistent
datastores.
Also, because the agent or Tivoli Enterprise Monitoring Server reads the entire
persistent datastores at initialization time, you should not allocate very large
persistent datastores to potentially store more than 24 hours. That increases the
Tivoli Enterprise Monitoring Server and agent startup time. As mentioned above,
the agent writes to it, but the Tivoli Enterprise Portal only displays 24 hours from
it. The warehouse processing reads all the data in the persistent datastores (24
hours or more), but there is a trade-off in Tivoli Enterprise Monitoring Server and
agent startup time. It is always best to calculate space for 24 hours as best as
possible.
As far as the potential of losing historical data, if the warehouse is down over 24
hours, that is a potential problem, assuming the persistent datastore's backup
processing is functioning and, therefore, the agent does not stop writing to the
persistent datastores.
Check the ethernet adapter settings on both the client and server. There are
problems if the adapter is set to Auto and the switch is set to 100/Full Duplex.
When configuring the Monitoring Agent for Sybase and the Warehouse
Proxy Agent, receive message to use CandleManage
The CandleManage command has been deprecated. The message should reference
the ./itmcmd manage command.
listSit command with the type option fails with a KUIC02001E message
on Japanese Systems
Edit the kuilistsitVld.xml file to replace the following text:
<Type arg1="-t" arg2="--type" Type ="String" ValidationRegExp=
"[-A-Za-z0-9 _/()\&%.]" Required="Y"/>
The solution is to increate the DB2 logging for the warehouse database. See the
IBM Tivoli Monitoring for Databases: DB2 Agent User's Guide for more
information.
Symbol references in Header and Footer expressions set through the view
Properties window will only be substituted if the workspace is reached as the
target of a link. Verify that the custom workspace for which the expression is being
specified is being reached through a link rather than being selected directly from a
Navigator item or the Workspaces menu. Otherwise, the symbols will evaluate as
an empty string.
For best results, also ensure that the expression is assigned to the Header or Footer
target workspace property through the Link Wizard rather than by editing the Title
text field for the Header or Footer region on the Style tab of the Properties editor.
Whatever expression is assigned to the target workspace property through the Link
Wizard will override the Title text entered in the Style tab when the workspace is
reached through the link.
The reason for the prompt for values is to allow the author of the
query/workspace to test whether or not the query returns the expected result set
for the provided values. The values are optional and need not be provided in order
to complete the assignment of the query to the view. The only impact of not
providing values is that the query triggers a syntax error when executed by the
Properties window Preview pane and the view is empty. In order for the query to
work correctly in the workspace, values must be provided for all the referenced
symbols through the execution environment. Typically, this is done by reaching the
Custom query processing differs from 'standard' query processing in that, for
custom queries, a value must be provided for every symbol reference while, for
'standard' queries, the system discards segments of the WHERE clause involving
symbols for which a non-null value cannot be found. In order to support historical
queries against summarized data, various agent groups distribute
product-provided custom queries that reference symbols that are meant to be
provided by the Timespan window when the 'Use summarized data' option is
selected. These queries are used with the product-provided 'Historical Summarized
...' workspaces available from the Workspaces menu of the Operating System
summary workspaces like 'Windows Systems'.
In order for the queries to return data, the Warehouse Proxy and Summarization
and Pruning agents must be configured and have been running long enough to
collect and summarize the data being viewed. The queries reference a number of
symbols. Following are some of the commonly referenced symbols and example
values:
Because of the open-ended nature of the custom queries and their provision by
multiple agent groups, it is possible that other symbols might also be used. One
way to understand what values are required is to save a copy of the query and
edit it to remove clauses involving the symbols. When the modified query is
assigned to a view (assuming any syntax errors are overcome), it should return an
unfiltered view of the table. The values in the table, in conjunction with close
examination of the query text, can be used as a guide to what to provide as filter
values.
Another symbol that has been specifically asked about is $Server_Name$. This is
the name of the server of interest. If the workspace is below the agent level in the
navigator tree, $NODE$ will usually return an appropriate value. If the value is
being provided through a link, it can often be picked from the Server Name
This message is harmless and is associated with verifying the versions of the gskit
component installed by IBM Tivoli Monitoring. The message is displayed at a point
where the UNIX installation normally pauses for a number of seconds, causing a
tendency to think the installation has failed, although this is not the case.
Do not interrupt or cancel the installation process at this point, wait for the
installation prompts to continue after the message is displayed and proceed to
finish the installation as you normally would.
Adding agent help files requires a restart of the Eclipse Help Server
and the Tivoli Enterprise Portal Server
When an agent's online help files are added to the eclipse server, they are not
available until the eclipse help server is restarted. This also requires a restart of the
Tivoli Enterprise Portal Server.
There are two possible fixes for this problem. Either manually set the file
permission to history dir, or add the db2 instance user to the root group. See the
IBM Tivoli Monitoring Installation and Setup Guide for instructions on how to
have monitoring agents work properly with SecureMain in place.
For more information about the krarloff command, see the topic on converting
files using the krarloff program in the IBM Tivoli Monitoring Administrator's Guide
(http://pic.dhe.ibm.com/infocenter/tivihelp/v61r1/topic/com.ibm.itm.doc_6.3/
adminuse/itm_admin.htm).
There has been a change in syntax where attributes that are enclosed in curly
brackets, {}, no longer are required to have quotes. See the following example:
&{grp1.atr1}.&{grp2.atr2}
Take Action commands do not work if unrequired values are left blank
The predefined Take Action commands work if None is provided in the keyword
and the value fields after at least one database (manager) configuration parameter
config-keyword value that you wish to update has been provided.
Restart the server by its correct name whether that is the default name or not.
Prerequisite checks
Review the prerequisite checks troubleshooting scenarios if you have trouble with
completion of the tacmd checkprereq command, tacmd createnode command with
the -o EXECPREREQCHECK=Y option, or the OS Agents Report Prerequisites Scanner
report.
If you are running Windows Server 2012, you must have .Net Framework V3.5 (or
later) installed to use the tacmd checkprereq command or to run the OS Agents
Report Prerequisites Scanner report provided by the OS agents report package (see
“Tivoli Common Reporting” in the IBM Tivoli Monitoring Administrator's Guide).
The error signifies that the command's timeout limit was exceeded. The
tacmd CLI program sets a 10 minute timeout to accommodate
long-running sub-commands, such as tacmd viewDepot, and long-running
scripts invoked by tacmd executecommand.
However, this timeout problem can still occur if the hub monitoring server
does not own the well-known SOAP Server listening ports 1920 and 3661
because another Tivoli Monitoring process on the same system has
acquired those ports first. There is a default two minute TCP timeout in
effect between the hub monitoring server and the Tivoli Monitoring
process that owns 1920 and 3661. The tacmd command failure occurs if
that two-minute timeout limit is exceeded.
Resolution
Guarantee that the hub monitoring server is the owner of the 1920 and
3661 ports:
v Ensure that the hub monitoring server is always the first IBM Tivoli
Monitoring process started on a system.
v If the hub monitoring server is recycled, other Tivoli Monitoring
processes on the same computer must also be recycled, with the hub
monitoring server restarted first so that it can acquire the 1920 and 3661
ports.
Initializing ...
Do you agree to the terms of the license in file LICENSE.TXT on the CD?
1-yes, 2-no, 3- to read the agreement now.
(choosing "2" exits the installation process) (1/2/3): 1
You are not entitled to install from this media. Setup will not proceed.
[root@sdogoff ud_062000000_li6263]#
Many interactive elements have been removed from the agent bundle output of the
tacmd exportBundle -o LOCAL command in order to optimize it for remote
transmission and silent execution using software distribution technologies. In order
to install the exported agent bundle, the silentInstall.sh or silentInstall.bat
script available in the destination directory should be run instead.
The tacmd addBundles command is used to add one or more deployment bundles
to the local agent deployment depot. By default, this command also adds all
deployment bundles that are prerequisites of the deployment bundle being added,
if the prerequisite bundles do not already exist in the depot. The tacmd
addBundles command requires double the size of the bundle disk space free in the
depot (including the agent and all prerequisite bundles). The kdyjava.log file in the
system temp directory provides additional information about the problem.
This occurs when using a tacmd command related to agents like tacmd
getdeploystatus or tacmd addsystem.
Ensure that you are using the right format for the product code. It must be a 2
digit product code, as in 'um' for the Universal Agent, and not 'kum'.
To make this change, add the following setting in the .vmx configuration file for
the virtual system where the tacmd command is being invoked.
monitor_control.virtual_rdtsc = false
If installing from images on an NFS mount, the NFS mounts need world execute
permissions to be accessible by the process doing the distribution.
The warehouse did not contain any historical data from the managed system for
the specified situation attribute during the specified time period. The time period
is established by the start time and end time, and is further constrained by the
calendar entries you specified. In addition, historical data collection must be
configured and enabled for the attribute group, and the warehouse proxy agent
must be configured and running on the same host as the managed system.
Verify that historical data collection is enabled for the appropriate attribute group
and that the warehouse proxy agent is installed and running on the same host(s)
as the specified managed system or managed systems.
The IBM Tivoli Monitoring command line environment expects the system to be
running in the Language_Locale convention, and is currently limited from
understanding other conventions. You can verify this problem by viewing the
kuiras1 log and looking for entries similar to the following:
(4C765377.008E-1:nls2.c,491,"NLS2_GetLocale") Entry
(4C765377.008F-1:nls2.c,494,"NLS2_GetLocale") Input parameters: languageId 0,
codepage 0, options 0.
(4C765377.0090-1:nls2.c,507,"NLS2_GetLocale") Zero language Id and codepage defined.
(4C765377.0091-1:nls2.c,3888,"NLS2_allocateLocale") Entry
(4C765377.0092-1:nls2.c,3907,"NLS2_allocateLocale") Preparing to initialize
Locale structure.
(4C765377.0093-1:nls2.c,3980,"NLS2_initLocaleObject") Entry
(4C765377.0094-1:nls2.c,3983,"NLS2_initLocaleObject") Get the current native locale.
(4C765377.0095-1:nls2.c,3991,"NLS2_initLocaleObject") Locale returned is turkish.
(4C765377.0096-1:nls2.c,4000,"NLS2_initLocaleObject") Getting the locale basename.
(4C765377.0097-1:nls2.c,4022,"NLS2_initLocaleObject") Locale basename is turkish.
(4C765377.0098-1:nls2.c,4024,"NLS2_initLocaleObject") Locate locale basename in
With the default code page setting, some systems might display corrupted
characters for the following tacmd commands:
v histconfiguregroups
v histcreatecollection
v histdeletecollection
v histeditcollection
v histlistattributegroups
v histlistcollections
v histlistproduct
v histstartcollection
v histstopcollection
v histunconfiguregroups
v histviewattributegroup
v histviewcollection
v exportcalendarentries
v importcalendarentries
v createsitassociation
v deletesitassociation
v listSitAssociations
v exportsitassociations
v importSitAssociations
v createsysassignment
v deletesysassignment
v listsysassignments
v exportsysassignments
v importsysassignments
v suggestBaseline
v acceptBaseline
v setOverride
v listOverrides
v deleteOverride
Procedure
This problem has to do with your locale and system configuration, and can be
fixed by performing the following procedure:
1. Open the command prompt.
2. Change the command prompt locale according to the following table:
3.
a. To change the locale in the command prompt, type chcp **** in the
command prompt, where **** is the new value for your code page, and
press enter. For example, if your system locale is Latin 2, type chcp 1250 in
the command prompt.
b. To check the results of this change, type chcp and press enter. The command
prompt will display the following message: Active code page: ****If the
value displayed after Active code page is the same as the value you just
entered, then you have successfully changed the settings. For example, if
your system locale is Latin 2, the command prompt should display the
message: Active code page: 1250
4. Change the font displayed within the command prompt.
a. You can do this by right-clicking the title bar and clicking Properties from
the drop-down menu.
b. Click the Font tab and select Lucida Console from the list of fonts in the
window.
c. Click OK.
d. A window will appear, allowing you to select the windows to apply the
font change to. Select Modify shortcut that started this window and click
OK.
5. You should no longer see corrupted characters in the CLI.
to:
<ActionName arg1="-n" arg2="--name" Type="String"
ValidationRegExp="[A-Za-z0-9 _:.\()/]" Required="Y"/>
By not enabling the enhanced command execution for subnodes, you can avoid
this issue. Instead, you can use the tacmd executecommand command against a
subnode manager agent that controls the subnode.
If the addBundlescommand times out, this message is not returned. Set the
TIMEOUT environment variable to more than 600 before running the addBundles
command. For example:
You can also reset the TIMEOUT after the command times out. Then run the
addBundles command.
You can also change the LAN linkspeed and duplex setting from auto detect to
100Mbps/Full Duplex. Then re-start the addbundle process.
tacmd createNode
If the createNode command fails, it might be because of the syntax or a
connectivity issue. Review the possible causes and resolutions to help you recover.
You are able to run the tacmd addBundles command for the Windows OS agent,
but when you try to deploy the agent with tacmd createNode on the Windows 7
and Windows 8 computers, the command fails.
Cause
The problem is related to being able to connect to the endpoint.
Solution
1. Enable Remote Registry:
a. Click Administrative Tools > Services.
b. Double-click “Remote Registry” (or right-click it and select
Properties).
c. In the General tab, set Startup type to Automatic and click OK.
2. Disable password protected sharing:
a. Click Control Panel > Network and Sharing Center > Advanced
sharing settings.
b. In “Password protected sharing” area, select the Turn off password
protect sharing radio button and click Save changes.
3. If you have a domain user account, ensure that the local and the target
machine are both members of a Windows domain.
4. If you are a member of a local administrators group and you use a
local user account, complete the following steps on the local computer
to enable administrative tasks on the target computer:
a. Click Administrative Tools > Local Security Policy.
b. Expand Security Settings > Local Policies > Security Options
c. To enable the built-in Administrator account for connecting to the
target computer, double-click “Accounts: Administrator account
status” and select the Enabled radio button.
d. If a different Administrator user account is to be used to connect to
the target computer, double-click “User Account Control: Run all
administrators in Admin Approval Mode” and select the Disabled
radio button.
e. Click OK.
The tacmd suggestbaseline command does not support pre-IBM Tivoli Monitoring
v6.2.1 agents.
When using the listSystems command, the last two digits for
the version appear as 'XX'
Extended version information for every agent might not always be available. When
this happens, the last two digits of the version displayed are represented as "XX".
This occurs for subnode agents or when agents are not enabled for Agent Deploy
support.
KUICRA006I: Are you sure you want to restart the UL agent(s) that manage
zpmaix13:KUL?
The monitoring server encountered an error while restarting the managed system.
If the error information returned from the server is not sufficient to help you
resolve the error, contact IBM Software Support.
Enable Deployment trace logging on the agent machine. Contact Customer Service
for details on this procedure. Collect the following log files
On Windows the log kdsmain.msg log is located in the {CANDLEHOME}\CMS
directory and {hostname}_ms_{timestamp}-XX.log files are located in
CANDLEHOME\logs directory.
On Unix-Based systems the logs {hostname}_{timestamp}.log and
{hostname}_ms_{timestamp}-XX.log is located in the {CANDLEHOME}/logs directory.
On the target Managed System Node machine collect the following log files.
On Windows the logs kdyproc_ras1_{timestamp}.log and {hostname}_nt_kntcma_
{timestamp}-XX.log are located in the {CANDLEHOME}\tmaitm6\logs directory.
On Unix systems the logs kdyproc_ras1_{timestamp}.log and {hostname}_
ux_kuxagent_{timestamp}-XX.log is located in the {CANDLEHOME}/logs directory.
On Linux systems the logs kdyproc_ras1_{timestamp}.log and {hostname}_
lz_klzagent_{timestamp}-XX.log is located in the {CANDLEHOME}/logs directory.
Refer to IBM Tivoli Monitoring v 6.2 Problem Determination Guide
for more information.
The user can verify the agent is stopped by running tacmd listSystems:
# /opt/IBM/ITM/bin/tacmd listsystems
Managed System Name Product Code Version Status
zpmaix13:KUL UL 06.20.00.00 N
zpmaix13:08 08 06.20.00.00 Y
amshp16.tivlab.raleigh.ibm.com:K UX 06.20.00.00 Y
TEMS_zpmaix13 EM 06.20.00.00 Y
where:
-t|--type
Specifies the type of managed system to add to the monitoring system. You
can specify a managed system type name or its associated two-character
code. Use viewDepot to display a list of correct managed system types.
-n|--node
Specifies the node to start. A node is identified by the managed operating
system that it contains.
MANAGED-OS
Specifies a correct managed operating systems.
-d|--dir|--directory} NODEDIR
Specified the correct name of the directory that contains the node
components, including the OS agent. This syntax is only correct when the
node is on the local system.
-i|--imagePath
Specified the correct directory that contains agent installation images.
-p|--property|--properties
Specifies one or more NAME=VALUE pairs that identify configuration
properties of the new system and their values. Run the
describeSystemType command to determine correct values for the
properties.
Either update the systems host tables and correctly setup the DNS domain search
so that the link between itmserver and itmserver.raleigh.ibm.com can be made, or
supply the monitoring server fully-qualified host name during the createnode
deployment using the SERVER=itmserver.raleigh.ibm.com property.
tacmd createSit does not send errors if you mistype the name
of an attribute
The tacmd createSit command enables you to create situations without using the
Tivoli Enterprise Portal. However, if you mistype the name of an attribute when
using this command, you do not receive an error. The situation is created, skipping
the attribute that you meant to type in the command. If the created situation had,
for example, 6 attributes to monitor, the new created situation has only 5 if you
made a mistake in typing 1 of the attribute names in the command.
If you are using the IBM Tivoli Monitoring command line tacmd createSit function
for situation creation, you can use the Situation editor in the Tivoli Enterprise
Portal to validate your specified attributes.
tacmd viewUser
Review the symptoms and resolutions for the tacmd viewUser command if you do
not get the results that you expect.
You receive a message that an option is repeating
The -v, -p -a, and -o options for this command are mutually exclusive. If
you enter more than one, you receive a message that the second option
entered is repeating. For example:
C:\IBM\ITM\bin>tacmd viewuser -u sysadmin -w mypassword -a -v
KUIC02022E: The command did not complete because -v option is repeating.
itmcmd commands
When the results of running an itmcmd are not what you expected, review the
symptoms and possible solutions.
You can bypass the prompt and continue with the agent configuration.
tivcmd commands
Review the symptoms and resolutions for the Authorization Policy Server
command line interface tivcmd commands when you do not get the results that
you expect.
Logging with debugging has a significant impact on performance of the agent. Use
logging with debugging turned on only when solving a problem, and switch it off
afterward. The log files for the agent are created in the Windows
install_dir\TMAITM6\logs or Linux or UNIX install_dir/logs directory.
Procedure
1. Stop the Performance Analyzer agent.
v Click Start > Programs > IBM Tivoli Monitoring > Manage Tivoli
Monitoring Services, then right-click on Performance Analyzer and select
Stop.
v Enter the itmcmd agent stop pa command.
2. Open the init.cfg file. Depending on your operating system, the file is in the
following directory:
install_dir\TMAITM6\config
install_dir/li6263/pa/config
install_dir/aix533/pa/config
install_dir/sol283/pa/config
3. Update the LogLevel and LogSize parameters:
LogLevel=Debug
LogSize=10000000
4. Start the Performance Analyzer agent.
v Click Start > Programs > IBM Tivoli Monitoring > Manage Tivoli
Enterprise Monitoring Services, right-click the Performance Analyzer entry,
and click Start.
v Enter the itmcmd agent start pa command.
Logging with debugging is enabled for the agent.
5. Wait until the Waiting for a period of 60000 message is displayed in the
kpacma.log file. The message means that the processing cycle of the agent is
over.
6. If you want to send the log files to support, stop the agent, compress the
kpacma*.log files to itpa_agent_log1.zip and send it.
7. Restore the default values to the LogLevel and LogSize parameters:
LogLevel=Warning
LogSize=1000000
8. Restart the agent.
Procedure
1. Stop the Performance Analyzer agent.
v On Windows platforms, click Start > Programs > IBM Tivoli Monitoring >
Manage Tivoli Monitoring Services, then right-click on Performance
Analyzer and select Stop.
v On other platforms, enter the itmcmd agent stop pa command.
2. Open the logging.properties file, located in the <home_directory>/IBM/
Java142/jre/lib directory.
3. Comment out the handlers= java.util.logging.ConsoleHandler:
#handlers= java.util.logging.ConsoleHandler
line
4. Uncomment the #handlers= java.util.logging.FileHandler,
java.util.logging.ConsoleHandler:
handlers= java.util.logging.FileHandler, java.util.logging.ConsoleHandler
line
5. Edit the values of the java.util.logging.FileHandler.limit and
java.util.logging.FileHandler.count properties in the following way:
java.util.logging.FileHandler.limit = 5000000
java.util.logging.FileHandler.count = 1
6. Add the
com.ibm.tivoli.pa.config.level = FINEST
line
7. Verify that the value of the java.util.logging.FileHandler.pattern property
is set to %h/java%u.log, where %h is the path to your home directory. This
property determines the location where log files are created.
8. Save and close the file.
Results
Detailed logging has been enabled.
What to do next
If you want to send the log files to IBM Software Support, start the TEP desktop
client and perform steps to reproduce the error, and then go to the user home
directory, where java*.log.* files are created. Compress the files to
itpa_agent_log1.zip and send it to IBM Software Support.
This message indicates that DB2 has exhausted all available shared memory
segments. To resolve this problem, you must configure your DB2 server to support
extended shared memory. To enable this support, complete the following steps:
1. From DB2 command prompt, run the following command:
export EXTSHM=ON
db2set DB2ENVLIST=EXTSHM
db2set –all
2. Edit cq.ini file in <itm_dir>/config/ and at the end of the file add this line:
EXTSHM=ON
3. Restart TEPS and DB2.
When tasks are started and when you should see data in the
workspaces
Tasks are run when the Analytic Agent starts and during each time period
specified for the task. Depending on your data collection size and database server
performance, you can expect to see data within 5 - 30 minutes. However, if you
have not previously activated the Summarization and Pruning Agent and you use
the default daily schedule of 2 a.m., you might need to run the agent overnight
before summary tables are created and workspaces populated.
If the agent is active and historical data collection is configured, your configuration
may be wrong. Confirm the answers to following questions:
v Is the connection configuration information for the Tivoli Enterprise Portal
Server and Tivoli Data Warehouse correct?
v Is the Performance Analyzer Warehouse agent running?
v If you installed on a distributed system did you install the correct support files
on each workstation?
If the workspaces are visible but unavailable, the Analytic Agent ran in the past
but conditions have changed: either the agent is no longer running or the
connection configuration for Tivoli Enterprise Monitoring Server has changed.
To add an index to the _PA table, run the appropriate command from the database
command prompt on the computer where the Tivoli Data Warehouse is running.
Create an index for each of the nonlinear attribute groups:
v DB2
CREATE INDEX DB2INST1.ITMIXFCMEM ON DB2INST1.attribute_group_name_PA
("System_Name" ASC) ALLOW REVERSE SCANS
v Oracle
CREATE INDEX ITMIXFCMEM ON attribute_group_name_PA (System_Name ASC)
v MS SQL
CREATE INDEX ITMIXFCMEM ON attribute_group_name_PA (System_Name ASC)
v DB2 on z/OS
CREATE INDEX ITMIXFCMEM ON attribute_group_name_PA (System_Name ASC)
Create an index for each of the nonlinear attribute groups, adding the suffix _PA to
the name of each group:
v KPA_GENERIC_D32_NLT_STATUS_PA
v KPA_GENERIC_D64_NLT_STATUS_PA
v KPA_GENERIC_I32_NLT_STATUS_PA
v KPA_GENERIC_I64_NLT_STATUS_PA
v KPA_GENERIC_D32_NLT_FCAST_PA
v KPA_GENERIC_D64_NLT_FCAST_PA
v KPA_GENERIC_I32_NLT_FCAST_PA
v KPA_GENERIC_I64_NLT_FCAST_PA
v CPU_Utilization_NLT_Fcast_PA
v Mem_Utilization_NLT_Fcast_PA
v Disk_Utilization_NLT_Fcast_PA
v Net_Traffic_In_NLT_Fcast_PA
v Net_Traffic_Out_NLT_Fcast_PA
v CPU_Utilization_NLT_Status_PA
v Mem_Utilization_NLT_Status_PA
v Disk_Utilization_NLT_Status_PA
v Net_Traffic_In_NLT_Status_PA
v Net_Traffic_Out_NLT_Status_PA
This problem indicates that the kernel on the machine is not at the required level.
To resolve the problem, upgrade the kernel to 2.6.18-274.12.1.el5 or higher. See also
the Software Product Compatibility Reports (http://pic.dhe.ibm.com/infocenter/
prodguid/v1r0/clarity/index.html).
After completing these two steps, the catalog database is restored and the
monitoring server works correctly. No data is lost in the process.
If the Windows Local Security Settings are enabled for long or complex passwords,
ensure your password meets those syntax requirements for these IDs. If your
Windows environment requires you to change passwords regularly do the
following to change the portal server database user account password.
The Tivoli Data Warehouse requires one bufferpool and three tablespaces to begin
its operation. The bufferpool and tablespaces are created by the warehouse user
before the Warehouse Proxy Agent starts, provided the warehouse user has
administrative authority to the database. A warehouse user with limited authority
cannot create the required bufferpool and tablespaces. Therefore, the procedure to
limit the authority of the warehouse user includes steps to create the bufferpool
and tablespaces in advance. However, in a DB2 pureScale environment, the
Warehouse Proxy agent cannot create the regular tablespace ITMREG8K. The
workaround is to connect to the warehouse database with the Warehouse Proxy
Agent user, for example, “ITMUser”. Then create the tablespaces using the
following SQL statements:
db2 create regular tablespace ITMREG8K pagesize 8k managed by automatic storage
bufferpool ITMBUF8k
db2 create user temporary tablespace ITMUSER8K pagesize 8k managed by automatic
storage bufferpool ITMBUF8k
db2 create system temporary tablespace ITMSYS8K pagesize 8k managed by automatic
storage bufferpool ITMBUF8k
where "WHA_USER" represents the Warehouse Proxy Agent user, for example,
“ITMUser”.
Procedure
v Continue installing the software, using the following steps to resolve the error:
1. When the installation completes, log off the current Windows user session
and log in with using the db2admin userid.
2. Run install_dir\installITM\DB2DataSource.exe from Windows Explorer or
a command prompt. You can run this program again, even if one or more of
the tasks completed the first time it ran.
3. Start the Tivoli Enterprise Monitoring Server after the software indicates the
Tivoli Enterprise Portal Server configuration was successful.
4.
v If the installation is complete but it does not start, the data source might not be
defined. Review the install_dir\cnps\kfwras1.log file. If error messages
similar to the following are present in the log, the data source was not
defined:[IBM][ODBC Driver Manager] Data source name not found and no
default driver specified. Use the follow steps to verify whether the data
source was created and to resolve the problem:
1. Open the ODBC datasource window: Start > Settings > Control Panel >
Administrative tools and double-click the Data Sources (ODBC) .
2. Verify that the IBM DB2 ODBC DRIVER datasource is defined.
3. If the IBM DB2 ODBC DRIVER data source is not present, run the
install_dir\installITM\DB2DataSource.exe file.
What to do next
Error messages are displayed in the trace file if any configuration errors are
detected in filter variables such as KSY_PRODUCT_FILTER, KSY_TABLE_FILTER,
or KSY_SUMMARIZATION_SELECTION. Such messages indicate which table or
product do not exist in the application support or if the summarization selection is
incorrect.
If the filters are configured correctly but errors are detected at the database level,
such as tables already partitioned or tables do not exist in the database, the
migration tool generates only the SQL statements for the tables that can be
migrated and error messages are logged in the trace file for the tables that cannot
be migrated. When these errors occur, the following message appears on the
standard output:
Warning: One or more tables may not have migration steps generated.
See the log file for additional information.
When the migration tool has generated all the scripts successfully with a warning,
the following message appears on the standard output:
When the migration tool has generated all the scripts successfully without
warnings, the following message appears on the standard output:
Status tables
You can review the status tables to help you troubleshoot your errors.
The tdw_migrate_step1.sql script provides the return codes based on your database
type. Refer to the IBM Tivoli Monitoring Administrator's Guide for a complete list of
return codes.
When migrating a table on DB2 for Linux, UNIX, or Windows, you might receive
return codes 1, 2, or 4.
1: Indicates that the source table could not be renamed to MIGRATING_*.
2: Indicates that the partitioned table could not be created.
4: Indicates that the table MIGRATING_* could not be renamed to DONE_*.
To determine the cause of each of these errors, note the SQL code and SQL state
values that are provided in the output when the table is migrated. To get more
information about an SQL code, refer to the DB2 error code documentation or issue
the db2 ? <SQL CODE> command.
The SQL code and SQL state values are also stored in the
WAREHOUSE_MIGRATION_STATUS table. To retrieve the values:
1. Start the DB2 command line processor. Do not issue SQL from the operating
system's command line to avoid having to escape quotes in SQL text.
2. Connect to the Tivoli Data Warehouse database as the Tivoli Data Warehouse
user:
connect to <TDW database> user <TDW user ID> using <password>
3. Issue the SQL:
SELECT sqlcode, sqlstate FROM WAREHOUSE_MIGRATION_STATUS WHERE tablename =
’<name of table being migrated>’
For example:
SELECT sqlcode, sqlstate FROM WAREHOUSE_MIGRATION_STATUS WHERE
tablename = ’NT_Process_64’
4. Once the cause of the error is resolved, the migration script
tdw_migrate_step1.sql can be rerun.
When migrating a table on DB2 for Linux, UNIX, or Windows, a return code of 3
indicates the load step has failed. This means there was a failure calling the LOAD
stored procedure in the tdw_migrate_step1.sql script for a given table. The load
step could fail for many different reasons including:
v The Tivoli Data Warehouse user doesn't have sufficient privileges to run the load
utility.
v Insufficient space in the migrated table's tablespace for the data being loaded.
MSGRETRIEVAL
-------------------------------------------------------------------
SELECT SQLCODE, MSG FROM TABLE(SYSPROC.ADMIN_GET_MSGS(’32727_ITMUSER’))
AS MSG
4. Issue the SQL query that was returned in step 3. This displays the messages
from the DB2 LOAD utility.
In this example, there were no errors in the load step of the migration:
SQLCODE MSG
------- ----------------------------------------------------------
SQL3501W The table space(s) in which the table resides will not be
placed in backup pending state since forward recovery is disabled
for the database.
SQL1193I The utility is beginning to load data from the SQL statement
"SELECT * FROM ITMUSER."MIGRATING_NTPROCESS"".
SQL3500W The utility is beginning the "LOAD" phase at time
"01/14/2013 11:26:47.793424".
SQL3519W Begin Load Consistency Point. Input record count = "0".
SQL3520W Load Consistency Point was successful.
SQL3110N The utility has completed processing. "96785" rows were
read from the input file.
SQL3519W Begin Load Consistency Point. Input record count = "96785".
SQL3520W Load Consistency Point was successful.
SQL3515W The utility has finished the "LOAD" phase at time
"01/14/2013 11:26:59.426438".
9 record(s) selected.
5. Fix the problems mentioned in the load utility's message files and re-execute
the tdw_migrate_step1.sql script.
Using DB2 V9.1 for z/OS, Warehouse Proxy agent encounters a large
number of disconnections
When you use DB2 for z/OS 9.1 for the warehouse database, the Warehouse Proxy
agent can encounter repeated disconnections from the database.
The default idle thread timeout value (DSN6FAC IDTHTOIN in DSNZPxxx) is 120
seconds. The Warehouse Proxy agent uses a pool of database connections to
The database connections are idle between export requests, and if the idle thread
timeout value is less than the warehousing interval, the database connections
might timeout. This results in numerous error messages written to the Warehouse
Proxy agent log. The Warehouse Proxy agent "Statistics" workspace will also show
a large number of Disconnections in the "Failure / Disconnections" view.
To avoid repeated disconnections, consider increasing the DB2 idle thread timeout
value to a value higher than the warehousing interval. Specifying a value of 0
disables timeout processing. If timeout processing is disabled, idle server threads
remain in the system and continue to hold their resources, if any.
For more information on the DB2 IDLE THREAD TIMEOUT field (IDTHTOIN
subsystem parameter), refer to the DB2 Version 9.1 for z/OS Installation Guide in the
DB2 Version 9.1 for z/OS information (http://pic.dhe.ibm.com/infocenter/
dzichelp/v2r2/topic/com.ibm.db2z9.doc/src/alltoc/db2z_planhome.htm).
Wild card matching is not supported. The only way to mimic that functionality
would be to use the substring or scan for string functions instead of the default
value and equals. The equals operator only works with full matches.
If you need to install the Summarization and Pruning agent, see the IBM Tivoli
Monitoring Installation and Setup Guide. There you can find information for
environment-wide capacity planning. You can find agent-specific capacity planning
information in the user guide for the specific monitoring agent.
The following errors can occur in the khdras1.log if the globalization system
environment variable is not set correctly:
ORACLE error: [Oracle][ODBC][Ora]ORA-01461: can bind a LONG value only
for insert into a LONG column
1. Set the environment variable
NLS_LANG=AMERICAN_AMERICA.AL32UTF8 as a system
environment on the Windows computer on which the Warehouse Proxy
is installed.
2. Restart the Windows computer so that the Warehouse Proxy windows
service recognizes the change.
DB2 error: SQL0302N The value of a host variable in the EXECUTE or OPEN
statement is too large for its corresponding use. SQLSTATE=22003 sqlstate =
22003
When you configure a DB2 Warehouse Proxy connection from the Manage Tivoli
Enterprise Monitoring Services utility using the Configure DB2 Datasource for
Warehouse window, the user ID the Warehouse Proxy uses to connect to the
warehouse database must have SYSADM permission. SYSADM permission is
required to create an 8K Tablespace and Bufferpool.
Windows
If the database is on Windows, the user must be a member of the local
Administrators group.
UNIX-based system
If the database is on Linux or UNIX user must belong to the SYSADM
group.
1. Log in as the DB2 instance owner (usually "su - db2inst1"),
2. Run the following command to determine the group that the
UNIX-based system user must belong.
db2 get dbm cfg | grep SYSADM
If you are installing event synchronization on Linux and see the message below in
the log file, you must install the libXp shared library and then run the event
synchronization installation program again:
java.lang.UnsatisfiedLinkError: /tmp/isjSlpnGj/jre/bin/libawt.so: libXp.so.6:
cannot open shared object file: No such file or directory
You must update the Netcool/OMNIbus ObjectServer database schema with the
IBM Tivoli Monitoring automations before you update the probe's
tivoli_eif.rules file to include the itm_event.rules file or the probe won't start.
See the topic "Updating the OMNIbus database schema" in the IBM Tivoli
Monitoring Installation and Setup Guide for details on this procedure.
If you have updated the OMNIbus database schema with the IBM Tivoli
Monitoring automations but the probe will not start because the BSM_Identity
attribute is not defined, check if the itm_event.rules file has been modified to
include the tbsm_eif_event.rules file or if tivoli_eif.rules is including other
rules files that set the BSM_Identity attribute. If you are not integrating IBM Tivoli
Monitoring, Netcool/OMNIbus, and Tivoli Business Service Manager, comment out
any rules files (like tbsm_eif_event.rules) that are intended for Tivoli Business
Service Manager integration and are setting BSM_Identity. However, if you are
using Tivoli Business Service Manager, ensure you have installed the OMNIbus
automations provided with that product because those automations ensure that
BSM_Identity is added to the ObjectServer database schema.
tacmd vieweventdest -i 0
Server Id : 0
Server Name: Default EIF Receiver
Server Type: TEC
Description: default EIF event listener
Default : Y
Host1 : nswin02:5527
tacmd vieweventdest -i 1
Server Id : 1
Server Name: nswin01-OM
Server Type: Micromuse/Omnibus
Description: Windows OMNIbus Server
Default : N
Host1 : nswin01:9998
Note: You can also check the Netcool/OMNIbus Probe for Tivoli EIF log file and
the Netcool/OMNIbus ObjectServer log file to determine if those components are
unable to process the events. See “Log files for Netcool/OMNIbus Event
Synchronization” on page 286 to determine the location of the log files.
See also the technote, “EIF events from ITM not clearing in Netcool/OMNIbus”
(http://www-304.ibm.com/support/docview.wss?uid=swg21584791&myns=swgtiv
&mynp=OCSSSHTQ&mync=E)technote
Use the following steps to change the TCP/IP timeout for your computer.
When a report fails to generate or generates incorrectly, click View the report with
errors to make diagnosis of the underlying problem easier.
If you get errors while running the tdw_schema_insert.sql script against a DB2
database, review the options for resolving the issue.
Symptom
You receive errors while running the tdw_schema_insert.sql script against
a DB2 database to insert data, indicating that the transaction log is full.
Solution
Take one of the following actions to resolve the issue:
v Reduce the transaction by changing the call to create a smaller range of
dates.
v Increase the transaction log space:
1. Connect to the WAREHOUS database with the DB2 administrator user
ID. Example:
db2 connect to WAREHOUS user DB2_administrator_user_ID using pw
2. Get the current LOG settings. Example:
db2 get db cfg for WAREHOUS |grep LOG
3. Change one or all of the LOGFILSIZ, LOGPRIMARY, and LOGSECOND
settings. For example:
LOGFILSIZ=5000, LOGPRIMARY=100, LOGSECOND=12
Run the script again with the ITMUSER user ID. Disregard any errors about
the insertion of data in MONTH_LOOKUP and WEEKDAY_LOOKUP, because that
data was inserted correctly the first time the script was executed.
If you see JavaScript errors in the reports that you create, look for “Caused by” in
the stack trace. This phrase points out the line number of the script in the report
design at which the error occurred. To see the SQL query that was generated by
this error, look at the log file.
To remedy this issue, manually delete the *.rtm files located in the
cognos/data/cqe/RTModels directory.
This available disk space accommodates temporary data to generate reports on the
Tivoli Common Reporting Server when you use Tivoli Monitoring provided
reports.
The statement could not be processed because it exceeds a system limit for either
length or complexity. Change the DB2 configuration by increasing the db2
statement heap size STMTHEAP to run the report.
You are missing drivers after the Tivoli Common Reporting installation
In this scenario, you have installed Tivoli Common Reporting and imported a
report package, but when you try to run a report, you see this message from Tivoli
Common Reporting:
CTGTRV014E The report cannot be successfully formatted because it completed
with errors, reference ID [REPORTID_3_OBJECTID_7ec67ec6].
Click on the following link to view the report with the errors.
CTGTRV011E See the Tivoli Common Reporting log files for more information.
https://localhost:30343/TCR/Reports/view
This problem is occurring because you are missing drivers required by Tivoli
Common Reporting. You did not copy the required driver files or copied incorrect
driver files. Refer to the “Configuring a JDBC data source” topic in the IBM Tivoli
Common Reporting: User's Guide for additional information about this requirement.
For example, if your Tivoli Data Warehouse is running on DB2 and you selected
View the report with errors, you would see error message:
No Data Returned
Warning: No data is available for this parameter set.
To address this problem for a DB2 database, you would need to copy the following
files:
v db2jcc.jar
v db2jcc_license_cu.jar
Typically found in this default DB2 installation path or in the java directory of
whatever alternate path you specified for DB2 installation, into this directory.
Make these corrections and then try running the report again.
Click OK to dismiss the error message; and the integrated portaland Tivoli
Common Reportingreports remain operational.
The report fails to generate because the SQL query was not valid
If you have made changes to the reports or to the workspace against which the
query is being run, you could see SQL query errors. Typically this error looks
similar to this:
The following items have errors:
Additional information about the error might be found in the SQLERRMC file.
If you experience SQL errors, try generating the report with a different set of
parameters. If all executions of the report generate SQL errors, it is likely that, for
some reason, the running report is not compatible with your database.
This message is followed by some SQL codes and the name of the table or view
that is absent.
This message is displayed when the tables and views from which data is to be
extracted do not exist in the database. To view the SQL query that generated this
problem and determine what historical data is not being collected, review the log
files. See “Locations of log files” on page 302 for the locations of log files.
As well as the following scenario, consider that if you are generating reports for
more than 5,000 agents, especially from multiple days of history, you will also get
Java out-of-memory errors.
These error messages are generated by Tivoli Common Reporting and might
indicate that you need to increase the default heap size for the JVM on the java
command to start the Tivoli Common Reporting server. To confirm that this is the
problem, complete the following steps:
Procedure
1. Open the SystemOut.log file, found in this location: On Windows Systems:
<tcr_install_dir>/tcr/eWas61/profiles/tcrProfile/logs/tcrServer/
2. Confirm that this line can be found: An OutOfMemory error happened while
running the report. If you find this line, do the following to correct this
problem:
3. Stop Tivoli Common Reporting.
4. Edit the startServer.bat file, usually found in this location: On Windows
Systems: C:\Program Files\IBM\tivoli\tip\bin
5. Locate this instruction about half way through the bat file:
"%JAVA_HOME%\bin\java" -Xms128m -Xmx512m -Dcmd.properties
.file=%TMPJAVAPROPFILE% %WAS_TRACE% %WAS_DEBUG% %CONSOLE_ENCODING% "%CLIENTSAS%"
"%CLIENTSSL%" %USER_INSTALL_PROP% "-Dwas.install.root=%WAS_HOME%" com.ibm.ws
.bootstrap.WSLauncher com.ibm.ws.management.tools.WsServerLauncher
"%CONFIG_ROOT%" "%WAS_CELL%" "%WAS_NODE%" %* %WORKSPACE_ROOT_PROP%
6. Increase the JVM heap size by adding the -xms and -xmx options, as in the
example shown below:
"%JAVA_HOME%\bin\java" -Xms128m -Xmx512m -Dcmd.properties
.file=%TMPJAVAPROPFILE% %WAS_TRACE% %WAS_DEBUG% %CONSOLE_ENCODING% "%CLIENTSAS%"
"%CLIENTSSL%" %USER_INSTALL_PROP% "-Dwas.install.root=%WAS_HOME%" com.ibm.ws
.bootstrap.WSLauncher com.ibm.ws.management.tools.WsServerLauncher
"%CONFIG_ROOT%" "%WAS_CELL%" "%WAS_NODE%" %* %WORKSPACE_ROOT_PROP%
7. Re-start the Tivoli Common Reporting server and try generating the report
again.
Verify at the Tivoli Enterprise Portal console that you see the Historical Collection
icon for the subject workspace and can draw a report for the subject time period. If
you cannot, you do not have historical collection enabled. Start it and try the task
again.
Or:
CTGTRV011E See the Tivoli Common Reporting log files for more information.
https://localhost:30343/TCR/Reports/view
Finding the statement SQLSTATE 22003 indicates that you are experiencing a SQL
arithmetic overflow at the Tivoli Data Warehouse database.
To remedy the problem, choose a smaller time period (the recommended action) or
possibly a different metric. This change causes the calculations being performed for
the report to process less data, reducing the chance of overflow.
Note: All occurrences of this failure in the IBM-provided reports were eliminated
by casting DB2 Integer values to DECIMAL(column_name 31,0). This problem should
not occur with IBM-provided reports, but could be an issue in customer-generated
reports.
When you check version levels, you find that the version of database manager you
are using for Tivoli Data Warehouse is correct and that it is defined as your data
source and that all the requested columns in the attribute table are available.
However, even though data is displayed in the other columns of this table in Tivoli
Enterprise Portal, no data is available in the column needed to draw the graph.
Historical data sets have been allocated, historical collection has been configured
and enabled , and some data has been collected. Tivoli Common Reporting is
configured and is drawing a partial report. If the report does not meet your needs,
you may be able to obtain the needed data by one of these methods:
v By choosing a different attribute on the Report Parameters window.
v By ensuring that data for the selected attribute is available for this workspace
configuration. For example, the OMEGAMON XE for Mainframe Networks
monitoring agent cannot collect data for some attributes in OSA reports,
depending on how the OSA adapter is configured.
Confirm that the values for these attributes are being displayed in Tivoli Enterprise
Portal. If no data is being collected for a key attribute because of configuration
issues, consult the product manual to see what configuration change would
provide data for the missing attributes.
To determine the cause of this problem you need to enable detailed logging in
Tivoli Common Reporting, as described in the IBM Tivoli Common Reporting: User's
Guide, and you need to know which agent table (or attribute group) contains the
attribute that is not being displayed. Go to the Tivoli Enterprise Portal console and
validate that this column contains data and, if the issue is with summarized data,
that summarization is enabled for this attribute.
Another reason might be that you have not configured historical collection when
you installed and configured the monitoring agent or that not enough time has
passed for Tivoli Data Warehouse to collect the data for your requested report time
The solution might require changes to the persistent datastore, the z/OS repository
for short-term historical data. Those changes should be made using the
Configuration Tool.
The generated report displays the message “The requested data is not
available”
In this scenario, you complete the report parameters window to generate a report
and click Submit. After clicking Submit, you see a blank report page with no
graphed or tabular information and the error message The requested data is not
available is displayed in the message area.
When you check version levels, you find that Tivoli Data Warehouse is your data
source, but none of the requested columns in the Tivoli Enterprise Portal attribute
table contain data.
This indicates that no data is being collected in Tivoli Data Warehouse. Verify this
by querying the same table in the database or by requesting data from the
matching workspace, for this time period, in the Tivoli Enterprise Portal. If, for
example, your installation stopped collecting historical data for this report 8 days
ago, and you query the last 7 day, no data will be returned. If the managed
resource (for example, a TN3270 Server or a CICS® Region) were taken offline, then
no data can be collected.
To address the time period issue, expand the time period for your query. To correct
the resource availability issue, ensure that the managed resource is online and your
agent is collecting data for it.
The solution might require changes to the persistent datastore, the z/OS repository
for short-term historical data. Those changes should be made using the
Configuration Tool.
In Tivoli Common Reporting V2.2 and V3.1, the OS agent Lineage reports are not
working.
If you think you have received this message incorrectly, run the Exception Report
to find if there is another reason why no data is being returned.
Audit events in the system reflect authorization and authentication failures, and
major and minor changes, but do not reflect minor service messages stored in the
RAS logs.
For more information about common logging, see “Audit logging” in the IBM
Tivoli Monitoring Administrator's Guide.
Audit Log workspace shows only 100 of the most recent audit records
By default, all Tivoli Monitoring components show only the 100 most recent audit
records in the Audit Log workspace.
Only audit events that were created since the component was started are
displayed.
If you want to display audit records for events that occurred before the most recent
component startup, you must enable historical data collection for the ITM Audit
attribute group and distribute the history collection settings to the components you
want to have access to the historical audit data.
Audit Log workspace does not display records before the latest
component startup
The Audit Log workspace shows audit records that are generated since the
component was most recently started.
To access audit records that were generated before the latest restart, collect audit
records historically from critical Tivoli Monitoring components.
If you do not find the resolution to the problem you experience with the interface,
refer to the IBM OMEGAMON Enhanced 3270 User Interface Guide for more
information.
The OMEGAMON Enhanced 3270 user interface uses WLM Services to discover
Data Retrieval Agents that are running in its Sysplex. The interface uses TCP/IP
services to communicate with Data Retrieval Agents.
The Data Retrieval Agent uses TCP/IP services to communicate with the hub
monitoring server. The Data Retrieval Agent uses WLM Services to register or
publish their existence within the Sysplex.
The OMEGAMON Enhanced 3270 user interface uses VTAM services to enable you
to log on and communicate with the interface. You log on to the OMEGAMON
Enhanced 3270 user interface through a VTAM Appl ID that is opened during
startup of the interface.
The following components comprise the startup and operation environment for the
OMEGAMON Enhanced 3270 user interface and the OMEGAMON agents:
OMEGAMON Enhanced 3270 user interface address space
Although startup of the OMEGAMON Enhanced 3270 user interface
address space is relatively fast, it should probably be the last component in
the startup sequence; because the interface requires that all other
components in its environment (the OMEGAMON monitoring agents, the
Data Retrieval Agent, and the hub and or remote monitoring servers) be
initialized and running before it is able to retrieve data.
OMEGAMON Enhanced 3270 user interface local registry
The OMEGAMON Enhanced 3270 user interface startup process discovers
registered Data Retrieval Agents and connects to related hub monitoring
servers in order to establish a local registry of data source information; that
is, managed systems names and managed system lists. After startup, by
default, the registry is refreshed on a five minute interval.
OMEGAMON agent and OMEGAMON Enhanced 3270 user interface Data
Retrieval Agent
The OMEGAMON XE V5.1.0 agent address spaces run both an instance of
the product agent and also an instance of a OMEGAMON Enhanced 3270
user interface Data Retrieval Agent. The Data Retrieval Agent uses WLM
services to publish or register its existence. The product agent registers
with the hub monitoring server; these registration processes facilitate
OMEGAMON Enhanced 3270 user interface discovery of OMEGAMON
agents. Note that some OMEGAMON agents, such as OMEGAMON XE on
z/OS, run under a remote z/OS monitoring server; the OMEGAMON
Enhanced 3270 user interface; the Data Retrieval Agent also runs under
remote z/OS monitoring servers.
Figure 2. OMEGAMON Enhanced 3270 user interface workspace depicting a case of no data after the initial log on
Note: The initial workspace parameter setting is specified in the log on profile
with the FIRSTWS parameter.
You can use the capability in the OMEGAMON Enhanced 3270 user interface for
investigating the root cause of the no data condition.
Use the OMEGAMON Enhanced 3270 user interface to assign values for your site
and or user ID named log on profile.
Procedure
1. From the OMEGAMON Enhanced 3270 user interface, select View > 2. Hub
Information. The Current Hub TEMS Information panel is displayed. If you see
a panel similar to the following panel indicating that no values are assigned,
either no site and or user ID-named log on profile has been created, or the
profile for this session has not been customized to specify hub monitoring
server settings.
a. Verify that a site (CUASITE) or user_ID named data set member exists in
rte.UKOBDATF .
b. Verify that the hub monitoring server settings in the profile for the current
session have been customized to specify the required hub monitoring server
configured values. For example, see Figure 4 on page 319.
2. After you customize the log on profile member, log off the OMEGAMON
Enhanced 3270 user interface and log on to pickup the profile changes.
Results
If you repeat Step 1, you will see the hub settings you have specified as shown in
the following panel.
The hub monitoring server settings in the log on profile do not match
the hub monitoring server configured values
Verify that the rte.UKOBDATF profile members specify the configured hub
monitoring server settings and that these settings match the hub monitoring server
configured values.
The settings that are displayed in the Current Hub TEMS information panel in
figure 4 indicate that a custom profile member has been created and customized;
however, the no data condition persists. Inspect the profile for the current session
to determine if the specified settings match the configured hub monitoring server
settings.
For example, Figure 4shows that the server name is set to RTE1.CMS , with a
period. However, the actual configured hub monitoring server name is RTE1:CMS ,
with a colon.
Correct the settings in the rte.UKOBDATF profile member, then log off and log on
to the OMEGAMON Enhanced 3270 user interface to pickup the profile changes.
The Current Hub TEMS Information panel will display the correct settings as
shown in the following example:
If you have verified the existence of a custom profile, in which the hub monitoring
server settings are correctly specified, but the no data condition persists, you need
to verify that there are registered data retrieval agents.
From the OMEGAMON Enhanced 3270 user interface, select View > 3. Data
Retrieval Agents. The KOBLOGON workspace, and, in particular the List of
available ITM/TEMS Data Sources panel, is opened and lists all the available data
retrieval agents and their associated hub server as shown in the following panel.
The three rows shown in figure 6 list the hub monitoring server as RTE1:CMS; this is
an indication that there are three registered data retrieval agents running in the
same Sysplex as the OMEGAMON Enhanced 3270 user interface address space.
Assuming that there are no problems with agent data collection, the OMEGAMON
Enhanced 3270 user interface is able to connect to any one of these data retrieval
agents to retrieve OMEGAMON data from a V5.1.0 monitoring agent that is
connected to the RTE1:CMS hub monitoring server.
If there are no data retrieval agents registered for a given hub monitoring server,
the KOBLOGON workspace will either be empty or, if there are multiple hub
monitoring servers configured, it might contain data retrieval agent rows for other
monitoring servers running in the Sysplex.
If the KOBLOGON workspace list does not contain the hub monitoring server that
is specified in the OMEGAMON Enhanced 3270 user interface log on profile, there
is a high probability that the OMEGAMON XE on z/OS agent(s) and or related
monitoring server address space are off line.
The hub monitoring server is off line; verify initialization and data
request reception
Verify the hub monitoring server is started, initialized, and prepared to receive
data requests; listening on the configured TCP/IP port.
The following monitoring server log messages are a good indication regarding the
health of hub monitoring server operations:
..
KDSMA001 Tivoli Enterprise Monitoring Server (TEMS) data collection server started
...
KO4SRV032 Tivoli Enterprise Monitoring Server (TEMS) startup complete
..
If you cannot find these messages in the hub monitoring server log, examine the
log for indications of potential problems. For example:
See the "Monitoring server troubleshooting" section of this book for more
information.
Application product support (seed data) has not been added to the
hub monitoring server
Verify the hub monitoring server has the product version (for example, V5.1.0)
application support (seed data) loaded.
If you did not complete this configuration step, your OMEGAMON Enhanced 3270
user interface might be missing data for one or more OMEGAMON products.
For a hub monitoring server on a z/OS system, see the "Adding application
support to a monitor server on z/OS" section in the IBM Tivoli Monitoring
Installation and Setup Guide.
For a hub monitoring server on a distributed system, see the "Installing application
support on monitoring server" section in the IBM Tivoli Monitoring Installation and
Setup Guide.
Note: The V5.1.0 product application support files must also be loaded in the run
time environment libraries that are associated with the OMEGAMON Enhanced
3270 user interface started task. For a new or upgraded run time environment, the
run time environment load configuration step updates the application support.
You will see the following messages in the OMEGAMON Enhanced 3270 user
interface log file when the application support files are back-leveled or missing:
...
KOBUIGP1I Source ODI does not contain table Kppxxxxx ...
KOBUIGP9I ERROR: No ODI will cause an erroneous display ...
KOBUIGP1W ODI Failure ignoring SORTCOLS=’...
...
Also, see “The OMEGAMON Enhanced 3270 user interface local registry does not
contain required agent information” on page 324.
From the OMEGAMON Enhanced 3270 user interface, select View > 4. Managed
Systems. The Only Managed Systems panel is displayed and lists the available
names of the local registry managed systems.
Figure 7. Only Managed Systems panel listing the available names of the local registry
When you examine the Only Managed Systems (KOBMSNS) workspace content
and it indicates there are no online agents for a given product, then examine the
agent address space to verify that it has been started and it has successfully
initialized on line. In some cases, it might be necessary to verify that the
corresponding monitored systems or subsystems (for example, CICS regions) are
also running.
The OMEGAMON Enhanced 3270 user interface local registry does not
contain required agent information
The OMEGAMON Enhanced 3270 user interface local registry must have accurate
information about the configuration of the environment to enable the composition
and routing of data queries to appropriate OMEGAMON agent managed systems.
The registry is initially populated during the address space initialization process
and thereafter, by default, at five minute intervals.
Use the OMEGAMON Enhanced 3270 user interface Options menu to examine the
local registry.
The Online Managed Systems Lists workspace lists the rows with managed
system list names.
The following table lists the naming conventions for the OMEGAMON products
of the managed systems list names:
Product Managed System List naming convention
OMEGAMON XE on z/OS v *MVS_SYSPLEX
v *MVS_SYSTEM
OMEGAMON XE for CICS on z/OS v *CPIRA_MGR *GWIRA_MGR
v *IBM_CICSplexes
v *IBM_CICSplex
v *MVS_CICSTG
v *MVS_CICS
v KCP_CICSplex_plex_name
OMEGAMON XE for DB2 PE *MVS_DB2
OMEGAMON XE for IMS on z/OS v *MVS_IMSPLEX
v KIP_ims_system_IMS
OMEGAMON XE for Mainframe Networks v *OMEGAMONXE_MAINFRAME_NTWK_TCP
v
*OMEGAMONXE_MAINFRAME_NTWK_VTAM
v *OMEGAMONXE_MAINFRAME_NTWK
OMEGAMON XE for Messaging mq_ID:lpar_smfid:MQESA
OMEGAMON XE for Storage on z/OS agent_jobname:lpar_smfid:STORAGE
Note: In the case where there are multiple instances of IBM Tivoli Monitoring
configurations running in a common Sysplex, the configurations must be
configured with unique names; at least one of the configurations must provide
an override Sysplex name (pseudo name) so that both configurations are able to
start a Sysplex proxy (and agent). The workspace notice, Sysplex Data
Unavailable: Enter ’ZOSLPARS’ for LAPR Data depicts the case where perhaps
this has not been done and the Sysplex proxy is unable to start in one of the
configurations; the result is that the OMEGAMON Enhanced 3270 user interface
is unable to render Sysplex data for that Sysplex.
By default, the OMEGAMON Enhanced 3270 user interface is configured with the
request time out parameters shown in the following table:
Elongated response times when interacting with the OMEGAMON Enhanced 3270
user interface might be a symptom of time out conditions. For example, during log
on, the initial Enterprise Summary (KOBSTART) workspace might take a
significant amount of time (more than a few seconds) to render and or the
workspace is rendered with partial or no data.
Note: The following message is written to the SYSPRINT log files when request
time-outs occur:
The following items identify the common causes for delay and or time out
conditions. Investigation of these conditions might be complex, this information
provides you with some hints for further investigation:
v The hub monitoring server is running under degraded system conditions (heavy
system workload or an under-capacity system) and is being delayed when
attempting to service OMEGAMON Enhanced 3270 user interface data requests.
In this example, examine the availability and priority of the system resources
provided to the hub monitoring server.
v The OMEGAMON agent is running under degraded system conditions (heavy
system workload or an under-capacity system) and is being delayed when
attempting to service OMEGAMON Enhanced 3270 user interface data requests.
In this example, examine the availability and priority of the system resources
provided to the OMEGAMON agent.
v Data requests submitted from the OMEGAMON Enhanced 3270 user interface to
a given Data Retrieval Agent, thorough a TCP/IP conduit, are being impacted
by degraded network conditions. In this case, the availability, priority, and
configuration of network resources and paths associated with communications
between the OMEGAMON Enhanced 3270 user interface and the hub
monitoring server and OMEGAMON agents should be examined.
v An OMEGAMON component (monitoring server or agent) that played a role in
a given data request path has gone off-line; the LPAR was terminated, or the
address space was terminated. In this case, “The OMEGAMON Enhanced 3270
user interface local registry does not contain required agent information” on
page 324 to investigate the status of OMEGAMON components, (Managed
Systems: on line or off line).
v The hub and or a remote monitoring server is experiencing operational issues
and is being delayed when attempting to service OMEGAMON Enhanced 3270
user interface data requests. A misconfiguration or a special site or
environmental configuration requirements might lead to operational issues.
For example:
– Operational issues might arise if a monitoring server running on a z/OS
operating system is experiencing problems writing to its’ Historical Persistent
Datastore files.
– Operational issues might occur if a monitoring server is unable to bind to its
configured TCP/IP port number.
v The IP domain name resolution is not fully configured on the z/OS operating
system where the OMEGAMON Enhanced 3270 user interface, Tivoli Enterprise
Monitoring Server and or agent address spaces are running. Also, there might be
more than one TCP/IP task running on the z/OS operating system; for these
cases, the OMEGAMON address spaces, the OMEGAMON Enhanced 3270 user
interface, Tivoli Enterprise Monitoring Server and or agent started task JCL
procedures must specify the IP name resolution configuration data set to be
specified through the SYSTCPD DDNAME statement.
v The hub Tivoli Enterprise Monitoring Server is running on a system that has
multiple network interfaces and perhaps the preferred and or universally known
interface is not being employed; this results in IP connection issues that manifest
on the interface as a possible sporadic, no data condition. Refer to the following
tech note for more information related to this type of configuration; the use of
the KDEB_INTERFACELIST parameter. See http://www-01.ibm.com/support/
docview.wss?uid=swg21282474.
For information about accessing and using the publications, select Using the
publications in the Contents pane of the IBM Tivoli Monitoring and
OMEGAMON XE Information Center at http://pic.dhe.ibm.com/infocenter/
tivihelp/v61r1/index.jsp.
To find a list of new and changed publications, click What's new on the Welcome
page of the IBM Tivoli Monitoring and OMEGAMON XE Information Center. To
find publications from the previous version of a product, click Previous versions
under the name of the product in the Contents pane.
The following publications provide information about using the base agents.
v Operating system agents:
– Windows OS Agent User's Guide, SC22-5451
– UNIX OS Agent User's Guide, SC22-5452
– Linux OS Agent User's Guide, SC22-5453
– IBM i Agent User's Guide, SC22-5454
v Agentless operating system monitors:
– Agentless Monitoring for Windows Operating Systems User's Guide, SC23-9765
– Agentless Monitoring for AIX Operating Systems User's Guide, SC23-9761
– Agentless Monitoring for HP-UX Operating Systems User's Guide, SC23-9763
– Agentless Monitoring for Solaris Operating Systems User's Guide, SC23-9764
– Agentless Monitoring for Linux Operating Systems User's Guide, SC23-9762
v Warehouse agents:
– Warehouse Summarization and Pruning Agent User's Guide, SC22-5457
Related publications
For information about related products and publications select OMEGAMON XE
shared publications or other entries in the Contents pane of the IBM Tivoli
Monitoring and OMEGAMON XE Information Center at http://pic.dhe.ibm.com/
infocenter/tivihelp/v61r1/index.jsp .
The IBM Support Assistant saves you the time it takes to search the product,
support, and educational resources. The IBM Support Assistant helps you gather
support information when you need to open a problem management record
(PMR), which you can then use to track the problem.
The product-specific plug-in modules provide you with the following resources:
v Support links
v Education links
v Ability to submit problem management reports
For more information, and to download the IBM Support Assistant, see
http://www.ibm.com/software/support/isa. After you download and install the
IBM Support Assistant, follow these steps to install the plug-in for your Tivoli
product:
1. Start the IBM Support Assistant application.
2. Select Updater on the Welcome page.
3. Select New Properties and Tools or select the New Plug-ins tab (depending on
the version of IBM Support Assistant installed).
4. Under Tivoli, select your product, and then click Install. Be sure to read the
license and description.
Obtaining fixes
A product fix might be available to resolve your problem. To determine which
fixes are available for your Tivoli software product, follow these steps:
1. Go to the IBM Software Support website at http://www.ibm.com/software/
support.
2. Under Select a brand and/or product, select Tivoli.
If you click Go, the Search within all of Tivoli support section is displayed. If
you don't click Go, you see the Select a product section.
3. Select your product and click Go.
4. Under Download, click the name of a fix to read its description and, optionally,
to download it.
If there is no Download heading for your product, supply a search term, error
code, or APAR number in the field provided under Search Support (this
product), and click Search.
For more information about the types of fixes that are available, see the IBM
Software Support Handbook at http://www14.software.ibm.com/webapp/set2/sas/
f/handbook/home.html.
Before contacting IBM Software Support, your company must have an active IBM
software maintenance contract, and you must be authorized to submit problems to
IBM. The type of software maintenance contract that you need depends on the
type of product you have:
v For IBM distributed software products (including, but not limited to, Tivoli,
Lotus®, and Rational® products, as well as DB2 and WebSphere products that
run on Windows or UNIX operating systems), enroll in Passport Advantage® in
one of the following ways:
Online
Go to the Passport Advantage website at http://www-306.ibm.com/
software/howtobuy/passportadvantage/pao_customers.htm .
By telephone
For the telephone number to call in your country, go to the IBM
Software Support website at http://techsupport.services.ibm.com/
guides/contacts.html and click the name of your geographic region.
v For customers with Subscription and Support (S & S) contracts, go to the
Software Service Request website at https://techsupport.services.ibm.com/ssr/
login.
v For customers with Linux, iSeries®, pSeries®, zSeries®, and other support
agreements, go to the IBM Support Line website at http://www.ibm.com/
services/us/index.wss/so/its/a1000030/dt006.
v For IBM eServer™ software products (including, but not limited to, DB2 and
WebSphere products that run in zSeries, pSeries, and iSeries environments), you
can purchase a software maintenance agreement by working directly with an
IBM sales representative or an IBM Business Partner. For more information
about support for eServer software products, go to the IBM Technical Support
Advantage website at http://www.ibm.com/servers/eserver/techsupport.html.
If you are not sure what type of software maintenance contract you need, call
1-800-IBMSERV (1-800-426-7378) in the United States. From other countries, go to
the contacts page of the IBM Software Support Handbook on the web at
http://www14.software.ibm.com/webapp/set2/sas/f/handbook/home.html and
click the name of your geographic region for telephone numbers of people who
provide support for your location.
IBM may have patents or pending patent applications covering subject matter
described in this document. The furnishing of this document does not give you
any license to these patents. You can send license inquiries, in writing, to:
For license inquiries regarding double-byte (DBCS) information, contact the IBM
Intellectual Property Department in your country or send inquiries, in writing, to:
The following paragraph does not apply to the United Kingdom or any other
country where such provisions are inconsistent with local law :
Any references in this information to non-IBM Web sites are provided for
convenience only and do not in any manner serve as an endorsement of those Web
sites. The materials at those Web sites are not part of the materials for this IBM
product and use of those Web sites is at your own risk.
Licensees of this program who wish to have information about it for the purpose
of enabling: (i) the exchange of information between independently created
programs and other programs (including this one) and (ii) the mutual use of the
information which has been exchanged, should contact:
IBM Corporation
2Z4A/101
11400 Burnet Road
Austin, TX 78758 U.S.A.
The licensed program described in this document and all licensed material
available for it are provided by IBM under terms of the IBM Customer Agreement,
IBM International Program License Agreement or any equivalent agreement
between us.
All statements regarding IBM's future direction or intent are subject to change or
withdrawal without notice, and represent goals and objectives only.
All IBM prices shown are IBM's suggested retail prices, are current and are subject
to change without notice. Dealer prices may vary.
This information is for planning purposes only. The information herein is subject to
change before the products described become available.
This information contains examples of data and reports used in daily business
operations. To illustrate them as completely as possible, the examples include the
names of individuals, companies, brands, and products. All of these names are
fictitious and any similarity to the names and addresses used by an actual business
enterprise is entirely coincidental.
COPYRIGHT LICENSE:
Each copy or any portion of these sample programs or any derivative work, must
include a copyright notice as follows:
© IBM 2013. Portions of this code are derived from IBM Corp. Sample Programs. ©
Copyright IBM Corp. 2013. All rights reserved.
If you are viewing this information in softcopy form, the photographs and color
illustrations might not be displayed.
Trademarks
IBM, the IBM logo, and ibm.com are trademarks or registered trademarks of
International Business Machines Corp., registered in many jurisdictions worldwide.
Other product and service names might be trademarks of IBM or other companies.
A current list of IBM trademarks is available on the web at “Copyright and
trademark information” at www.ibm.com/legal/copytrade.shtml.
Adobe, Acrobat, PostScript and all Adobe-based trademarks are either registered
trademarks or trademarks of Adobe Systems Incorporated in the United States,
other countries, or both.
Intel, Intel logo, Intel Inside, Intel Inside logo, Intel Centrino, Intel Centrino logo,
Celeron, Intel Xeon, Intel SpeedStep, Itanium, and Pentium are trademarks or
registered trademarks of Intel Corporation or its subsidiaries in the United States
and other countries.
Microsoft, Windows, Windows NT, and the Windows logo are trademarks of
Microsoft Corporation in the United States, other countries, or both.
UNIX is a registered trademark of The Open Group in the United States and other
countries.
Java and all Java-based trademarks and logos are trademarks or registered
trademarks of Oracle and/or its affiliates.
Notices 341
Cell Broadband Engine is a trademark of Sony Computer Entertainment, Inc. in the
United States, other countries, or both and is used under license therefrom.
Linear Tape-Open, LTO, the LTO Logo, Ultrium, and the Ultrium logo are
trademarks of HP, IBM Corp. and Quantum in the U.S. and other countries.
Index 345
pdcollect portal server (continued) situations (continued)
Authorization Policy Server 189 turn on trace 49 action commands 213
pdcollect tool 11, 68 USE_EGG1_FLAG=1 114 non-hub 159
performance analyzer prerequisite check 249 reflex automation 30
problems after upgrading 267 problem resolution 335 UADVISOR 146
Performance Analyzer 265 problem solving 11 SMC 333, 335
adding an index 270 problems, types of 1 snapcore 5
agent does not connect to monitoring process names 329 SOAP commands 163
server 271 processes 164 software requirements 11
common reporting 299 pureScale Software Support 335
enabling loggine 266 ITMREG8K 274 contacting 337
enabling logging 265 table and tablespace 274 receiving weekly updates 336
installation and configuration 266 WAREHOUSEMARKER 274 SQL errors 307
kernel 271 SQL queries 305
nonlinear tasks 270 SQL statement 305
persistent data store 25
ping 33
Q SQL0480N 277
SQL0552N 284
Questions 71
policy authorization startup
distribution issued 197 Authorization Policy Server 194
policy client portal client 135
trace 187 R Startup Center 75
Policy Distribution Client 191 RAS trace logs 5, 8 statement parameter 243
pop-up blocker 149 RAS1 summarization and pruning agent 56
portal client syntax 46 Summarization and Pruning agent 55
See also Tivoli Enterprise Portal tracing on z/OS 60 superseded workspace 214
agents offline 153 RAS1 interface 63 support assistant 335
browser client RAS1 log Support Assistant 335
Linux using Firefox 136 reading 45 support portal
security message 137 ras1log tool 68 subscribing 2
browser mode uninstall Redbooks 333, 335 viewing 2
procedure 67 reflex automation 30, 213 sysadmin
certificate validation failure 138 refreshTECinfo 297 LDAP 156
data not returned to 149 Registry Services 182
DB2 errors 132 release notes
help not displayed 149
historical configuration failure 146
See support portal
remote deploy
T
tacmd 212
historical data 146 agent and OS agent on different
agent status mismatch 15
install Java extension 138 servers 77
cleanMS 249
Java security concern message 139 review 324, 326
executeaction 212
Java Web Start 137, 138, 139 RKPDLOG 25
listSystems 249
logon 141
slow response time 249
Navigator Physical view order 150
timeout failure 250
no view data 12
not starting in Firefox 135
S tacmd checkprereq 249
schema publication tool 301 tacmd commands 57, 249
Oracle Java download page 135
SECJ0373E 204 VMWare 253
security message 137, 139
security tacmd createnode 249
set trace options 48
Windows local 274 tacmd listappinstallrecs 176
situations 147
self-describing agent tacmd listSystems 252
SQL1_CreateRequest failed 145
hot standby 221 tacmd login
startup 135, 136
UNIX monitoring server application hot standby 175
tabbed workspaces 142
support 94 tacmd settrace 61
troubleshooting 135
self-describing capability tacmd viewUser 262
workspaces 142, 214
hot standby 89, 176 Take Action 248
portal server
installation and configuration 87 take action commands 213
connectivity with hub 131
installation sqlStatus 79 errors 89 TCP/IP timeout setting 297
database connection 275
missing managed system group 89 tdw_migrate_setup.sql 277
does not start 130
usage 219 tdw_migrate_step1.sql 277
does not start or stops responding 17
z/OS monitoring server support 89 technotes
failed connection 126
service console 61 See also support portal
logon failure 123
blocking access 65 subscribing to 2
logs 11
login 62 Technotes 333
mismatched application support
RAS1 interface 63 telnet session 172
files 75
Service Management Connect 333, 335 TEPS databse 273
non-hub situations 159
settings 313, 317 tivcmd 97, 195
slow performance 157
silent installation 101 tivcmd CLI 195
status alerts 130
Situation Update Forwarder 59 tivcmd commands 263
troubleshooting 155
situations 28, 147 Tivoli Common Reporting 299
Index 347
348 IBM Tivoli Monitoring: Troubleshooting Guide
Printed in USA
GC22-5449-00