Documentum Maintenance Procedure Checklist
Documentum Maintenance Procedure Checklist
After the initial Documentum installation and rollout of the first phase, it is essential to
follow a maintenance/procedure checklist to assure maximum system performance and
stability.
Documentum Administrator
Many of the maintenance procedures and jobs are configured or accessed through
Documentum
Administrator (DA):
Maintenance
Logs to Monitor
It is highly recommended to check all logs periodically for errors and warnings.
Application Server
Name: stdout_yyyymmdd.log (example: stdout_20090218.log)
Location: \Program Files\Apache Software Foundation\Tomcat 6.0\logs
Purpose: shows warnings and errors from Webtop and TBOs.
• Name: DocbaseName.log
• Location: C:\Documentum\dba\log
• Purpose: Shows the repository startup output and any warnings or errors.
The Content Server has a state of the docbase job (dm_StateOfDocbase) which monitors
this. Also the data drive should be monitored.
Jobs
Some of the jobs below are not active OOTB. They have to set to active and started on a
schedule. Be sure to set the run times so that they do not conflict other jobs and backup
schedules.
dm_ContentWarning
dm_LogPurge
dm_StateOfDocbase
dm_AuditMgt
• Purpose: Removes old audit trail entries A key parameter is the cutoff in days, basically
how many days worth of audits to keep.
• args: -queueperson, -custom_predicate r_gen_source=1, -window_interval 1440,
-cutoff_days 1
• Note the "cutoff_days" parameter should be set to a reasonable number of days,
balancing compliance and trouble shooting issues.
dm_QueueMgt
dm_UpdateStats
• Purpose: Updates RDBMS statistics and reorgs tables (if RDBMS supports)
• args: -window_interval 120, -queueperson, -dbreindex READ, -server_name
SQL2\SQL2005
dm_ConsistencyChecker
dm_DataDictionaryPublisher
dm_LDAPSynchronization
dm_FTStateOfIndex
dm_GwmTask_Alert
dm_GwmClean
The following statements are some of the DQLs that EMC support had us run to determine
the
number of audit trails and queue items that were in the repository:
Select count(*) from dmi_queue_item
Ideally, the Content Server should be shutdown prior to running the back up of the SQL
Server database and started back up afterward. This will reduce any likelihood of the
repository becoming out of synch with the database and the content files.
Before applying any patches or upgrades to any of the Documentum suite and supporting
applications, be sure to check for compatibility. Apply any patches or upgrades to the dev
and QA environments and test them first.
If any network interruption occurs, then service logs should be checked for compromised
activity. The Content Server and Tomcat server may need to be restarted. The logs of the
application and content servers should be periodically monitored for errors and warnings.
Performance
If RAM is filled or CPU utilization is maxed out then the service responsible should be
checked. If the service is a Documentum service, it should be restarted and root cause
should be determined. Utilization should be monitored and any anticipated spikes in use or
additional services need to be load tested and analyzed. What should you do if Tomcat
performance slows? If the concurrent users reach EMC’s limit of 20, EMC will recommends
adding a second Tomcat server.
EMC Support gives the basic JVM settings to cover for common exceptions and crashes.
There
are a number of other settings to add as more traffic occurs on the Tomcat server. From
the
DCM Installation Guide:“To achieve better performance, add these parameters to the
application server startup command line:
• -server-XX:+UseParallel01dGC
Document caching can consume at least 80MB of memory. User session caching can
consume approximately 2.5 MB to 3 MB per user. Fifty connected users can consume over
200 MB of VM memory on the application server. Increase the values to meet the
demands of the expected user load.”
Monitor Sessions
DA
• Purpose: set this script at a command line prompt to output how many active and inactive
sessions are current on the content server. Set the interval between output and how many loops
to run.
• Using DA, look at how many “active” users sessions are currently in the repository.
How many “inactive” sessions.
• Try reducing the session timeout value in the web.xml on the Tomcat server to see if
the inactive sessions get cleared out faster.
• As more users access the system, it may become necessary to create a second Tomcat
(clustered) instance to ease the load on just one application server.
• As more content get added to the system, more disk space will need to be added to
the filestore drive.
• Set up failover services for all key components
• Add more Java Method Servers if lifecycle processing overwhelms the existing one.
• A comprehensive content archiving plan will need to be designed and implemented.
• Setup a disaster recovery site if the system’s service level agreement (SLA) is
sooner than a new system could be built with backups.