Error "Other watchdog instance (pid xxxx) is running for more than 540 sec, killing it!"
Last Updated February 24, 2011
You see the following error in the logs "Other watchdog instance (pid 10392) is running for more than 540 sec, killing it!"
Symptoms You receive mail notifications with the following content:
Generally a first notification is sent with similar contents to the following: "Other watchdog instance is running, running time = 300 sec, exiting"
You may also receive another notification after 5 minutes from the above: "Error "Other watchdog instance (pid xxxx) is running for more than 540 sec, killing it!" You see the following error in the logs "Other watchdog instance (pid xxxx) is running for more than 540 sec, killing it!"
Conditions You are using an SMS Appliance running OS software version 5.x Common hardware models likely to experience similar issues: 8220/8320/8240/8260
Common conditions leading to similar issues:
Appliance configured in Control Center + Scanner
Logs, Reports and Quarantine max retention times set to values higher than 15 days
Reports gathering extended data (see Settings/Reports)
Appliance is processing a total number of messages around 200k per day or more
SNMP is configured on the SMS appliance
The watchdog is an internal process running on SMS Appliances which periodically (every 10 minutes) creates statistics data to send to the Control Center. In some cases (see Conditions) the watchdog takes more than 10 minutes to complete its execution, therefore the above notifications/errors are generated.
This message indicates an internal conflict that was automatically resolved. This error does not indicate an issue requiring a response. However, if the software version installed is not 9.0.2 or greater please upgrade to at least version 9.0.2. If the alerts continue after upgrading to version 9.0.2 or greater then proceed with the possible resolution the steps listed below.
Possible resolution steps:
Change your configuration settings in order to retain less data.
Change the SMS appliance with a more powerful hardware model.
Separate Control Center and Scanner roles by adding another Scanner only.
If load balancing is used to distribute the mail load on scanners, add another scanner to the array or make sure the mail load is distributed evenly between scanners.
If cron daemon/watchdog errors persist then restart the SNMP services. To restart SNMP services:
Open Command Line Interface and login as admin.
Run the following commands:
service afasnmpd restart service percsnmpd restart service lsisnmpd restart service snmpd restart
Note: Not all of the services above are supported on every SMS Appliance platform. So if you get a response stating that the service isn't supported on this hardware platform then just proceed with the next service.
Imported Document ID: TECH84692
Subscribing will provide email updates when this Article is updated. Login is required to Subscribe