WatchDog element
Configures the watchdog process SLWatchdog.
Parents
Children
Name | Occurrences | Description |
---|---|---|
All | ||
Actions | [0, 1] | Specifies the action to perform. Possible values: alarm, restart, switch. Multiple values must be separated by a semicolon (";"). |
[0, 1] | Configures email settings. | |
Errors | [0, 1] | Specifies how many consecutive missed thread signals (errors) are tolerated before the system considers a thread to be in a problematic state and triggers error handling actions. |
FailoverOnRTE | [0, 1] | Configures settings related to performing a Failover switch when a runtime error (RTE) occurs. |
ProcessMonitor | [0, 1] | Configures settings related to the process monitor. |
TechsupportNotifications | [0, 1] | Configures settings related to tech support notifications. |
TimeoutTime | [0, 1] | Specifies the timeout time in minutes. |
Remarks
You can configure Watchdog to:
Initiate a Failover switch in case of a runtime error, by specifying the value "switch" in the tag. Optionally, to exclude certain threads from initiating a Failover switch, add the <FailoverOnRTE> subtag and specify the threads in <SkipRTE> subtags.
Note
- If a Failover switch is launched, the DMA is then also restarted to make sure that it frees the virtual IP address. Before the restart is initiated, the DMA is marked as "offline".
- If DataMiner Watchdog is set to initiate a Failover switch in case of a runtime error, it will even do so if the Failover type is set to "Manual" in the Failover settings.
Initiate an element restart in case of a runtime error on an element-related SLProtocol thread, by adding the attribute restartElementOnProtocolRTE, and setting it to true.
Note
- To make these changes to MaintenanceSettings.xml take effect, after you have saved the file, stop the DMA, manually stop the SLWatchdog service, and then restart the DMA.
- If DataMiner Watchdog is set to initiate both a Failover switch and an element restart, then the latter takes precedence. No Failover switch will be initiated when the element restart succeeds.