9.2.4 Alarms related to process monitoring are not reported as intended
Note the following when you are monitoring the process operation status of a monitored host that is running UNIX: An error alarm might be reported even though the monitoring target process is not stopped, and then a normal alarm might be reported at the following collection time.
In a UNIX environment, when a process generates child processes, copies of these processes are created, and as a result duplicate copies of the same processes might appear to exist. Therefore, keep in mind that the number of processes increases when a process that generates child processes is the monitoring target. Specifically, an error alarm can be reported if process information is collected at the time the number of processes increases, and a normal alarm can then be reported if process information is collected at the time the number of processes returns to 1.
To avoid this phenomenon, take the following steps:
-
If the maximum number of concurrently existing child processes that will be generated by the monitoring target process is clear, specify the result of the formula shown below for the upper threshold of the number of monitoring target processes. Here, m indicates the maximum number of active processes and n indicates the maximum number of concurrently existing child processes per process.
m × (1 + n)
If the calculation result exceeds 65,535, specify 65535.
-
If the maximum number of concurrently existing child processes that will be generated by the monitoring target process is unclear, specify 65535 for the upper threshold of the number of monitoring target processes.
If the process operation status information could not be collected from the OS, the number of monitoring target processes might become 0 and an alarm might be reported. To prevent this alarm, from the Alarms window, open the New Alarm Table > Main Information window or the Edit > Main Information window. Then in Advanced settings, select Report alarm when the following damping condition is reached and specify 2 occurrence(s) during/Interval(s).