21.7.1 Procedure for detecting factors that might stop job execution
This section describes the procedure for monitoring and reporting factors that might prevent scheduler services from executing jobs.
Note that these factors can also be monitored by using the Web GUI (Management Portal). To use the Web GUI (Management Portal), you must set it up on the manager host. For details about how to set it up, see 6.7 Settings related to the Web GUI (Management Portal) (for Windows) or 15.8 Settings related to the Web GUI (Management Portal) (for Linux).
For details about performing operations in the Web GUI (Management Portal), see 16. Management Portal of the Web GUI in the JP1/Automatic Job Management System 3 Operator's Guide.
(1) Definition procedure
- Stop the service.
  - In Windows: In Windows Control Panel, open the Services administrative tool, and stop the following service:
    - JP1/AJS3 service
  - In UNIX: Execute the following commands to stop the service and confirm that all processes have stopped:
    # /opt/jp1ajs2/bin/jajs_spmd_stop
    # /opt/jp1ajs2/bin/jajs_spmd_status
- Execute the following command to set the environment setting parameter described in (2) below:
  jajs_config -k "definition-key" "parameter-name"=value
- Restart JP1/AJS3.
  The new settings are applied.
(2) Environment setting parameter
Definition key | Environment setting parameter | Explanation |
---|---|---|
 | "AlarmContentJobEventInterval"= | Interval for issuing a JP1 event when the number of jobs in the Now queuing or Now running status exceeds the value specified for the AlarmContentJob environment setting parameter |
[{JP1_DEFAULT\|logical-host-name}\JP1AJSMANAGER\scheduler-service-name]# | "CONDGENWARNINT"= | Interval for checking the number of retained execution generations of jobnets with start conditions before a message is output |
 | "CONDGENWARNNUM"= | Threshold for the number of retained execution generations of jobnets with start conditions |
 | "AlarmUnreportedInfoEvent"= | Threshold for monitoring the number of unreported information items for the scheduler service, at which a message is output and a JP1 event is issued |
 | "AlarmUnreportedInfoEventInterval"= | Interval for outputting a message and issuing a JP1 event during monitoring of the number of unreported information items for the scheduler service |
#: The specification of the {JP1_DEFAULT|logical-host-name} part depends on whether the host is a physical host or a logical host. For a physical host, specify JP1_DEFAULT. For a logical host, specify the logical host name.
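The rule in the note above, combined with the command form shown in (1), can be sketched as a small shell helper that composes the command line without running it. This is only an illustration under stated assumptions: the function names are hypothetical, AJSROOT1 is used as an example scheduler service name, and `value` is left as a placeholder because the value format for each parameter must be taken from the parameter reference.

```shell
#!/usr/bin/env bash
# Sketch only: compose (but do not execute) a jajs_config command line.
# Function names are hypothetical and not part of JP1/AJS3.

# Build the definition key.
#   $1 = scheduler service name
#   $2 = logical host name (omit for a physical host, which uses JP1_DEFAULT)
definition_key() {
    local service="$1"
    local host="${2:-JP1_DEFAULT}"
    # In the printf format, each \\ produces one literal backslash.
    printf '[%s\\JP1AJSMANAGER\\%s]' "$host" "$service"
}

# Print the jajs_config command for one parameter.
#   $1 = definition key, $2 = parameter name, $3 = value (placeholder here)
jajs_config_command() {
    printf 'jajs_config -k "%s" "%s"=%s\n' "$1" "$2" "$3"
}

# Example: physical host, scheduler service AJSROOT1 (assumed name).
key=$(definition_key "AJSROOT1")
jajs_config_command "$key" "CONDGENWARNNUM" "value"
```

Printing the command first makes it easy to review the definition key before the command is run with the service stopped, as the procedure requires.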
For details about the definition of these environment setting parameters, see the following:
(3) Settings for performance reports
If a factor that might stop job execution is detected, you can determine whether corrective action is required by checking how the values output by the performance report function change over time. For this reason, we recommend that you enable the performance report function. For details about how to enable the performance report function, see 6.6.1 Settings for the performance report output function (for Windows) or 15.7.1 Settings for the performance report output function (for UNIX).
The following table lists the ajsreport command column that corresponds to each monitored item.
Monitored item | Column name of the ajsreport command |
---|---|
Number of jobs in the Now queuing or Now running status | CONTENTJOB_NUM |
Number of generations of a jobnet with start conditions | RUNNING_AND_SCHEDULED_NEXT_GENERATIONS_NUM |
Number of unreported information items for the scheduler service | EVJOB_UNREPORTED_EVENT_NUM |
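As a rough illustration of checking how these values change over time, the following sketch extracts one named column from a report and prints the values that exceed a threshold. It assumes the report has already been saved as comma-separated text with a header row containing the column names above. That layout, the TIME column, the function names, and the sample numbers are all assumptions made for illustration, so check the actual ajsreport output format before adapting it.

```shell
#!/usr/bin/env bash
# Sketch only: assumes a comma-separated report with a header row.
# Function names are hypothetical and not part of JP1/AJS3.

# Print every value of a named column. The report is read from stdin.
#   $1 = column name
report_column() {
    awk -F',' -v col="$1" '
        NR == 1 { for (i = 1; i <= NF; i++) if ($i == col) idx = i; next }
        idx     { print $idx }
    '
}

# Print only the values of a column that exceed a threshold.
#   $1 = column name, $2 = threshold
over_threshold() {
    report_column "$1" | awk -v limit="$2" '$1 + 0 > limit + 0'
}

# Made-up sample data, for illustration only.
sample='TIME,CONTENTJOB_NUM,EVJOB_UNREPORTED_EVENT_NUM
10:00,120,0
10:01,4800,3
10:02,5100,7'

# Show the CONTENTJOB_NUM samples above an example threshold of 4000.
printf '%s\n' "$sample" | over_threshold CONTENTJOB_NUM 4000
```

Matching the column by name rather than by position keeps the sketch usable if the report contains additional columns.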