Job Management Partner 1/Automatic Job Management System 3 Administration Guide
When JP1/AJS3 starts, multiple processes are generated. You can set up JP1/AJS3 - Manager and JP1/AJS3 - Agent to automatically restart a process that has terminated abnormally for whatever reason.
The restart setting described here applies to JP1/AJS3 that is not operating in a cluster system. If you want to automatically restart processes in a cluster system, use cluster software. For details, see 11.1 Overview of cluster systems. Note that if you specify the -HA option on a logical host, the restart settings are disabled.
Automatic restarting is set in the extended startup process definition file. We recommend using the default values. In JP1/AJS3, the processes that can be restored by restarts are set by default (in JP1/AJS2, no processes are to be restarted by default). For details about the default values of the restart settings for each process, see Tables 7-16 to 7-21.
To change the settings, edit the extended startup process definition file, and then restart JP1/Base and JP1/AJS3. In Windows Server 2003, you will need to change the Dr. Watson settings, as described in (1) Changing the Dr. Watson settings (Windows Server 2003 only).
The extended startup process definition file is in the following location.
- In Windows:
- JP1/AJS3-installation-folder\conf
- In UNIX:
- /etc/opt/jp1ajs2/conf
The tables below list the processes applicable to the restart setting. Applicable processes are only the child processes or detailed processes of the JP1/AJS3 - Manager and JP1/AJS3 - Agent services in the table. You cannot set automatic restart for any other processes.
- In Windows:
Table 7-9 Processes applicable to the restart setting (JP1/AJS3 - Manager in a standard configuration)
No. Child process name or detailed process name Extended startup process definition file Process that can be restarted 1 jajs_dbmd.exe jp1ajs_service_0700.conf jajs_dbmd.exe 2 jajs_hstd.exe jp1ajs_service_0700.conf jajs_hstd.exe and detailed process# 3 ajshlogd.exe jp1ajs_hstd_0700.conf ajshlogd.exe 4 ajsinetd.exe jp1ajs_hstd_0700.conf ajsinetd.exe 5 ajsnetwd.exe jp1ajs_hstd_0700.conf ajsnetwd.exe 6 ajsagtmd.exe jp1ajs_hstd_0700.conf ajsagtmd.exe 7 ajsovstatd.exe jp1ajs_hstd_0700.conf ajsovstatd.exe 8 ajsgwmasterd.exe jp1ajs_hstd_0700.conf ajsgwmasterd.exe 9 ajsqlcltd.exe jp1ajs_hstd_0700.conf ajsqlcltd.exe 10 jpqman.exe jp1ajs_hstd_0700.conf jpqman.exe 11 jpomanager.exe jp1ajs_hstd_0700.conf jpomanager.exe 12 jajs_schd.exe jp1ajs_service_0700.conf jajs_schd.exe and detailed process# 13 ajslogd.exe jp1ajs_schd_0700.conf ajslogd.exe 14 jpqman.exe jp1ajs_schd_0700.conf jpqman.exe 15 jpomanager.exe jp1ajs_schd_0700.conf jpomanager.exe 16 ajsmasterd.exe jp1ajs_schd_0700.conf ajsmasterd.exe 17 jajs_agtd.exe jp1ajs_service_0700.conf jajs_agtd.exe and detailed process# 18 jpqmon.exe jp1ajs_agtd_0700.conf jpqmon.exe 19 jpoagent.exe jp1ajs_agtd_0700.conf jpoagent.exe
- #
- For details about JP1/AJS3 detailed processes, see B. List of Processes in the manual Job Management Partner 1/Automatic Job Management System 3 Troubleshooting.
Table 7-10 Processes applicable to the restart setting (JP1/AJS3 - Manager in a compatible ISAM configuration)
No. Child process name or detailed process name Extended startup process definition file Process that can be restarted 1 jajs_hstd.exe jp1ajs_service_0700.conf jajs_hstd.exe and detailed process# 2 ajshlogd.exe jp1ajs_hstd_0700.conf ajshlogd.exe 3 ajsinetd.exe jp1ajs_hstd_0700.conf ajsinetd.exe 4 ajsnetwd.exe jp1ajs_hstd_0700.conf ajsnetwd.exe 5 ajsovstatd.exe jp1ajs_hstd_0700.conf ajsovstatd.exe 6 ajsgwmasterd.exe jp1ajs_hstd_0700.conf ajsgwmasterd.exe 7 ajsqlcltd.exe jp1ajs_hstd_0700.conf ajsqlcltd.exe 8 jpqman.exe jp1ajs_hstd_0700.conf jpqman.exe 9 jpomanager.exe jp1ajs_hstd_0700.conf jpomanager.exe 10 ajsmasterd.exe jp1ajs_hstd_0700.conf ajsmasterd.exe 11 jajs_agtd.exe jp1ajs_service_0700.conf jajs_agtd.exe and detailed process# 12 jpqmon.exe jp1ajs_agtd_0700.conf jpqmon.exe 13 jpoagent.exe jp1ajs_agtd_0700.conf jpoagent.exe
- #
- For details about JP1/AJS3 detailed processes, see B. List of Processes in the manual Job Management Partner 1/Automatic Job Management System 3 Troubleshooting.
Table 7-11 Processes applicable to the restart setting (JP1/AJS3 - Agent)
No. Child process name or detailed process name Extended startup process definition file Process that can be restarted 1 jpqmon.exe jp1ajs_service_0700.conf jpqmon.exe 2 jpoagent.exe jp1ajs_service_0700.conf jpoagent.exe
- In UNIX:
Table 7-12 Processes applicable to the restart setting (JP1/AJS3 - Manager in a standard configuration)
No. Child process name or detailed process name Extended startup process definition file Process that can be restarted 1 jajs_dbmd jp1ajs_service_0700.conf jajs_dbmd 2 jajs_hstd jp1ajs_service_0700.conf jajs_hstd and detailed process# 3 ajshlogd jp1ajs_hstd_0700.conf ajshlogd 4 ajsinetd jp1ajs_hstd_0700.conf ajsinetd 5 ajsnetwd jp1ajs_hstd_0700.conf ajsnetwd 6 ajsagtmd jp1ajs_hstd_0700.conf ajsagtmd 7 ajsovstatd jp1ajs_hstd_0700.conf ajsovstatd 8 ajsgwmasterd jp1ajs_hstd_0700.conf ajsgwmasterd 9 jpqman jp1ajs_hstd_0700.conf jpqman 10 jpomanager jp1ajs_hstd_0700.conf jpomanager 11 jajs_schd jp1ajs_service_0700.conf jajs_schdhstd and detailed process# 12 ajslogd jp1ajs_schd_0700.conf ajslogd 13 jpqman jp1ajs_schd_0700.conf jpqman 14 jpomanager jp1ajs_schd_0700.conf jpomanager 15 ajsmasterd jp1ajs_schd_0700.conf ajsmasterd 16 jajs_agtd jp1ajs_service_0700.conf jajs_agtdhstd and detailed process# 17 jpqmon jp1ajs_agtd_0700.conf jpqmon 18 jpoagent jp1ajs_agtd_0700.conf jpoagent
- #
- For details about JP1/AJS3 detailed processes, see B. List of Processes in the manual Job Management Partner 1/Automatic Job Management System 3 Troubleshooting.
Table 7-13 Processes applicable to the restart setting (JP1/AJS3 - Manager in a compatible ISAM configuration)
No. Child process name or detailed process name Extended startup process definition file Process that can be restarted 1 jajs_hstd jp1ajs_service_0700.conf jajs_hstd and detailed process# 2 ajshlogd jp1ajs_hstd_0700.conf ajshlogd 3 ajsinetd jp1ajs_hstd_0700.conf ajsinetd 4 ajsnetwd jp1ajs_hstd_0700.conf ajsnetwd 5 ajsovstatd jp1ajs_hstd_0700.conf ajsovstatd 6 ajsgwmasterd jp1ajs_hstd_0700.conf ajsgwmasterd 7 jpqman jp1ajs_hstd_0700.conf jpqman 8 jpomanager jp1ajs_hstd_0700.conf jpomanager 9 ajsmasterd jp1ajs_hstd_0700.conf ajsmasterd 10 jajs_agtd jp1ajs_service_0700.conf jajs_agtd and detailed process# 11 jpqmon jp1ajs_agtd_0700.conf jpqmon 12 jpoagent jp1ajs_agtd_0700.conf jpoagent
- #
- For details about JP1/AJS3 detailed processes, see B. List of Processes in the manual Job Management Partner 1/Automatic Job Management System 3 Troubleshooting.
Table 7-14 Processes applicable to the restart setting (JP1/AJS3 - Agent)
No. Child process name or detailed process name Extended startup process definition file Process that can be restarted 1 jpqmon jp1ajs_service_0700.conf jpqmon 2 jpoagent jp1ajs_service_0700.conf jpoagent The following shows the definition file format.
- In JP1/AJS3 - Manager:
- process-name|path|startup-option|whether-to-restart|restart-count|retry-interval|retry-count-reset-time|type|scheduler-flag|start-sequence|auto-start|stop-path|stop-option|status-check-path|status-check-option|status-check-return-code|status-check-interval|
- In JP1/AJS3 - Agent:
- process-name|path|startup-option|whether-to-restart|restart-count|retry-interval|retry-count-reset-time|
The definition file contains pre-defined information. You can change the values of the whether-to-restart, restart-count, retry-interval, and retry-count-reset-time fields. Do not change any other fields, which are used by the system. You cannot omit the vertical bar (|) that delimits fields. If you want to insert a comment line, begin the line with a hash mark (#). The line up to the linefeed is assumed to be a comment line.
The following table lists the values that can be specified for the variable fields.
Table 7-15 Values that can be specified for the variable fields
Field name Description whether-to-restart Specify whether to restart a process when it has terminated abnormally. Specify 0 if the process is not to be restarted. Specify 1 to restart the process. An appropriate value is set by default. restart-count Specify the number of times a restart of a process is attempted. You can specify a value in the range from 0 to 99. An appropriate value is initially set for each process. Customize this value according to the operating mode. If 0 is set for the whether-to-restart field, the restart-count field is disabled regardless of whether a value is specified. retry-interval Specify the interval in seconds at which a process restart is attempted. You can specify a value in the range from 0 to 3,600. An appropriate value is initially set for each process. Customize this value according to the operating mode. If 0 is set for the whether-to-restart field, the retry-interval field is disabled regardless of whether a value is specified. retry-count-reset-time Specify the period of time (hours converted to seconds) that can elapse from the time a process is restarted until the time the restart count is reset. When the specified time elapses after the process is started, the retry count is reset. If the process abnormally terminates again, the restart count starts again from 1.
If a process is restarted and then abnormally terminates again before the specified time expires, the previous restart count is inherited. You can specify a value in the range from 3,600 and 2,147,483,647 (seconds). An appropriate value is initially set for each process. Customize this value according to the operating mode. If 0 is set for the whether-to-restart field, the retry-count-reset-time field is disabled regardless of whether a value is specified.
- Cautionary notes
- If you attempt to start a process without a value specified or with an incorrect value specified, an error occurs and the process will not start.
- When you start a process managed by a logical host in a cluster configuration, if the conf folder on the logical host does not contain the extended startup process definition file, the file is copied from the physical host.
- When the ajsmasterd child process is restarted, any jobnets and jobs that were running before the restart are interrupted. The status of the jobnets and jobs after ajsmasterd is restarted depends on the start mode of the JP1/AJS3 service. For details about the status of jobnets and jobs for the service start mode, see 7.2.1(3) Jobnet and job statuses for each start mode.
- When a process is restarted, the following message might be output to the integrated trace log: KNAD3737-E The JP1/AJS3 management-target-process-name terminated abnormally. This might occur when a process is restarted too quickly after it has terminated abnormally. In such cases, the restart operation starts before the complete stop of the abnormally terminated process and the child processes of that process, and a double startup is detected. Because a restart of the process is attempted the specified number of times until the process restarts, there is no problem if the KNAD3737-E error message is output. However, you can suppress output of this message by increasing the retry interval in the extended startup process definition file. The possibility of this problem occurring increases on low-performance computers. If necessary, customize the retry interval.
Note that increasing the retry interval might increase the time required for restarting the JP1/AJS3 service. Therefore, do not specify too large a value for the retry count. The recommended value is 10 seconds.
The following tables describe the default values of the restart settings.
In Windows:
Table 7-16 Default values of the restart settings (JP1/AJS3 - Manager in a standard configuration)
No. Child process name or detailed process name whether-to-restart restart-count retry-interval retry-count-reset-time 1 jajs_dbmd.exe No 3 3 21600 2 jajs_hstd.exe Yes 3 20 21600 3 ajshlogd.exe Yes 3 3 21600 4 ajsinetd.exe Yes 3 3 21600 5 ajsnetwd.exe Yes 3 3 21600 6 ajsagtmd.exe Yes 3 3 21600 7 ajsovstatd.exe Yes 3 3 21600 8 ajsgwmasterd.exe Yes 3 3 21600 9 ajsqlcltd.exe Yes 3 3 21600 10 jpqman.exe Yes 3 3 21600 11 jpomanager.exe Yes 3 3 21600 12 jajs_schd.exe Yes 3 10 21600 13 ajslogd.exe No 0 0 21600 14 jpqman.exe No 0 0 21600 15 jpomanager.exe No 0 0 21600 16 ajsmasterd.exe No 0 0 21600 17 jajs_agtd.exe Yes 3 3 21600 18 jpqmon.exe Yes 3 3 21600 19 jpoagent.exe Yes 3 3 21600
Table 7-17 Default values of the restart settings (JP1/AJS3 - Manager in a compatible ISAM configuration)
No. Child process name or detailed process name whether-to-restart restart-count retry-interval retry-count-reset-time 1 jajs_hstd.exe Yes 3 20 21600 2 ajshlogd.exe Yes 3 3 21600 3 ajsinetd.exe Yes 3 3 21600 4 ajsnetwd.exe Yes 3 3 21600 5 ajsovstatd.exe Yes 3 3 21600 6 ajsgwmasterd.exe Yes 3 3 21600 7 ajsqlcltd.exe Yes 3 3 21600 8 jpqman.exe Yes 3 3 21600 9 jpomanager.exe Yes 3 3 21600 10 ajsmasterd.exe No 3 3 21600 11 jajs_agtd.exe Yes 3 3 21600 12 jpqmon.exe Yes 3 3 21600 13 jpoagent.exe Yes 3 3 21600
Table 7-18 Default values of the restart settings (JP1/AJS3 - Agent)
No. Child process name or detailed process name whether-to-restart restart-count retry-interval retry-count-reset-time 1 jpqmon.exe Yes 3 3 21600 2 jpoagent.exe Yes 3 3 21600
In UNIX:
Table 7-19 Default values of the restart settings (JP1/AJS3 - Manager in a standard configuration)
No. Child process name or detailed process name whether-to-restart restart-count retry-interval retry-count-reset-time 1 jajs_hstd Yes 3 20 21600 2 ajshlogd Yes 3 3 21600 3 ajsinetd Yes 3 3 21600 4 ajsnetwd Yes 3 3 21600 5 ajsagtmd Yes 3 3 21600 6 ajsovstatd Yes 3 3 21600 7 ajsgwmasterd Yes 3 3 21600 8 jpqman Yes 3 3 21600 9 jpomanager Yes 3 3 21600 10 jajs_schd Yes 3 10 21600 11 ajslogd No 0 0 21600 12 jpqman No 0 0 21600 13 jpomanager No 0 0 21600 14 ajsmasterd No 0 0 21600 15 jajs_agtd Yes 3 3 21600 16 jpqmon Yes 3 3 21600 17 jpoagent Yes 3 3 21600
Table 7-20 Default values of the restart settings (JP1/AJS3 - Manager in a compatible ISAM configuration)
No. Child process name or detailed process name whether-to-restart restart-count retry-interval retry-count-reset-time 1 jajs_hstd Yes 3 20 21600 2 ajshlogd Yes 3 3 21600 3 ajsinetd Yes 3 3 21600 4 ajsnetwd Yes 3 3 21600 5 ajsovstatd Yes 3 3 21600 6 ajsgwmasterd Yes 3 3 21600 7 jpqman Yes 3 3 21600 8 jpomanager Yes 3 3 21600 9 ajsmasterd No 3 3 21600 10 jajs_agtd Yes 3 3 21600 11 jpqmon Yes 3 3 21600 12 jpoagent Yes 3 3 21600
Table 7-21 Default values of the restart settings (JP1/AJS3 - Agent)
No. Child process name or detailed process name whether-to-restart restart-count retry-interval retry-count-reset-time 1 jpqmon Yes 3 3 21600 2 jpoagent Yes 3 3 21600
The default values of the restart settings have been set to the most appropriate values after taking into account the characteristics of each process. The following describes the characteristics of the processes:
- The jajs_dbmd.exe and jajs_dbmd processes manage the starting and stopping of the embedded database. Because the embedded database has its own restart functionality, it is not necessary to restart these processes in JP1/AJS3.
- A long retry interval is set for the jajs_hstd.exe and jajs_hstd processes because these processes manage many processes that require a long time to stop.
- A long retry interval is set for the jajs_schd.exe and jajs_schd processes because these processes manage processes that require a long time to stop.
- You cannot start just some of the detailed processes of the jajs_schd.exe and jajs_schd processes because the detailed processes must be started in sync with one other. The settings have been specified so that if some detailed processes terminate abnormally, the jajs_schd.exe and jajs_schd processes are restarted.
- The ajsmasterd.exe and ajsmasterd processes in a compatible ISAM configuration do not need to be restarted in JP1/AJS3 because the ajsmasterd process has its own restart functionality.
- Organization of this subsection
- (1) Changing the Dr. Watson settings (Windows Server 2003 only)
- (2) Setting example
(1) Changing the Dr. Watson settings (Windows Server 2003 only)
In Windows, when a process is automatically restarted, an application error occurs in Dr. Watson and a message box appears. If this message box appears, restart is not enabled. To successfully perform automatic restart, you must suppress the error notification by the message box.
Note that when error notification by the message box is suppressed, the message box will not appear even if an error occurs in other applications.
To change the settings of Dr. Watson:
- From the Windows Start menu, choose Run.
- In the text box, type drwtsn32, and click the OK button.
The Dr. Watson dialog box opens.
- Clear the Visual Notification check box.
- Click the OK button.
(2) Setting example
The following shows an example of settings in the extended startup process definition file, and the operation performed when a process terminates abnormally.
This example assumes that the following conditions have been set for JP1/AJS3 child processes:
whether-to-restart: 1 (Restart the process) restart-count: 4 retry-interval: 3 (seconds) restart-count-reset-time: 3,600 (seconds)Figure 7-1 Example of settings in the extended startup process definition file
The following shows an example of the operation performed when a process terminates abnormally.
Figure 7-2 Example of the operation performed when a process terminates abnormally
In the above example, if the process does not abnormally terminate before 3,600 seconds (specified for restart-count) has elapsed since the process was restarted, the restart count is reset. Therefore, if the process abnormally terminates again, the restart count starts from 1. However, if the process abnormally terminates within 3,600 seconds after it was restarted, the restart count is incremented. If the restart count then reaches the specified count value, no attempts are made to restart the process even if it terminates abnormally.
Copyright (C) 2009, 2010, Hitachi, Ltd.
Copyright (C) 2009, 2010, Hitachi Solutions, Ltd.