Hitachi

Job Management Partner 1 Version 10 Job Management Partner 1/Automatic Job Management System 3 Administration Guide


7.3.1 Restarting an abnormally terminated JP1/AJS3 process

When JP1/AJS3 starts, multiple processes are generated. You can set up JP1/AJS3 - Manager and JP1/AJS3 - Agent to automatically restart a process that has terminated abnormally for any reason.

The restart setting described here applies to JP1/AJS3 that is not operating in a cluster system. If you want to automatically restart processes in a cluster system, use cluster software. For details, see 11.1 Overview of cluster systems. Note that if you specify the -HA option on a logical host, the restart settings are disabled.

Automatic restarting is set in the extended startup process definition file. We recommend using the default values. In JP1/AJS3, the processes that can be recovered by a restart are set to restart by default (in JP1/AJS2, no processes are restarted by default). For details about the default values of the restart settings for each process, see Tables 7-17 to 7-22.

To change the settings, edit the extended startup process definition file, and then restart JP1/Base and JP1/AJS3. In Windows Server 2003, you will need to change the Dr. Watson settings, as described in (1) Changing the Dr. Watson settings (Windows Server 2003 only).

The extended startup process definition file is in the following location.

In Windows:

JP1/AJS3-installation-folder\conf

In UNIX:

/etc/opt/jp1ajs2/conf

The tables below list the processes to which the restart setting applies. Only the child processes and detailed processes of the JP1/AJS3 - Manager and JP1/AJS3 - Agent services listed in these tables are applicable. You cannot specify the restart setting for any process other than those listed in Tables 7-10 to 7-15.

In Windows:
Table 7‒10: Processes applicable to the restart setting (JP1/AJS3 - Manager in a standard configuration)

| No. | Child process name or detailed process name | Extended startup process definition file | Process that can be restarted |
|---|---|---|---|
| 1 | jajs_dbmd.exe | jp1ajs_service_0700.conf | jajs_dbmd.exe and detailed process# |
| 2 | ajsdbmgrd.exe | jp1ajs_dbmd_0700.conf | ajsdbmgrd.exe |
| 3 | jajs_hstd.exe | jp1ajs_service_0700.conf | jajs_hstd.exe and detailed process# |
| 4 | ajshlogd.exe | jp1ajs_hstd_0700.conf | ajshlogd.exe |
| 5 | ajsinetd.exe | jp1ajs_hstd_0700.conf | ajsinetd.exe |
| 6 | ajsnetwd.exe | jp1ajs_hstd_0700.conf | ajsnetwd.exe |
| 7 | ajsagtmd.exe | jp1ajs_hstd_0700.conf | ajsagtmd.exe |
| 8 | ajsovstatd.exe | jp1ajs_hstd_0700.conf | ajsovstatd.exe |
| 9 | ajsgwmasterd.exe | jp1ajs_hstd_0700.conf | ajsgwmasterd.exe |
| 10 | ajsqlcltd.exe | jp1ajs_hstd_0700.conf | ajsqlcltd.exe |
| 11 | jpqman.exe | jp1ajs_hstd_0700.conf | jpqman.exe |
| 12 | jpomanager.exe | jp1ajs_hstd_0700.conf | jpomanager.exe |
| 13 | ajscdinetd.exe | jp1ajs_hstd_0700.conf | ajscdinetd.exe |
| 14 | jajs_schd.exe | jp1ajs_service_0700.conf | jajs_schd.exe and detailed process# |
| 15 | ajslogd.exe | jp1ajs_schd_0700.conf | ajslogd.exe |
| 16 | jpqman.exe | jp1ajs_schd_0700.conf | jpqman.exe |
| 17 | jpomanager.exe | jp1ajs_schd_0700.conf | jpomanager.exe |
| 18 | ajsmasterd.exe | jp1ajs_schd_0700.conf | ajsmasterd.exe |
| 19 | jajs_agtd.exe | jp1ajs_service_0700.conf | jajs_agtd.exe and detailed process# |
| 20 | jpqmon.exe | jp1ajs_agtd_0700.conf | jpqmon.exe |
| 21 | jpoagent.exe | jp1ajs_agtd_0700.conf | jpoagent.exe |

#: For details about JP1/AJS3 detailed processes, see B. List of Processes in the manual Job Management Partner 1/Automatic Job Management System 3 Troubleshooting.

Table 7‒11: Processes applicable to the restart setting (JP1/AJS3 - Manager in a compatible ISAM configuration)

| No. | Child process name or detailed process name | Extended startup process definition file | Process that can be restarted |
|---|---|---|---|
| 1 | jajs_hstd.exe | jp1ajs_service_0700.conf | jajs_hstd.exe and detailed process# |
| 2 | ajshlogd.exe | jp1ajs_hstd_0700.conf | ajshlogd.exe |
| 3 | ajsinetd.exe | jp1ajs_hstd_0700.conf | ajsinetd.exe |
| 4 | ajsnetwd.exe | jp1ajs_hstd_0700.conf | ajsnetwd.exe |
| 5 | ajsovstatd.exe | jp1ajs_hstd_0700.conf | ajsovstatd.exe |
| 6 | ajsgwmasterd.exe | jp1ajs_hstd_0700.conf | ajsgwmasterd.exe |
| 7 | ajsqlcltd.exe | jp1ajs_hstd_0700.conf | ajsqlcltd.exe |
| 8 | jpqman.exe | jp1ajs_hstd_0700.conf | jpqman.exe |
| 9 | jpomanager.exe | jp1ajs_hstd_0700.conf | jpomanager.exe |
| 10 | ajscdinetd.exe | jp1ajs_hstd_0700.conf | ajscdinetd.exe |
| 11 | ajsmasterd.exe | jp1ajs_hstd_0700.conf | ajsmasterd.exe |
| 12 | jajs_agtd.exe | jp1ajs_service_0700.conf | jajs_agtd.exe and detailed process# |
| 13 | jpqmon.exe | jp1ajs_agtd_0700.conf | jpqmon.exe |
| 14 | jpoagent.exe | jp1ajs_agtd_0700.conf | jpoagent.exe |

#: For details about JP1/AJS3 detailed processes, see B. List of Processes in the manual Job Management Partner 1/Automatic Job Management System 3 Troubleshooting.

Table 7‒12: Processes applicable to the restart setting (JP1/AJS3 - Agent)

| No. | Child process name or detailed process name | Extended startup process definition file | Process that can be restarted |
|---|---|---|---|
| 1 | jpqmon.exe | jp1ajs_service_0700.conf | jpqmon.exe |
| 2 | jpoagent.exe | jp1ajs_service_0700.conf | jpoagent.exe |

In UNIX:
Table 7‒13: Processes applicable to the restart setting (JP1/AJS3 - Manager in a standard configuration)

| No. | Child process name or detailed process name | Extended startup process definition file | Process that can be restarted |
|---|---|---|---|
| 1 | jajs_dbmd | jp1ajs_service_0700.conf | jajs_dbmd and detailed process# |
| 2 | ajsdbmgrd | jp1ajs_dbmd_0700.conf | ajsdbmgrd |
| 3 | jajs_hstd | jp1ajs_service_0700.conf | jajs_hstd and detailed process# |
| 4 | ajshlogd | jp1ajs_hstd_0700.conf | ajshlogd |
| 5 | ajsinetd | jp1ajs_hstd_0700.conf | ajsinetd |
| 6 | ajsnetwd | jp1ajs_hstd_0700.conf | ajsnetwd |
| 7 | ajsagtmd | jp1ajs_hstd_0700.conf | ajsagtmd |
| 8 | ajsovstatd | jp1ajs_hstd_0700.conf | ajsovstatd |
| 9 | ajsgwmasterd | jp1ajs_hstd_0700.conf | ajsgwmasterd |
| 10 | jpqman | jp1ajs_hstd_0700.conf | jpqman |
| 11 | jpomanager | jp1ajs_hstd_0700.conf | jpomanager |
| 12 | ajscdinetd | jp1ajs_hstd_0700.conf | ajscdinetd |
| 13 | jajs_schd | jp1ajs_service_0700.conf | jajs_schd and detailed process# |
| 14 | ajslogd | jp1ajs_schd_0700.conf | ajslogd |
| 15 | jpqman | jp1ajs_schd_0700.conf | jpqman |
| 16 | jpomanager | jp1ajs_schd_0700.conf | jpomanager |
| 17 | ajsmasterd | jp1ajs_schd_0700.conf | ajsmasterd |
| 18 | jajs_agtd | jp1ajs_service_0700.conf | jajs_agtd and detailed process# |
| 19 | jpqmon | jp1ajs_agtd_0700.conf | jpqmon |
| 20 | jpoagent | jp1ajs_agtd_0700.conf | jpoagent |

#: For details about JP1/AJS3 detailed processes, see B. List of Processes in the manual Job Management Partner 1/Automatic Job Management System 3 Troubleshooting.

Table 7‒14: Processes applicable to the restart setting (JP1/AJS3 - Manager in a compatible ISAM configuration)

| No. | Child process name or detailed process name | Extended startup process definition file | Process that can be restarted |
|---|---|---|---|
| 1 | jajs_hstd | jp1ajs_service_0700.conf | jajs_hstd and detailed process# |
| 2 | ajshlogd | jp1ajs_hstd_0700.conf | ajshlogd |
| 3 | ajsinetd | jp1ajs_hstd_0700.conf | ajsinetd |
| 4 | ajsnetwd | jp1ajs_hstd_0700.conf | ajsnetwd |
| 5 | ajsovstatd | jp1ajs_hstd_0700.conf | ajsovstatd |
| 6 | ajsgwmasterd | jp1ajs_hstd_0700.conf | ajsgwmasterd |
| 7 | jpqman | jp1ajs_hstd_0700.conf | jpqman |
| 8 | jpomanager | jp1ajs_hstd_0700.conf | jpomanager |
| 9 | ajscdinetd | jp1ajs_hstd_0700.conf | ajscdinetd |
| 10 | ajsmasterd | jp1ajs_hstd_0700.conf | ajsmasterd |
| 11 | jajs_agtd | jp1ajs_service_0700.conf | jajs_agtd and detailed process# |
| 12 | jpqmon | jp1ajs_agtd_0700.conf | jpqmon |
| 13 | jpoagent | jp1ajs_agtd_0700.conf | jpoagent |

#: For details about JP1/AJS3 detailed processes, see B. List of Processes in the manual Job Management Partner 1/Automatic Job Management System 3 Troubleshooting.

Table 7‒15: Processes applicable to the restart setting (JP1/AJS3 - Agent)

| No. | Child process name or detailed process name | Extended startup process definition file | Process that can be restarted |
|---|---|---|---|
| 1 | jpqmon | jp1ajs_service_0700.conf | jpqmon |
| 2 | jpoagent | jp1ajs_service_0700.conf | jpoagent |

The following shows the definition file format.

In JP1/AJS3 - Manager:

process-name|path|startup-option|whether-to-restart|restart-count|retry-interval|retry-count-reset-time|type|scheduler-flag|start-sequence|auto-start|stop-path|stop-option|status-check-path|status-check-option|status-check-return-code|status-check-interval|

In JP1/AJS3 - Agent:

process-name|path|startup-option|whether-to-restart|restart-count|retry-interval|retry-count-reset-time|

The definition file contains pre-defined information. You can change the values of the whether-to-restart, restart-count, retry-interval, and retry-count-reset-time fields. Do not change any other fields, which are used by the system. You cannot omit the vertical bar (|) that delimits fields. If you want to insert a comment line, begin the line with a hash mark (#). The line up to the linefeed is assumed to be a comment line.
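The rules above (vertical-bar delimiters, comment lines starting with a hash mark, and only four editable fields) can be illustrated with a minimal Python sketch. It is not part of JP1/AJS3. The function names and the sample line, including its path and empty startup option, are hypothetical, and the sketch assumes that every line ends with a trailing vertical bar, as in the formats shown above.

```python
def parse_line(line):
    """Return the fields of one definition line, or None for comment/blank lines."""
    text = line.rstrip("\r\n")
    if not text.strip() or text.startswith("#"):
        return None
    if not text.endswith("|"):
        raise ValueError("each field, including the last, must end with '|'")
    return text[:-1].split("|")


def set_restart_fields(line, whether, count, interval, reset_time):
    """Rewrite only fields 4 to 7 (the editable ones); keep all other fields."""
    fields = parse_line(line)
    if fields is None:
        return line.rstrip("\r\n")  # comment or blank line: leave as is
    if len(fields) < 7:
        raise ValueError("expected at least 7 fields")
    fields[3:7] = [str(whether), str(count), str(interval), str(reset_time)]
    return "|".join(fields) + "|"


# Hypothetical Agent-format line; the path is a placeholder, not a real default.
sample = "jpqmon|C:\\example\\bin\\jpqmon.exe||1|3|3|21600|"
print(set_restart_fields(sample, 1, 4, 10, 3600))
```

Because the function replaces fields by position, it works for both the 7-field Agent format and the 17-field Manager format without touching the system-managed fields.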

The following table lists the values that can be specified for the variable fields.

Table 7‒16: Values that can be specified for the variable fields

Field name

Description

whether-to-restart

Specify whether to restart a process when it has terminated abnormally. Specify 0 if the process is not to be restarted. Specify 1 to restart the process. An appropriate value is set by default.

restart-count

Specify the number of times a restart of a process is attempted. You can specify a value in the range from 0 to 99. An appropriate value is initially set for each process. Customize this value according to the operating mode. If 0 is set for the whether-to-restart field, the restart-count field is disabled regardless of whether a value is specified.

retry-interval

Specify the interval in seconds at which a process restart is attempted. You can specify a value in the range from 0 to 3,600. An appropriate value is initially set for each process. Customize this value according to the operating mode. If 0 is set for the whether-to-restart field, the retry-interval field is disabled regardless of whether a value is specified.

retry-count-reset-time

Specify the period of time (in seconds) that must elapse after a process is restarted before the restart count is reset. When the specified time elapses after the process is started, the restart count is reset. If the process then abnormally terminates again, the restart count starts again from 1.

If a process is restarted and then abnormally terminates again before the specified time expires, the previous restart count is inherited. You can specify a value in the range from 3,600 to 2,147,483,647 (seconds). An appropriate value is initially set for each process. Customize this value according to the operating mode. If 0 is set for the whether-to-restart field, the retry-count-reset-time field is disabled regardless of whether a value is specified.
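As an illustration of the ranges in Table 7-16, the following Python sketch checks a set of values before they are written to the file. The function name is hypothetical. The sketch checks all four fields even when whether-to-restart is 0, which is a conservative assumption made here because an incorrect value prevents the process from starting.

```python
# Permitted ranges taken from Table 7-16 (inclusive bounds).
RANGES = {
    "restart-count": (0, 99),
    "retry-interval": (0, 3600),
    "retry-count-reset-time": (3600, 2147483647),
}


def check_restart_settings(whether, count, interval, reset_time):
    """Return a list of problems; an empty list means all values are in range."""
    problems = []
    if whether not in (0, 1):
        problems.append("whether-to-restart must be 0 or 1")
    values = {
        "restart-count": count,
        "retry-interval": interval,
        "retry-count-reset-time": reset_time,
    }
    for name, value in values.items():
        low, high = RANGES[name]
        if not low <= value <= high:
            problems.append(f"{name} must be between {low} and {high}")
    return problems


print(check_restart_settings(1, 3, 3, 21600))  # values in range
print(check_restart_settings(1, 100, 3, 600))  # two values out of range
```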

Cautionary notes
  • If you attempt to start a process without a value specified or with an incorrect value specified, an error occurs and the process will not start.

  • When you start a process managed by a logical host in a cluster configuration, if the conf folder on the logical host does not contain the extended startup process definition file, the file is copied from the physical host.

  • When the ajsmasterd child process is restarted, any jobnets and jobs that were running before the restart are interrupted. The status of the jobnets and jobs after ajsmasterd is restarted depends on the start mode of the JP1/AJS3 service. For details about the status of jobnets and jobs for the service start mode, see 7.2.1(3) Jobnet and job statuses for each start mode.

  • When a process is restarted, the following message might be output to the integrated trace log: KNAD3737-E The JP1/AJS3 management-target-process-name terminated abnormally. This might occur when a process is restarted too quickly after it has terminated abnormally. In such cases, the restart operation starts before the complete stop of the abnormally terminated process and the child processes of that process, and a double startup is detected. Because a restart of the process is attempted the specified number of times until the process restarts, there is no problem if the KNAD3737-E error message is output. However, you can suppress output of this message by increasing the retry interval in the extended startup process definition file. The possibility of this problem occurring increases on low-performance computers. If necessary, customize the retry interval.

    Note that increasing the retry interval might increase the time required for restarting the JP1/AJS3 service. Therefore, do not specify too large a value for the retry interval. The recommended value is 10 seconds.

  • If JP1/AJS3 child processes restart, their detailed processes also restart. As a result, the restart counts of the detailed processes are reset.
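To get a rough feel for how the retry interval affects restart time, the following Python sketch estimates an upper bound on the time spent waiting between restart attempts. It assumes one retry-interval wait per attempt and ignores the time each process needs to start. This is a simplification made for illustration, not documented JP1/AJS3 behavior, and the function name is hypothetical.

```python
def max_retry_wait(restart_count, retry_interval):
    """Upper bound, in seconds, on waiting time across all restart attempts.

    Assumes one retry-interval wait per attempt and no startup time.
    """
    return restart_count * retry_interval


print(max_retry_wait(3, 3))   # restart-count 3, retry-interval 3 seconds
print(max_retry_wait(3, 10))  # restart-count 3, retry-interval 10 seconds
```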

The following tables describe the default values of the restart settings.

In Windows:

Table 7‒17: Default values of the restart settings (JP1/AJS3 - Manager in a standard configuration)

| No. | Child process name or detailed process name | whether-to-restart | restart-count | retry-interval | retry-count-reset-time |
|---|---|---|---|---|---|
| 1 | jajs_dbmd.exe | Yes | 3 | 3 | 21600 |
| 2 | ajsdbmgrd.exe | No | 3 | 3 | 21600 |
| 3 | jajs_hstd.exe | Yes | 3 | 20 | 21600 |
| 4 | ajshlogd.exe | Yes | 3 | 3 | 21600 |
| 5 | ajsinetd.exe | Yes | 3 | 3 | 21600 |
| 6 | ajsnetwd.exe | Yes | 3 | 3 | 21600 |
| 7 | ajsagtmd.exe | Yes | 3 | 3 | 21600 |
| 8 | ajsovstatd.exe | Yes | 3 | 3 | 21600 |
| 9 | ajsgwmasterd.exe | Yes | 3 | 3 | 21600 |
| 10 | ajsqlcltd.exe | Yes | 3 | 3 | 21600 |
| 11 | jpqman.exe | Yes | 3 | 3 | 21600 |
| 12 | jpomanager.exe | Yes | 3 | 3 | 21600 |
| 13 | ajscdinetd.exe | Yes | 3 | 3 | 21600 |
| 14 | jajs_schd.exe | Yes | 3 | 10 | 21600 |
| 15 | ajslogd.exe | No | 3 | 3 | 21600 |
| 16 | jpqman.exe | No | 3 | 3 | 21600 |
| 17 | jpomanager.exe | No | 3 | 3 | 21600 |
| 18 | ajsmasterd.exe | No | 3 | 3 | 21600 |
| 19 | jajs_agtd.exe | Yes | 3 | 3 | 21600 |
| 20 | jpqmon.exe | Yes | 3 | 3 | 21600 |
| 21 | jpoagent.exe | Yes | 3 | 3 | 21600 |

Table 7‒18: Default values of the restart settings (JP1/AJS3 - Manager in a compatible ISAM configuration)

| No. | Child process name or detailed process name | whether-to-restart | restart-count | retry-interval | retry-count-reset-time |
|---|---|---|---|---|---|
| 1 | jajs_hstd.exe | Yes | 3 | 20 | 21600 |
| 2 | ajshlogd.exe | Yes | 3 | 3 | 21600 |
| 3 | ajsinetd.exe | Yes | 3 | 3 | 21600 |
| 4 | ajsnetwd.exe | Yes | 3 | 3 | 21600 |
| 5 | ajsovstatd.exe | Yes | 3 | 3 | 21600 |
| 6 | ajsgwmasterd.exe | Yes | 3 | 3 | 21600 |
| 7 | ajsqlcltd.exe | Yes | 3 | 3 | 21600 |
| 8 | jpqman.exe | Yes | 3 | 3 | 21600 |
| 9 | jpomanager.exe | Yes | 3 | 3 | 21600 |
| 10 | ajscdinetd.exe | Yes | 3 | 3 | 21600 |
| 11 | ajsmasterd.exe | No | 3 | 3 | 21600 |
| 12 | jajs_agtd.exe | Yes | 3 | 3 | 21600 |
| 13 | jpqmon.exe | Yes | 3 | 3 | 21600 |
| 14 | jpoagent.exe | Yes | 3 | 3 | 21600 |

Table 7‒19: Default values of the restart settings (JP1/AJS3 - Agent)

| No. | Child process name or detailed process name | whether-to-restart | restart-count | retry-interval | retry-count-reset-time |
|---|---|---|---|---|---|
| 1 | jpqmon.exe | Yes | 3 | 3 | 21600 |
| 2 | jpoagent.exe | Yes | 3 | 3 | 21600 |

In UNIX:

Table 7‒20: Default values of the restart settings (JP1/AJS3 - Manager in a standard configuration)

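| No. | Child process name or detailed process name | whether-to-restart | restart-count | retry-interval | retry-count-reset-time |
|---|---|---|---|---|---|
| 1 | jajs_dbmd | Yes | 3 | 3 | 21600 |
| 2 | ajsdbmgrd | No | 3 | 3 | 21600 |
| 3 | jajs_hstd | Yes | 3 | 20 | 21600 |
| 4 | ajshlogd | Yes | 3 | 3 | 21600 |
| 5 | ajsinetd | Yes | 3 | 3 | 21600 |
| 6 | ajsnetwd | Yes | 3 | 3 | 21600 |
| 7 | ajsagtmd | Yes | 3 | 3 | 21600 |
| 8 | ajsovstatd | Yes | 3 | 3 | 21600 |
| 9 | ajsgwmasterd | Yes | 3 | 3 | 21600 |
| 10 | jpqman | Yes | 3 | 3 | 21600 |
| 11 | jpomanager | Yes | 3 | 3 | 21600 |
| 12 | ajscdinetd | Yes | 3 | 3 | 21600 |
| 13 | jajs_schd | Yes | 3 | 10 | 21600 |
| 14 | ajslogd | No | 3 | 3 | 21600 |
| 15 | jpqman | No | 3 | 3 | 21600 |
| 16 | jpomanager | No | 3 | 3 | 21600 |
| 17 | ajsmasterd | No | 3 | 3 | 21600 |
| 18 | jajs_agtd | Yes | 3 | 3 | 21600 |
| 19 | jpqmon | Yes | 3 | 3 | 21600 |
| 20 | jpoagent | Yes | 3 | 3 | 21600 |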

Table 7‒21: Default values of the restart settings (JP1/AJS3 - Manager in a compatible ISAM configuration)

| No. | Child process name or detailed process name | whether-to-restart | restart-count | retry-interval | retry-count-reset-time |
|---|---|---|---|---|---|
| 1 | jajs_hstd | Yes | 3 | 20 | 21600 |
| 2 | ajshlogd | Yes | 3 | 3 | 21600 |
| 3 | ajsinetd | Yes | 3 | 3 | 21600 |
| 4 | ajsnetwd | Yes | 3 | 3 | 21600 |
| 5 | ajsovstatd | Yes | 3 | 3 | 21600 |
| 6 | ajsgwmasterd | Yes | 3 | 3 | 21600 |
| 7 | jpqman | Yes | 3 | 3 | 21600 |
| 8 | jpomanager | Yes | 3 | 3 | 21600 |
| 9 | ajscdinetd | Yes | 3 | 3 | 21600 |
| 10 | ajsmasterd | No | 3 | 3 | 21600 |
| 11 | jajs_agtd | Yes | 3 | 3 | 21600 |
| 12 | jpqmon | Yes | 3 | 3 | 21600 |
| 13 | jpoagent | Yes | 3 | 3 | 21600 |

Table 7‒22: Default values of the restart settings (JP1/AJS3 - Agent)

| No. | Child process name or detailed process name | whether-to-restart | restart-count | retry-interval | retry-count-reset-time |
|---|---|---|---|---|---|
| 1 | jpqmon | Yes | 3 | 3 | 21600 |
| 2 | jpoagent | Yes | 3 | 3 | 21600 |

The default values of the restart settings have been set to the most appropriate values after taking into account the characteristics of each process. The following describes the characteristics of the processes:


(1) Changing the Dr. Watson settings (Windows Server 2003 only)

In Windows, when a process is automatically restarted, an application error occurs in Dr. Watson and a message box appears. If this message box appears, restart is not enabled. To successfully perform automatic restart, you must suppress the error notification by the message box.

Note that when error notification by the message box is suppressed, the message box will not appear even if an error occurs in other applications.

To change the settings of Dr. Watson:

  1. From the Windows Start menu, choose Run.

  2. In the text box, type drwtsn32, and click the OK button.

    The Dr. Watson dialog box opens.

  3. Clear the Visual Notification check box.

  4. Click the OK button.

(2) Setting example

The following shows an example of settings in the extended startup process definition file, and the operation performed when a process terminates abnormally.

This example assumes that the following conditions have been set for JP1/AJS3 child processes:

whether-to-restart: 1 (Restart the process)
restart-count: 4
retry-interval: 3 (seconds)
retry-count-reset-time: 3,600 (seconds)

Figure 7‒1: Example of settings in the extended startup process definition file

[Figure]

The following shows an example of the operation performed when a process terminates abnormally.

Figure 7‒2: Example of the operation performed when a process terminates abnormally

[Figure]

In the above example, if the process does not abnormally terminate before 3,600 seconds (specified for retry-count-reset-time) has elapsed since the process was restarted, the restart count is reset. Therefore, if the process abnormally terminates again, the restart count starts from 1. However, if the process abnormally terminates within 3,600 seconds after it was restarted, the restart count is incremented. If the restart count then reaches the specified count value, no attempts are made to restart the process even if it terminates abnormally.
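The counting behavior described above can be traced with a small Python simulation. This is a sketch of one reading of the description, not the actual JP1/AJS3 logic. The function name and the list of uptimes are hypothetical, whether-to-restart is assumed to be 1, and the retry interval is left out because it does not affect the count.

```python
def simulate_restarts(uptimes, restart_count=4, reset_time=3600):
    """Replay abnormal terminations and report the outcome of each one.

    uptimes: seconds each run lasted before it terminated abnormally.
    Returns the restart count used for each restart, or None where no
    restart is attempted because the limit was reached.
    """
    results = []
    used = 0  # restarts counted since the last reset
    for uptime in uptimes:
        if uptime >= reset_time:
            used = 0  # the process ran long enough: the count is reset
        if used >= restart_count:
            results.append(None)  # limit reached: no restart attempt
            break
        used += 1
        results.append(used)
    return results


# Five quick failures in a row: four restarts, then no further attempt.
print(simulate_restarts([10, 10, 10, 10, 10]))  # [1, 2, 3, 4, None]
# A run of 4,000 seconds resets the count, so counting starts again from 1.
print(simulate_restarts([10, 10, 4000, 10]))    # [1, 2, 1, 2]
```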