Hitachi

JP1 Version 12 JP1/Integrated Management 2 - Event Gateway for Network Node Manager i


3.3.2 Registering into the cluster software (in UNIX)

This subsection describes how to register JP1/IM - EG for NNMi and JP1/Base on the logical host into the cluster software in a UNIX environment.

Organization of this subsection

(1) Creating scripts to be registered into the cluster software

When you use UNIX cluster software, you normally create a tool to control applications by a method such as scripts and then you register the scripts into the cluster software. In general, you need scripts to provide the Start, Stop, Operation monitoring, and Forced stop functions.

This subsection describes the JP1/IM - EG for NNMi information that is needed for designing scripts. Use this information to create scripts that control JP1/IM - EG for NNMi according to the cluster software specifications and then register the scripts into the cluster software.

Table 3‒3: Detailed information needed for designing scripts for cluster registration

Function to be registered

Description

Start

Starts JP1/IM - EG for NNMi.

  • Command to be used

    jegn_start.cluster logical-host-name

  • Command termination timing

    The start command waits until JP1/IM - EG for NNMi has started before it terminates itself. However, if startup processing is not completed within the timeout period (60 seconds is the default), the command terminates without completing the startup processing. In such a case, the command terminates with the startup processing still underway (the command does not cancel the startup processing).

  • Checking the command execution result

    You can determine the JP1/IM - EG for NNMi startup result by means of the operation monitoring method described below. Normally, the result is determined by the cluster software's operation monitoring. The return values of the start command are 0 (normal termination) and 1 (argument error); therefore, result details cannot be determined from the return value.

Stop

Terminates JP1/IM - EG for NNMi.

  • Command to be used

    jegn_stop.cluster logical-host-name

  • Command termination timing

    The stop command waits until JP1/IM - EG for NNMi has terminated before it terminates itself. However, if stop processing is not completed within the timeout period (60 seconds is the default), the command terminates without completing the stop processing. In such a case, the command terminates with the stop processing still underway (the command does not cancel the stop processing).

  • Checking the command execution result

    You can determine the JP1/IM - EG for NNMi termination result by means of the operation monitoring method described below. The return values of the stop command are 0 (normal termination) and 1 (argument error); therefore, result details cannot be determined from the return value.

We recommend that you execute the forced termination command described below after the stop command has terminated. This enables you to terminate the process and prevent a failover error even in the event of a problem.

Operation monitoring#

Monitors normal operation of JP1/IM - EG for NNMi.

To determine whether JP1/IM - EG for NNMi is running normally, check the return value of the jegn_spmd_status command. This command determines the status from the operating status of each process.

Note that some cluster software does not provide the operation monitoring function. If there is no need to perform failover in the event of a JP1/IM - EG for NNMi failure, you need not register this function.

  • Command to be used

    jegn_spmd_status -h logical-host-name

  • Checking the result of operation monitoring

    The following describes how to interpret the return value:

    Return value = 0 (all running):

    JP1/IM - EG for NNMi is running normally.

    Return value = 1 (error):

    An unrecoverable error has occurred. Treat this as an error.

    Note:

    If you execute the jegn_spmd_status command on the standby server whose shared disk is offline, the return value will be 1 because the shared disk is not available.

    Return value = 4 (partially stopped):

    Some JP1/IM - EG for NNMi processes are stopped due to a problem. Treat this as an error.

    Return value = 8 (all stopped):

    The JP1/IM - EG for NNMi processes have all stopped due to a problem. Treat this as an error.

    Return value = 12 (retriable error):

    While the jegn_spmd_status command was checking the operating status, an error that can be recovered by retries has occurred. Retry checking of the operating status as many times as specified.

Forced stop

Forcibly terminates JP1/IM - EG for NNMi and releases the current resources.

  • Command to be used

    The jegn_killall.cluster command forcibly terminates each process without performing JP1/IM - EG for NNMi termination processing.

Note:

Before you execute forced stop, use the stop command to terminate JP1/IM - EG for NNMi.

#

With respect to JP1 operation checking, the same commands are used in both UNIX and Windows, but they function differently.

Windows operation differs from UNIX operation due to its association with Windows service control. In Windows, when any process terminates, JP1 process management automatically terminates all processes and places the service in stopped status. Assume such a service stop is the result of an error or assume an error has occurred when a command such as jegn_spmd_status returns a value of 8.

About restart of JP1

In the case of cluster operation, do not restart JP1 by JP1 process management. Restart JP1 by the cluster software.

(2) Setting the resource start and stop sequence

To execute JP1/IM - EG for NNMi and JP1/Base on the logical host, the shared disk and logical IP address must be available for use.

Set the start and stop sequence or dependencies in such a manner that they are controlled by the cluster software as described below.