Hitachi

Job Management Partner 1 Version 10 Job Management Partner 1/IT Service Level Management Description, User's Guide, Reference and Operator's Guide


7.1.4 Handling failover errors (cluster system)

If failover from the active server to the standby server has failed, take the appropriate corrective action based on the cause of the failover error.

Organization of this subsection

(1) Procedure

To handle a failover error:

  1. Check the cluster software's logs to determine the cause of the failover error.

    The cause of a failover error is one of the following:

    • A Windows service start error occurred on ITSLM's standby server

    • A cluster software error occurred

    If a Windows service start error occurred on ITSLM's standby server, go to step 2; if a cluster software error occurred, go to step 3.

  2. Check the message output to the standby server's event log, integrated trace log, or message log, and eliminate the cause of the Windows service start error on the standby server.

    For details about the output destinations of the event log, integrated trace log, and message log, see 7.1.1 Checking and handling the output messages. For details about the messages, see 11.3 Messages. After you have taken corrective action, go to step 4.

  3. If failover occurred due to a cluster software error, check the cluster software's logs and eliminate the cause of the error.

  4. Start the Windows services for ITSLM from the cluster software.

    For details about the Windows services to be started, see 2.1.1 Starting ITSLM - Manager and 2.1.2 Starting ITSLM - UR.

If the Windows services start successfully on the standby server, handling of the failover error is complete. If the Windows services still do not start successfully on the standby server after the corrective action was taken on the basis of the message, collect the data needed for determining the cause of the error and contact the system administrator. For details about how to collect the data needed for determining the cause, see 7.1.6 Collecting the data needed for determining the cause of a problem.