OpenTP1 Version 7 Description

[Contents][Glossary][Index][Back][Next]

5.3.3 Recovering from file errors

This section describes the procedure when an error occurs in one of the following OpenTP1 files:

For details about recovering from file errors, see the manual OpenTP1 Operation.

Organization of this subsection
(1) Recovering from system file errors
(2) Recovering from queue file errors
(3) Recovering from user file errors

(1) Recovering from system file errors

The following describes the procedure when an error occurs in a system file.

(a) Recovering from system-file errors: status files

OpenTP1 can detect a status file error when the status information is written or read, or when the status service is started. When an error is detected while attempting to read to or write from a status file, the procedure for recovering from such an error depends on:

When an error is detected when attempting to start the status service, the system and user actions depend on which of the following is specified in the status service definition:

(b) Recovering from system-file errors: system journal files

System journal file errors can be generally classified into write errors and read errors. In a write error, the journal to be obtained cannot be written to during online processing. In a read error, the journal cannot be read from during a complete recovery or a UAP transaction partial recovery.

The procedure for recovering from such an error depends on:

Note, however, that when a system journal file is duplicated, the journal is input first from physical file A. If an error occurs during the write, the journal can be input from physical file B, which increases reliability at system recovery.

When a system journal file is duplicated, even if only one of the files can be used as a swap destination, you can specify whether to use it as the swap destination in the system journal service definition.

(c) Recovering from system-file errors: checkpoint dump files

As with system journal file errors, checkpoint dump file errors can be roughly classified into write errors and read errors. In a write error, the checkpoint dump cannot be written to during online processing. In a read error, the checkpoint file cannot be read from during a complete recovery.

If a checkpoint dump is duplicated, errors can be recovered starting from either system A or B at system recovery.

The procedure for recovering from such an error depends on:

(d) Recovering from system-file errors: archive journal files

Archive journal file errors can be classified into write errors and read errors. In a write error you cannot write journal information you are attempting to acquire during online processing. In a read error you cannot read journals during complete recovery.

If an archive journal file is duplicated, data is input from system A. If a write error occurs, input can be switched from system A to system B, increasing the reliability during system recovery.

When a file is duplicated, you can specify in the archive journal service definition whether to allocate the archive journal file as the swap destination even if only one standby physical file can be used.

(e) Recovering from system-file errors: transaction recovery journal files and server recovery journal files

If a failure occurs in a transaction recovery journal file or in a server recovery journal file, a message about the failure is output. In accordance with this message:

  1. Execute the jnlmkrf command to restore the target recovery journal file.

If the recovery journal files are not recovered despite executing the jnlmkrf command:

  1. Restore the resource manager using commands (damfrc or tamfrc)
  2. Forcibly restart the OpenTP1 system.
(f) Recovering from system-file errors: message log files

For a message log file, two files are used with round-robin scheduling. If an error occurs in either of the two files, OpenTP1 uses the normal file only, isolating the erroneous file. If errors occur in both files, OpenTP1 continues processing without outputting a message log.

The procedure for recovering from such an error depends on whether OpenTP1 can detect the error:

(2) Recovering from queue file errors

The following describes the procedure when an error occurs in a queue file.

(a) Recovering from queue-file errors: MCF message queue file

OpenTP1 cannot recover from a read or write error to an MCF message queue file. If an MCF message queue file cannot be opened, OpenTP1 outputs a message and the OpenTP1 administrator must allocate another MCF message queue file.

(b) Recovering from queue-file errors: MQA message queue file

For an action taken if an error occurs on a queue file of TP1/Message Queue, see the error recovery section in the OpenTP1 TP1/Message Queue User's Guide.

(3) Recovering from user file errors

The following describes the procedure when an error occurs in a user data file.

(a) Recovering from user-file errors: DAM files

If a read or write error occurs in a DAM file, the DAM service returns an error to the UAP. When the transaction is completed, the DAM file is shutdown (i.e., access to the DAM file is prevented) because of the error.

(b) Recovering from user-file errors: TAM files

When OpenTP1 detects a read or write error in a TAM file, the TAM file is shutdown: i.e., access to the TAM file is prevented.

(c) Recovering from user-file errors: ISAM files

For an action taken if an error occurs on a Hitachi ISAM file, see the error recovery section in the manual Indexed Sequential Access Method ISAM.