Hitachi

Job Management Partner 1 Version 10 Job Management Partner 1/Automatic Job Management System 3 System Design (Configuration) Guide


5.2.1 Backup and recovery methods

To perform backup and recovery for JP1/AJS3, you need to back up and restore the settings information needed to run JP1/AJS3. The files to be backed up and the timing of a backup differ according to the purpose of the backup and recovery. Table 5-2 describes the types of backup and recovery and their characteristics.

Two methods for backup and recovery are available:
  • Backing up and restoring specific settings information according to the update time.

  • Backing up and restoring the entire system, the shared disk in a cluster configuration, or the entire database (embedded database) in one operation.

After considering the advantages and disadvantages of each type of backup and recovery for the intended purpose, consider the time required for performing backup and recovery and the scope of backup and recovery.

Table 5‒2: Types of JP1/AJS3 backup and recovery and the characteristics of each

Type of backup and recovery

Main purpose

When backup is created

Advantages

Disadvantages

JP1/AJS3 environment settings information

Recovery from a failure such as a disk failure, a corrupted file, or files that are no longer available

When a setting is changed

  • JP1/AJS3 operation does not need to stop.

  • The time at which the backup is created and its scope can be set appropriately as needed.

  • For definition information and schedule information that is changed often, the timing of backups and the number of backups must be considered.

  • Because jobnets must be registered again, resuming work task takes a long time.

JP1/AJS3 definition information:

  • Definition information for jobs and jobnets

  • Calendar and schedule information

  • Execution agent information

  • Execution environment for QUEUE jobs and submit jobs

  • Recovery from a failure such as a disk failure, a corrupted file, or files that are no longer available

  • Distribution to other computers or scheduler services

Registered execution-schedule information for jobnets#1

Recovery from a failure such as a disk failure, a corrupted file, or files that are no longer available

When a jobnet is registered

Because the registration status of jobnets can be restored in one operation after recovery, the re-registration of individual jobnets is not required, reducing the workload before operation starts.

Jobnets registered for immediate execution are not included.

Entire system

Maintenance or migration of the entire system

When the entire system is migrated#2

Information can be backed up or restored in one operation.

  • JP1/AJS3 must be stopped during maintenance.

  • After recovery, JP1/AJS3 must be cold-started.#3

  • Because the jobnets must be registered again, time is required before work tasks resume.

Shared disk in a cluster configuration

  • Maintenance such as replacing the shared disk

  • Migration of a cluster system or reconstruction of an environment

  • Recovery from a shared disk failure

When cluster system maintenance is performed

Database (embedded database)

  • Recovery from a failure such as a failure on the disk containing the database, a corrupted file, or files that are no longer available

  • Migration of a system or reconstruction of the environment

When the scheduler service stops

  • The scheduler service must be stopped.

  • After the recovery, JP1/AJS3 must be cold-started.#3

  • Because the jobnets must be registered again, time is required before work tasks resume.

#1

To back up and restore the registered execution-schedule information, you also need to back up and restore the unit definition information.

#2

When an entire system is migrated via backup and recovery, JP1/AJS3 - Manager cannot be migrated to a host that has a different host name. JP1/AJS3 - Manager can only be migrated to the same host or to another host that has the same host name. JP1/AJS3 - Agent can be migrated to a host that has a different host name.

In either case, as a prerequisite, JP1 must back up the entire system while maintaining the integrity of all file information, including the information registered in the OS, and then must restore all the backed up files.

If you intend to back up and restore an entire system using OS commands and backup tools, first verify the environment that is being used.

#3

When JP1/AJS3 is cold-started, the registered execution-schedule information for jobnets and the execution information for jobnets and jobs are initialized, and can no longer be restored.

Organization of this subsection

(1) Backing up and restoring JP1/AJS3 environment settings information and definition information

(a) What needs to be backed up and restored

You should back up files containing JP1/AJS3 environment settings information and definition information, and the settings information output by using a JP1/AJS3 command. The JP1/AJS3 environment settings and definition information includes information about JP1/AJS3 - Manager, JP1/AJS3 - Agent, and JP1/AJS3 - View. For details about the information that must be backed up in JP1/AJS3, see Table 5-3 in 5.2.2 Target files and backup timing.

Note that the definition information for jobnets and jobs, calendar and schedule information, and execution agent definition information are stored in an embedded database. Determine the unit and the best time to perform backup and recovery according to operational needs.

(b) Backup and recovery procedures

For details about the backup procedure, see 2.1.1(1) Backup procedure in the Job Management Partner 1/Automatic Job Management System 3 Administration Guide.

When you perform recovery, restore the backed up information to the system as needed according to the state of the system after a failure. Because only the settings information needed to run JP1/AJS3 is backed up, you will need to re-register jobnets for execution after this information is restored. For details about the recovery procedure, see 2.1.2(1) Recovery procedure in the Job Management Partner 1/Automatic Job Management System 3 Administration Guide.

(c) Backup and recovery methods

For details about how to back up and restore only settings information, see 2.2 Backing up the setup information for a system that uses JP1/AJS3 in the Job Management Partner 1/Automatic Job Management System 3 Administration Guide and 2.3 Restoring the setup information for a system that uses JP1/AJS3 in the Job Management Partner 1/Automatic Job Management System 3 Administration Guide.

(2) Backing up and restoring registered execution-schedule information for jobnets

You can use the functions for exporting and importing registered execution-schedule information to back up and restore the execution registration status for jobnets. If a failure occurs in the JP1/AJS3 system or if you need to cold-start the JP1/AJS3 service in order to back up and restore the entire system, these functions greatly reduce the time required until operation resumes.

You can back up and restore the registered execution-schedule information for jobnets in JP1/AJS3 - Manager. For details, see 4.4 Backing up and recovering the execution registration status of jobnets by using the ajsrgexport and ajsrgimport commands in the Job Management Partner 1/Automatic Job Management System 3 Administration Guide.

(3) Backing up and restoring the entire system

(a) Backup and recovery methods

When you use an OS command or backup tool to back up the entire system, stop JP1/AJS3 before starting the backup process. To restore the system from the backup, cold-start JP1/AJS3 to initialize the registered execution-schedule information for jobnets and the execution information for jobnets and jobs. For details about the procedure, see 2.4 Backing up and recovering an entire system in the Job Management Partner 1/Automatic Job Management System 3 Administration Guide.

(b) Considerations before backing up or restoring the entire system

Consider the following before you back up or restore the entire system:

  • Consistency with other data and applications (for example, the status of user jobs)

  • To what state (the point in time) you want to restore the jobnet or job

  • Which host (the manager host or the agent host) will be cold-started

With products such as JP1/AJS3, which link to a variety of other programs, it is extremely difficult to ensure consistency for the execution status and results of jobnets and jobs, and of executed user jobs. Consistency might be achievable, but only by imposing various constraints on the user jobs executed in the system. That contradicts the main JP1/AJS3 purpose of the flexible management of program operations through the use of jobnets. For these reasons, we recommend that you limit backup and recovery procedures to the JP1/AJS3 environment settings information and definition information.

(4) Backing up and restoring the shared disk in a cluster configuration

(a) Backup when replacing the disk

A shared disk used in a cluster configuration might need to be replaced to avoid hardware failures due to aging of the disk or another problem. In such cases, stop JP1/AJS3, copy the shared disk, and then replace the disk.

To back up or recover an entire shared disk, you must also back up or recover the entire system, including local disks.

(b) What needs to be backed up and restored and methods

If you want to be able to restore shared-disk data in the event of a failure, you need to back up the information described in (1) Backing up and restoring JP1/AJS3 environment settings information and definition information for each logical host in advance.

To back up and restore the physical host environment for a cluster configuration, you need to perform backup and recovery on both the executing and the standby hosts. For a logical host environment, you need to perform backup and recovery only on the executing host. You can then use the information you backed up on the executing host to restore the standby host.

For an overview of backup for a cluster configuration, see 2.1.1(2) Backup procedure (when a cluster system is used) in the Job Management Partner 1/Automatic Job Management System 3 Administration Guide. For an overview of recovery, see 2.1.2(2) Recovery procedure (when a cluster system is used) in the Job Management Partner 1/Automatic Job Management System 3 Administration Guide.

For details about the types of folders processed when you back up or restore the shared disk in a cluster configuration, see 8.2.1(1) Creating shared files on the shared disk in the Job Management Partner 1/Automatic Job Management System 3 Configuration Guide 1 (in Windows) or 17.2.1(1) Creating shared files on the shared disk in the Job Management Partner 1/Automatic Job Management System 3 Configuration Guide 1 (in UNIX).

For details about how to back up and restore the shared folder, see the procedures for logical hosts in 2.2.2 Backing up the JP1/AJS3 - Manager setup information in the Job Management Partner 1/Automatic Job Management System 3 Administration Guide and 2.3.3 Recovering the JP1/AJS3 - Manager setup information in the Job Management Partner 1/Automatic Job Management System 3 Administration Guide.

(5) Backing up and restoring an embedded database

Embedded databases are classified as scheduler databases and agent management databases. A scheduler database contains definition information for jobnets and jobs, calendar and schedule information, and execution status and execution results for jobnets and jobs. Therefore, after recovery, you need to cold-start JP1/AJS3 to ensure consistency in the system.

You can back up and restore an embedded database in JP1/AJS3 - Manager. For details about how to back up and restore an embedded database see 5.4 Handling database errors.