2.3.27 Operands related to the system switchover facility

134) pd_ha = use | nouse
Specifies whether or not the system switchover facility is to be used.
use: Use the system switchover facility.
nouse: Do not use the system switchover facility.
135) pd_ha_ipaddr_inherit = Y | N
Specifies whether or not IP addresses are to be inherited when the system switchover facility is used. How the system switchover facility operates depends on whether or not IP addresses will be inherited; for details, see the HiRDB Version 8 System Operation Guide.
Y: Inherit IP addresses.
N: Do not inherit IP addresses.
In server mode operations, IP addresses cannot be inherited.
If you use the standby-less system switchover facility, omit this operand.
Specification guidelines
  • HiRDB/Single Server
    To inherit IP addresses, specify Y; otherwise, specify N. However, in server mode operations, IP addresses cannot be inherited. Therefore, if you specify Y here, you need to specify N for the pd_ha_ipaddr_inherit operand of the unit control information definition.
    Note that if you specify N for this operand, you cannot specify Y for the pd_ha_ipaddr_inherit operand of the unit control information definition.
  • HiRDB/Parallel Server
    To inherit IP addresses, specify Y; otherwise, specify N. Note that the units that are the targets of the server mode cannot inherit IP addresses. Therefore, if you specify Y here, you need to specify N for the pd_ha_ipaddr_inherit operand of the unit control information definition of the units that are the targets of the server mode.
    If you specify N for this operand, you cannot specify Y for the pd_ha_ipaddr_inherit operand of the unit control information definition.
136) pd_ha_switch_timeout = Y | N
This operand can be specified when the server mode is used. Specification of this operand is invalid in the monitor mode.
This operand specifies whether to switch the system without waiting for internal termination processing of HiRDB when internal termination processing of HiRDB (a unit for a HiRDB/Parallel Server) during system switchover has exceeded the server failure monitoring time. The server failure monitoring time referred to here is the time specified for the patrol operand of the Hitachi HA Toolkit Extension.
For details about the patrol operand of the Hitachi HA Toolkit Extension, see the manual Hitachi HA Toolkit.
Y:
Switches the system without waiting for internal termination processing of HiRDB when internal termination processing of HiRDB during system switchover has exceeded the server failure monitoring time. In this case, system switchover is carried out by assuming that HiRDB has slowed down.
If you are using the standby-less system switchover (1:1) facility or the standby-less system switchover (effects distributed) facility, the specification for this operand is invalid during planned system switchover.
N:
Does not switch the system until internal termination processing of HiRDB during system switchover is terminated.
Advantage
If internal termination processing of HiRDB during system switchover takes a long time because, for example, of a disk error system switchover may be delayed as a result. If you specify Y (default value) for this operand, you can switch the system without waiting for internal termination processing of HiRDB, even when it is taking a long time.
Notes
  • If you specify Y for this operand when the patrol operand value is small, planned system switchover may turn into system switchover based on slow-down. This is because internal termination processing of HiRDB during planned system switchover exceeds the time specified by the patrol operand.
  • You need to be careful if restart is specified for the switchtype operand of the Hitachi HA Toolkit Extension. If pd_ha_switch_timeout=Y (default value) is specified, and if internal termination processing of HiRDB exceeds the server failure monitoring time, HiRDB is not started in the system in which the failure occurred. In this case, the system is immediately switched.
    For details about the switchtype operand of the Hitachi HA Toolkit Extension, see the manual Hitachi HA Toolkit.
137) pd_ha_mgr_rerun = wait | notwait
This operand applies only to a HiRDB/Parallel Server.
Specifies whether to wait for the completion of start processing of other units before switching the system for the system manager unit.
wait:
Waits for the completion of start processing of other units before switching the system for the system manager unit. In this case, the following checks are performed to start the system manager unit:
  • Version check for each unit
  • Is the dictionary server running?
  • Is at least one front-end server running?
  • Is at least one back-end server running?
If another unit has stopped, system switchover for the system manager unit may take a long time or fail. The following table describes what happens when system switchover for the system manager unit occurs when some of the units are stopped.
Reduced activation specification (pd_start_level value)Specification of a name for the unit that does not startAction
Not specified (0)[Figure]Cannot start.
Specified (1)Nothing is specified for the pd_start_skip_unit operandSystem switchover occurs after a wait time specified by the pd_reduced_check_time operand.*
Stopped unit is specified for the pd_start_skip_unit operand.System switchover is completed immediately.*
Legend:
[Figure]: Not applicable
* System switchover is completed only when all of the following conditions are satisfied:
  • The dictionary server is running.
  • At least one front-end server is running.
  • At least one back-end server is running.
notwait:
Does not wait for the completion of start processing of other units before switching the system for the system manager unit. When this operand is specified, system switchover is executed rapidly for the system manager unit without waiting for other units that have stopped to start.
When system switchover is completed for the system manager unit, the message KFPS05210-I (system initialization completion message) is issued even if the HiRDB operating environment is not ready. If a job cannot be continued even after the completion of the system manager switchover because of a UAP error, for example, use the pdls command to check the system operating status.
Condition
To specify notwait for this operand, the system configuration must satisfy a certain condition. For details about the required system configuration, see the HiRDB Version 8 System Operation Guide.
Specification guidelines
To prevent failure of the system switchover for the system manager unit, specify notwait. For details about how to handle system manager unit switchover failures, see the HiRDB Version 8 System Operation Guide.
138) pd_ha_transaction = error | queuing
Specifies whether to use the transaction queuing facility. Also specifies the processing that takes place when the number of connections to the HiRDB server exceeds the maximum number of concurrent connections (value specified by the pd_max_users operand) during system switchover. For details on for the transaction queuing facility, see the HiRDB Version 8 System Operation Guide.
error:
  • Does not use the transaction queuing facility. The transactions being processed by the back-end server or dictionary server being switched end in errors.
  • If the number of connections to the HiRDB server exceeds the maximum number of concurrent connections during system switchover, the connections to the HiRDB server end in errors.
queuing:
  • Uses the transaction queuing facility. Instead of ending in errors, the transactions being processed by the back-end server or dictionary server being switched are queued by the front-end server until system switchover is completed.
  • If the number of connections to the HiRDB server exceeds the maximum number of concurrent connections during system switchover, connection processing to the HiRDB server is retried for the duration of pd_ha_trn_queuing_wait_time + pd_ha_trn_restart_retry_time. However, the HiRDB client version must be 07-00 or newer.
Conditions
If you use the transaction queuing facility, all of the following conditions must be satisfied:
  • HiRDB/Parallel Server
  • The rapid system switchover facility, standby-less system switchover (1:1) facility, or standby-less system switchover (effects distributed) facility is used.
Note that these conditions need not be satisfied if connection processing to the HiRDB server is to be retried by a HiRDB client.
Relationship to client environment definition
Even when queuing is specified for this operand, you can specify that the transaction queuing facility not be used for each client. To cancel the transaction queuing facility for each client, specify the PDHATRNQUEUING operand in the client environment definition. For details about the PDHATRNQUEUING operand, see the HiRDB Version 8 UAP Development Guide.
Relationship to other operands
This operand is related to the following operands:
  • pd_ha_trn_queuing_wait_time
  • pd_ha_trn_restart_retry_time
139) pd_ha_trn_queuing_wait_time = transaction-queuing-wait-time
[Figure]<unsigned integer>((1-3600))<<180>>(Seconds)
Specifies the transaction queuing wait time when the transaction queuing facility is used. If the standby unit or server does not start within the wait time specified by this operand, the transactions being queued end in errors. Transactions that occur subsequently end in errors without being queued. If a unit or server starts before this wait time, transactions are resumed without waiting for the time specified by this operand.
If the standby-less system switchover (1:1) facility is used, the alternate portion becomes the standby system during normal operation, and the normal BES unit becomes a standby system during alternation.
Condition
The pd_ha_transaction operand must be set to queuing.
Specification guidelines
Normally, you need not specify this operand. However, if the rollforward during system switchover takes 180 seconds or longer, increase the value of this operand.
140) pd_ha_trn_restart_retry_time = retry-time-upper-limit-after-transaction-start-request-errors
[Figure]<unsigned integer>((1-3600))<<60>>(Seconds)
If system switchover occurs while the transaction queuing facility is being used, transactions are queued by the front-end server. However, during the period between the system switchover and the restart of the standby unit or server, the front-end server cannot detect the system switchover. During this period (the period between the system switchover and the restart of the standby unit or server), the front-end server issues a transaction start request to the running unit or server. However, because the running unit has already been abnormally terminated, this transaction start request ends in an error. For the transaction that ends in an error, the front-end server re-issues a transaction start request (retries the transaction start request).
This operand specifies the upper limit for this retry time. If the standby unit or server is not restarted within the value specified by this operand, the transactions being retried end in errors. Furthermore, transactions that occur subsequently end in errors without being retried. Note that if the standby unit or server begins to restart before this retry time, no retries occur and transactions are queued.
Condition
The pd_ha_transaction operand must be set to queuing.
Specification guidelines
Normally, you need not specify this operand. However, if disk switching by cluster software takes 60 seconds or longer, increase the value of this operand.
Remarks
The relationship between the pd_ha_trn_queuing_wait_time and pd_ha_trn_restart_retry_time operands is explained as follows.
[Figure]
Explanation:
Zones A and D:
Transactions can be started (normal state).
Zone B:
The unit containing the back-end server is being switched, and the front-end server cannot detect this condition. Transaction start request is retried for the duration of time specified by the pd_ha_trn_restart_retry_time operand. When the front-end server detects the system switchover, transactions are queued. If the front-end server cannot detect it within the specified time, the transactions end in errors.
Zone C:
The unit containing the back-end server is being switched, and the front-end server has detected this condition. Transactions are queued for the duration of time specified by the pd_ha_trn_queuing_wait_time operand. If the transactions cannot be started within the specified time, they end in errors.
141) pd_ha_resource_act_wait_time = maximum-wait-time-for-resource-activation]
[Figure]<unsigned integer>((2-3600))<<10>> (seconds)
When you use the standby-less system switchover (effects distributed) facility, this operand specifies the maximum wait time for the running server's resources to be activated at the time of unit startup. Unit startup processing is placed on hold up to the specified amount of wait time. If the resources are activated within the specified amount of time, unit startup processing resumes.
Advantages
When unit startup processing is completed, jobs can be started only if the running server's startup processing is completed in the unit. By specifying an appropriate value in this operand, you can start your jobs immediately after the unit's startup processing is completed because the unit startup processing will have waited on wait status for resource activation.
Specification guidelines
Normally, there is no need to specify this operand. You should specify this operand when all the following conditions are applicable:
  • The standby-less system switchover (effects distributed) facility is being used
  • The KFPS05623-I message was displayed
  • The target unit for the message contains the running server
Use the following guideline to determine the value to be specified in this operand:
10 + time required for resource activation processing (seconds)
Remarks
If the unit does not contain the running server, the running server's startup is placed on hold for the amount of time specified in this operand. However, if all the servers in the unit start as standby servers, unit startup processing is restarted without waiting for the amount of time specified in this operand.