Hitachi

HA Monitor Cluster Software Messages


KAMN766-E

An error occurred during the acquisition of information about the public cloud environment.

In the AWS environment:

HA Monitor failed to acquire the information required to reset the local host (the instance ID and the IDs of the ENIs to be disconnected from the network). The remote host cannot reset the local host or disconnect the local host from the network.

In the Azure environment:

An attempt to acquire the information (virtual machine ID) required for a remote host to reset the local host failed. The remote host is not able to reset the local host.

In the OCI environment:

An attempt to acquire the information (instance OCID) required for a remote host to reset the local host failed. The remote host is not able to reset the local host.

S:

Continues processing.

O:

Contact a system administrator.

Action:

In the AWS environment:

HA Monitor failed to acquire the instance ID or region name of the local host, or the IDs of the ENIs to be disconnected from the network. Currently, the remote host cannot reset the local host or disconnect the local host from the network.

Possible causes are as follows:

• There are errors in the network-disconnection settings file (only when hot-standby switchover based on network disconnection is used).

• The AWS CLI did not work correctly on AWS.

• HA Monitor failed to acquire the instance ID or region name from the Instance Metadata Service (IMDS) on AWS.
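To help isolate the third cause, the instance ID and region name can be requested from IMDS by hand on the local host. The following is a minimal sketch, assuming IMDSv2 (session-token) access at the standard link-local address. It is an independent check and does not reproduce how HA Monitor itself acquires the information. The function names fetch_aws_identity and _http are hypothetical.

```python
import urllib.request

# Standard link-local address of the EC2 Instance Metadata Service.
IMDS_BASE = "http://169.254.169.254/latest"


def _http(method, url, headers, timeout=2.0):
    """Send one HTTP request and return the response body as text."""
    req = urllib.request.Request(url, method=method, headers=headers)
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return resp.read().decode("utf-8").strip()


def fetch_aws_identity(http=_http):
    """Return (instance_id, region) of the local instance via IMDSv2.

    Assumption: IMDSv2 is enabled. The `http` parameter exists only so
    that the network call can be replaced in a test.
    """
    # IMDSv2 requires a session token obtained with a PUT request.
    token = http(
        "PUT",
        f"{IMDS_BASE}/api/token",
        {"X-aws-ec2-metadata-token-ttl-seconds": "60"},
    )
    auth = {"X-aws-ec2-metadata-token": token}
    instance_id = http("GET", f"{IMDS_BASE}/meta-data/instance-id", auth)
    region = http("GET", f"{IMDS_BASE}/meta-data/placement/region", auth)
    return instance_id, region
```

If calling fetch_aws_identity() on the local host raises an error or times out, IMDS is probably unreachable or disabled for the instance, which is consistent with this cause.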

After detecting an error, HA Monitor retries the acquisition of the instance ID of the local host and the IDs of the ENIs to be disconnected from the network at 60-second intervals. Eliminate the cause of the error by taking the actions described below, and then confirm that the KAMN765-I message is output. If the KAMN765-I message is still not output after you take these actions, immediately ask AWS support to investigate based on the following execution logs, and then take appropriate action:

/opt/hitachi/HAmon/spool/cloud/moncld_getinf.log_err

/opt/hitachi/HAmon/spool/cloud/moncld_nwdefchk.log_err (only when hot-standby switchover based on network disconnection is used)
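The confirmation step above (KAMN765-I must be output after the most recent KAMN766-E) and the selection of execution logs to send to support can be sketched as follows. This is an illustration only. The functions recovered and logs_to_send are hypothetical helpers, and the sketch does not assume where HA Monitor messages are written: the caller supplies the message lines from whatever log is used on the system.

```python
import os

# Execution logs named in this message description.
ERROR_LOGS = [
    "/opt/hitachi/HAmon/spool/cloud/moncld_getinf.log_err",
    "/opt/hitachi/HAmon/spool/cloud/moncld_nwdefchk.log_err",
]


def recovered(lines):
    """Return True if the latest KAMN765-I follows the latest KAMN766-E.

    Only the message IDs are matched. If neither ID appears, recovery
    is treated as not confirmed and False is returned.
    """
    last_error = last_ok = -1
    for index, line in enumerate(lines):
        if "KAMN766-E" in line:
            last_error = index
        elif "KAMN765-I" in line:
            last_ok = index
    return last_ok > last_error


def logs_to_send(paths=ERROR_LOGS):
    """Return the execution logs that exist and are not empty."""
    return [p for p in paths if os.path.isfile(p) and os.path.getsize(p) > 0]
```

Because the second log is produced only when hot-standby switchover based on network disconnection is used, logs_to_send() skips any file that is missing rather than treating it as an error.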

Take the actions described in the following sections: "Checking the network-disconnection settings file (only when hot-standby switchover based on network disconnection is used)" and "Checking the AWS settings".

Checking the network-disconnection settings file (only when hot-standby switchover based on network disconnection is used)

Check the network-disconnection settings file for errors.

If the network-disconnection settings file has errors, correct them, and then restart HA Monitor.

If the file has no errors, check whether the defined ENIs are attached to the instance.

If the defined ENIs are not attached:

In this state, neither business communications nor replication of the business disk is possible. Attach the necessary ENIs to the instance.

For details about how to attach an ENI, see the section that describes hot-standby switchover based on network disconnection in the manual For Public Cloud Systems HA Monitor Cluster Software Guide.

If the defined ENIs are attached:

Take the action described in "Checking the AWS settings".
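The attachment check in this section can be sketched as follows, assuming the AWS CLI is installed and configured on the host. The sketch runs aws ec2 describe-network-interfaces and compares the result with the ENI IDs that you defined. The format of the network-disconnection settings file is not assumed here, so the caller passes the defined ENI IDs as a plain list. All function names are hypothetical.

```python
import json
import subprocess


def describe_enis(eni_ids):
    """Run the AWS CLI for the given ENI IDs and return the parsed JSON."""
    cmd = [
        "aws", "ec2", "describe-network-interfaces",
        "--network-interface-ids", *eni_ids,
        "--output", "json",
    ]
    result = subprocess.run(cmd, check=True, capture_output=True, text=True)
    return json.loads(result.stdout)


def attached_eni_ids(describe_output, instance_id):
    """Return the set of ENI IDs that are attached to instance_id."""
    attached = set()
    for eni in describe_output.get("NetworkInterfaces", []):
        # An ENI that is not attached has no "Attachment" entry.
        attachment = eni.get("Attachment") or {}
        if (attachment.get("InstanceId") == instance_id
                and attachment.get("Status") == "attached"):
            attached.add(eni["NetworkInterfaceId"])
    return attached


def missing_enis(defined_ids, describe_output, instance_id):
    """Return the defined ENI IDs that are not attached to the instance."""
    attached = attached_eni_ids(describe_output, instance_id)
    return sorted(set(defined_ids) - attached)
```

For example, missing_enis(ids, describe_enis(ids), instance_id) returns an empty list when every defined ENI is attached. A non-empty result corresponds to the case "If the defined ENIs are not attached". Note that the AWS CLI itself fails if a specified ENI ID does not exist, in which case describe_enis raises subprocess.CalledProcessError.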

Checking the AWS settings

See "[AWS] Configuring AWS" in the manual For Public Cloud Systems HA Monitor Cluster Software Guide, and revise the settings to correct errors.

In the Azure environment:

An attempt to acquire the virtual machine ID has failed. The remote host is not able to reset the local host. Immediately ask Azure support to investigate based on the execution log (/opt/hitachi/HAmon/spool/cloud/moncld_getinf.log_err), and then take appropriate action.

After an error is detected, HA Monitor attempts to acquire the virtual machine ID at 60-second intervals. Eliminate the cause of the error, and then make sure that the message KAMN765-I is output.
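Before contacting support, it can help to confirm whether the virtual machine ID is obtainable on the local host at all. The following is a minimal sketch that queries the Azure Instance Metadata Service directly. It is an independent check and is not necessarily the method HA Monitor uses. The API version shown is one valid choice, and the function names are hypothetical.

```python
import urllib.request

# Azure Instance Metadata Service endpoint for the VM ID, as plain text.
AZURE_VMID_URL = (
    "http://169.254.169.254/metadata/instance/compute/vmId"
    "?api-version=2021-02-01&format=text"
)


def _get(url, headers, timeout=2.0):
    """Send one GET request, bypassing any proxy, and return the body."""
    req = urllib.request.Request(url, headers=headers)
    # An empty ProxyHandler disables proxies; the metadata service is
    # reachable only from within the VM itself.
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    with opener.open(req, timeout=timeout) as resp:
        return resp.read().decode("utf-8").strip()


def fetch_azure_vm_id(get=_get):
    """Return the virtual machine ID of the local VM.

    The `get` parameter exists only so that the network call can be
    replaced in a test.
    """
    # The service rejects requests that lack the "Metadata: true" header.
    return get(AZURE_VMID_URL, {"Metadata": "true"})
```

If fetch_azure_vm_id() fails on the local host, include that result together with the execution log when you ask Azure support to investigate.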

In the OCI environment:

An attempt to acquire the instance OCID has failed. The remote host is not able to reset the local host. Immediately ask OCI support to investigate based on the execution log (/opt/hitachi/HAmon/spool/cloud/moncld_getinf.log_err), and then take appropriate action.

After an error is detected, HA Monitor attempts to acquire the instance OCID at 60-second intervals. Eliminate the cause of the error, and then make sure that the message KAMN765-I is output.
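To check whether the instance OCID is obtainable on the local host, the OCI instance metadata service can be queried directly. The following is a minimal sketch, assuming the version 2 metadata endpoint is enabled for the instance. It is an independent check and is not necessarily the method HA Monitor uses. The function names are hypothetical.

```python
import json
import urllib.request

# OCI instance metadata service (version 2) endpoint for instance data.
OCI_INSTANCE_URL = "http://169.254.169.254/opc/v2/instance/"


def _get(url, headers, timeout=2.0):
    """Send one GET request and return the response body as text."""
    req = urllib.request.Request(url, headers=headers)
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return resp.read().decode("utf-8")


def fetch_oci_instance_ocid(get=_get):
    """Return the OCID of the local instance.

    The `get` parameter exists only so that the network call can be
    replaced in a test.
    """
    # Version 2 requests must carry this fixed authorization header.
    body = get(OCI_INSTANCE_URL, {"Authorization": "Bearer Oracle"})
    # The instance document is JSON; the OCID is stored under "id".
    return json.loads(body)["id"]
```

If fetch_oci_instance_ocid() fails on the local host, include that result together with the execution log when you ask OCI support to investigate.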