HiRDB Datareplicator Version 8 Description, User's Guide and Operator's Guide
This section explains the files and processes that are used during extraction processing. For details about the files and processes used when
JP1/Cm2 is used for operations management, see 3.4.5 Files and processes used for operations management.
The files discussed as follows are used during extraction processing.
This file stores database update information at the HiRDB. The source Datareplicator extracts update information from this file.
Datareplicator uses these files to store the update information extracted from the HiRDB system log file during extraction processing.
The source Datareplicator stores the update information extracted during extraction processing sequentially into one of the extraction information queue files. When this extraction information queue file becomes full, Datareplicator uses another extraction information queue file. This is called swapping, and it enables the source Datareplicator to store a large amount of update information. Swapping takes place in the order of the qufile001 to qufile016 operands specified in the extraction environment definition.
When all extraction information queue files become full, Datareplicator re-uses the first file. However, if transmission of the update information has not been completed for the file that is to be used next, swapping cannot take place. In such a case, Datareplicator outputs a message indicating that the queue file is full and stops extracting update information from the system log file until transmission from the file has been completed.
The following figure shows the procedure for storing data in the extraction information queue files.
Figure 3-13 Procedure for storing data in the extraction information queue files
Datareplicator uses the data linkage file to store and read HiRDB communication messages that are required for extraction processing, such as the storage status of update information in the system log at HiRDB and the status of reading update information from the system log file at the source Datareplicator.
These files store the extraction/transmission status required for recovery in the event of an error. The extract-time status files include the extraction master status file and the extraction server status file.
If extraction or transmission processing results in an error, Datareplicator outputs error details to an error information file. The extract-time error information files include the extraction master error information files and the extraction node master error information files.
You can also output to the syslog file the information that is output to the error information files. You use the syslogout operand in the extraction system definition to specify whether the information is to be output also to the syslog file.
Datareplicator uses the activity trace files to collect Datareplicator's activity status. These files contain information about Datareplicator's operation and performance. The activity trace files include the extraction master trace files and the extraction node master trace files.
To use the activity trace files, specify the int_trc_lvl and int_trc_filesz operands in the extraction system definition. You can edit the obtained activity trace files with the hdstrcedit command. For details about the hdstrcedit command, see its command syntax in Chapter 7. Command Syntax.
The extraction system definition file defines the overall operating environment for the source Datareplicator, such as the source Datareplicator's identifier and the target identifier.
This file defines the operating environment for extraction processing, such as the names and sizes of the extraction information queue files.
These files define operating environments for transmission processing, such as the service names and host names for communications.
This file defines detailed information about extraction and transmission processing, such as the correspondence between source table/columns and update information and the destination of update information.
This file is obtained by using the hdeprep command to convert the extraction definition file to the internal format. You must execute this conversion to the extraction definition preprocessing file before you start the source Datareplicator.
Note that if Datareplicator cannot connect to HiRDB for a reason such as omission of the PDUSER environment variable or a password, the KFRB00868-W message is output, in which case the source Datareplicator is started without checking the validity of the extraction definition preprocessing file.
These files store a record of the dates and times Datareplicator's commands are executed.
Figure 3-14 shows the organization of processes during extraction processing when the source HiRDB is a single server, and Figure 3-15 shows the organization of processes when the source HiRDB is a parallel server.
Figure 3-14 Organization of processes during extraction processing: Source HiRDB is a single server
Figure 3-15 Organization of processes during extraction processing: Source HiRDB is a parallel server
The extraction command process processes the source Datareplicator's command and issues an instruction to the extraction master process. If the source HiRDB is a parallel server, there is one extraction command process under the system manager.
The extraction master process controls the extraction node master process. If the source HiRDB is a parallel server, there is one extraction master process under the system manager.
The extraction node master process manages the extraction process and the transmission process. If the source HiRDB is a parallel server, there is one extraction node master process at each server machine that contains a back-end server. The source Datareplicator calls such a server machine a node.
The extraction process extracts update information from the system log file and stores it in the extraction information queue file. If the source HiRDB is a parallel server, one extraction process exists at each back-end server that is subject to extraction.
The transmission process reads update information from the extraction information queue file and sends it to the target system. There are as many transmission processes as there are destinations. If the source HiRDB is a parallel server, there are as many transmission processes as there are destinations at each back-end server that is subject to extraction. However, if you specify sendmst in the sendcontrol extraction system definition operand, up to the number of transmission process can exist as is specified in the sendprocnum extraction system definition operand.
Hereafter, the method used when sendmst is specified in the sendcontrol extraction system definition operand is referred to as the sendmst method, while the method used when nodemst is specified is referred to as the nodemst method.
The transmission master process starts, stops, and schedules the transmission processes to control/suppress the number of transmission processes to be started when there are many destinations. The transmission master process is generated when a definition is made to control the number of transmission processes. For details about how to control the number of transmission processes, see 3.2.5 Controlling the number of transmission processes.
The activity trace collection process collects activity trace information.
All rights reserved. Copyright (C) 2007, 2013, Hitachi, Ltd.