In jobs that
include a change data capture transformation, you can use a control
table to prevent the update of target rows that were processed in
an earlier run. When you run a job that uses a control table, the
CDC transformation first finds in the source the most recent insert,
update, or delete action for a given unique identifier (business key).
The most recent source row is then compared to the prior actions that
appear in the control table. If the unique identifiers match, and
if the rest of the rows are identical, then the source row is a duplicate
and it is not added to the target.
Control
tables are optional, so you need to use one only if the source changed
data table contains information that was already loaded into the target.
The control
table can be in SAS format or in native database format.
Column
definitions in the control table are similar to those that are required
in the source changed data tables.
You can
use the New Table Wizard to create control tables.
In control
tables, the names and order of the following columns can vary, because
you identify those columns in the properties window of the CDC transformation:
Application Name
identifies the application
that compares the source change data records to the records in the
target to test for previous updates. A typical value for this column
is SAS Data Integration Studio
. The
column type is character and the length is 64.
Table Name
identifies the source
changed data table. The column type is character and the length is
64.
Context
provides the unique
identifiers in the target that are not to be overwritten. The context
is a character value with length of 32 for DB2, Attunity, and General.
Oracle context is numeric with a length of 8.
Rows Processed
records the number
of source changed data records that were processed the last time that
the job was run. This value is updated at the end of the job run,
as an output from the CDC transformation. The type of this column
is numeric and the length is 8.
Timestamp
identifies the time
and date when the job was run, in DATETIME16.6 format. The type of
this column is numeric and the length is 8.