The
second stage contains the first loop job. The transformations in the
first loop job represent the subjob, which is the job that is run
in parallel. For a standard Web log, each stream consists of a Clickstream
Log transformation, a Clickstream Parse transformation, and two checkpoints,
which are created by renaming the Return Code transformation and which
enable you to configure how errors are processed.
The transformations
in this stage are described in the following table:
Loop One Transformations
|
|
Inputs from and Outputs
to
|
Loop 1 (Recognize and
Parse) transformation
|
Passes the appropriate
parameters through to the job flows that are executed in parallel.
Each parallel stream should have the following parameters set:
-
INPUTFILE is supplied by the FULLNAME
source column
-
OUTLIBPATH is supplied by the PARMLIBPATH
source column
-
INFILENUM is supplied by the FILENUM
source column
|
From: Set Output Library
(reused SAS Extract) transformation
To: Clickstream Log
transformation
To: Filter - Only properly
parsed logs (SAS Extract)
|
Clickstream Log transformation
|
Extracts and decodes
(URL and character) data from a single log for each pass through the
loop; determines the raw Web log type and creates a SAS DATA step
view that is used to read the raw data.
|
From: Loop 1 (Recognize
and Parse) transformation
To: RC Check - Log transformation
To: Clickstream Parse
transformation
|
RC Check - Log transformation
|
Evaluates the return
code from Clickstream Log; sends e-mail to specified address if the
log step fails.
|
From: Clickstream Log
transformation
To: Clickstream Parse
transformation
|
Clickstream Parse transformation
|
Parses this data and
generates n output tables,
where n is the number of groups
expected by the Sessionize loop (the second loop).
For customer intelligence
template jobs, this step also identifies the campaign and customer
who clicked on a specific treatment.
Campaign information
is denoted by these columns:
-
EntrySource: ID of the entity that
originated access to the landing page
-
EntryActionID: ID that represents
the Entry Source
-
S1 through S4 - identifies the
subject of an Entry Action either alone or with other Subject ID parameters
Clickstream Parse populates
EntrySource with a value of “SDM” if there is a value
in the EntryActionID and S1 columns.
|
From: RC Check - Log
transformation
To: RC Check - Parse
transformation
|
RC Check - Parse transformation
|
Evaluates the return
code from Clickstream Parse; sends e-mail to specified address if
the parse step fails.
|
From: Clickstream Parse
transformation
To: Loop End transformation
|
|
Ends loop processing;
returns to beginning of loop
|
From: RC Check - Parse
transformation
To: Filter - Only properly
parsed logs (reused SAS Extract) transformation
|
The following display
shows the first loop stage of the template job.
Loop 1 Process Flow for Standard Web Logs
When you
process SAS page tag logs, an additional Clickstream Parse transformation
(and associated RC Check) are inserted in order to process the data
elements collected by the SAS page tag (after the RC Check-Log transformation
and before the Clickstream Parse transformation). This additional
Clickstream Parse transformation is named Parse Tagged Data Items.
For a partial list of the data elements processed by this additional
transformation, see
SAS Page Tag Predefined Data Elements Reference.