Clickstream Parse Input and Output Columns

Clickstream Parse Input Columns

The Clickstream Log transformation maps the columns from a Web log to the Clickstream Parse Input Columns and loads an output table with data from the log. This table becomes the input to the Clickstream Parse transformation. The following table lists the metadata for the Clickstream Parse input columns.
Clickstream Parse Input Columns
Column Name
Description
Label
Length
SAS Format
CLK_Client_IP
Specifies the visitor's IP address.
Client ID
64
$64.
CLK_cs_Bytes
Specifies the number of bytes that the client sends to the server, upon a server request.
Bytes Received
8
COMMA15.
CLK_cs_Cookie
Specifies the raw cookie string.
Cookie String
32760
$32760.
CLK_cs_Host
Specifies the host name, which is derived from the URL field that follows http://.
Requested Host
64
$64.
CLK_cs_Method
Specifies the method that is used to submit the request (for example, POST or GET).
HTTP Method
8
$8.
CLK_cs_Referrer
Specifies the full URL and any query parameters from the referring page.
Referrer
1024
$1024.
CLK_cs_URI_Query
Specifies the query string that is passed to the URL.
Query Sting
1024
$1024.
CLK_cs_URI_Stem
Specifies the URI, which is the URL, but without the http://www.domain.com/ field.
Requested File
1024
$1024.
CLK_cs_UserAgent
Specifies the string that identifies the user's browser, which the user's browser sends.
User Agent
160
$160.
CLK_cs_Username
Specifies the user name that the client used for authentication, if applicable.
Username
32
$32.
CLK_cs_Version
Specifies the version of the HTTP protocol that is being used.
HTTP Version
8
$8.
CLK_Date
Specifies the date stamp of the request.
Date
8
DATE9.
CLK_GMT_Offset
Specifies the Greenwich Mean Time (GMT) offset.
GMT Offset
5
$5.
CLK_Null
Specifies the placeholder for a field that is not being used.
Null Variable
8
$8.
CLK_s_Server
Specifies the server name, such as s-ComputerName.
Server Name
48
$48.
CLK_s_Server_IP
Specifies the IP address of the Web server.
Server IP Address
16
$16.
CLK_s_Server_Port
Specifies the number of the port that the Web server runs on.
Server Port
8
$8.
CLK_s_Sitename
Specifies the name of the virtual Web site.
Site Name
32
$32.
CLK_sc_Bytes
Specifies the number of bytes that the server sends to the client, upon a client request.
Bytes Sent
8
COMMA15.
CLK_sc_Status
Specifies the HTTP status code that the client receives from the server.
HTTP Status
8
4.
CLK_sc_SubStatus
Specifies the secondary status that is returned by some Web servers.
Sub Status
8
4.
CLK_Time
Specifies the timestamp of the request.
Time
8
TIME.
CLK_Time_Taken
Specifies the amount of time that is taken for the server to respond to the client request.
Time Taken
8
TIME.
CLK_sc_Win32_Status
Specifies the status that is returned by the Windows operating system.
Win32 Status
8
4.

Clickstream Parse Output Columns

The Clickstream Parse transformation maps the Parse input columns to a set of Parse output columns. The following table lists the metadata for the Clickstream Parse output columns.
Clickstream Parse Output Columns
Column Name
Description
Completion Method
Label
Length
SAS Format
Browser
Specifies the type of browser that the visitor uses.
Is derived from CLK_cs_UserAgent, by using pattern matching on known browser names.
Browser
40
$40.
Browser_Version
Specifies the version of the browser that the visitor uses.
Is derived from CLK_cs_UserAgent by using pattern matching to locate the browser name, and then extracting the version number that follows it.
Browser Version
16
$16.
Bytes_Received
Specifies the number of bytes that the client sends to the server.
Pass-Through CLK_cs_Bytes.
Bytes Received
8
COMMA15.
Bytes_Sent
Specifies the number of bytes that the server sends to the client.
Pass-Through CLK_sc_Bytes
Bytes Sent
8
COMMA15.
Client_IP
Specifies the visitor's IP address.
Pass-Through CLK_Client_IP
Client IP
64
$64.
Cookie_Jar
Specifies the raw contents of the cookie jar.
Pass-Through CLK_cs_Cookie
Cookie Jar
32760
$32760.
Date_Time
Specifies the date and time of the request.
Is derived by combining CLK_Date and CLK_Time
Date and Time
8
DATETIME.
Domain
Specifies the host name.
Pass-Through CLK_cs_Host
Domain
128
$128.
Method
Specifies the method that is used to submit the request (for example, POST or GET).
Pass-Through CLK_cs_Method
Method
8
$8.
Platform
Specifies the hardware platform of the visitor's computer.
Is derived from CLK_cs_UserAgent, by using pattern matching on known platform names.
Platform
40
$40.
Query_String
Contains the parameters that are specified in the URL. It is also referred to as the query or the CGI parameters.
Uses the Pass-Through CLK_URI_Query if non-blank. Otherwise, this query uses the query string from CLK_cs_URI_Stem.
Query String
1024
$1024.
Record_ID
Specifies the unique identifier for each record.
Is derived by combining the date of the SAS process, the SAS process ID, and the record counter.
Record ID
24
$24.
Referrer
Specifies the referring page (the URL from which the user requests access to the next URL).
Pass-Through CLK_cs_Referrer
Referrer
1024
$1024.
Referrer_Domain
Specifies the domain of the referrer.
Is derived from CLK_cs_Referrer, and is the text that is located between the protocol (http://) and the first-level path (/).
Referrer Domain
128
$128.
Referrer_Internal
Specifies whether the referrer is internal.
Is derived from a user-modified rule that runs after parse and sets referrer_internal to 1 when condition passes.
Referrer Internal
3
$3.
Referrer_Query_String
Specifies the query string that is passed with the referrer.
Is derived from CLK_cs_Referrer, and is the text that is passed in the URL after the question mark (?).
Referrer Query String
1024
$1024.
Referrer_Requested_File
Specifies the path and the filename of the referrer.
Is derived from CLK_cs_Referrer, and is all of the text that is located between the end of the domain name and the query string, if any.
Referrer Requested File
1024
$1024.
Requested_File
Specifies the requested file.
Pass-Through CLK_cs_URI_Stem
Requested_File
1024
$1024.
Server
Specifies the physical computer name that the Web server runs on, such as CLK_s_ComputerName.
Pass-Through CLK_s_ComputerName
Server
32
$32.
Server_IP
Specifies the IP address of the Web server.
Pass-Through CLK_s_IP
Server IP Address
16
$16.
Server_Port
Specifies the port that the Web server runs on, such as CLK_s_Port.
Pass-Through CLK_s_Port
Server Port
8
$8.
Sitename
Specifies the name of the virtual Web site, such as CLK_s_SiteName.
Pass-Through CLK_s_SiteName
Site Name
48
$48.
Status_Code
Specifies the HTTP status code that the server returns to the client during this request.
Pass-Through CLK_sc_Status
Status Code
8
4.
SubStatus
Specifies the secondary status that is returned by some Web servers.
Pass-Through CLK_sc_SubStatus
Sub Status
8
4.
User_Agent
Specifies the string that contains a description of the user's browser, which the user's browser sends.
Pass-Through CLK_cs_UserAgent
User Agent
160
$160.
Username
Specifies the user name that the client sends to the server for authentication, if applicable.
Pass-Through CLK_cs_Username
Username
32
$32.
Visitor_ID
Specifies a unique identifier for a visitor to the site. It typically contains the user's IP address and the name of the browser's user agent.
Is derived by combining CLK_Client_IP and CLK_cs_UserAgent, which is the default value, or by defining a user-defined rule that runs after the Clickstream Parse transformation.
Visitor Identifier
225
$225.