Contents
About
What's New in SAS Data Integration Studio 4.9
Introduction
Overview of SAS Data Integration
About SAS Data Integration
Advantages of SAS Data Integration
A Basic Data Integration Environment
How to Get Help for SAS Data Integration Studio
Installing SAS Data Integration Studio
Administrative Documentation for SAS Data Integration Studio
Accessibility Features in SAS Data Integration Studio
General User Tasks
Getting Started
Security for SAS Data Integration Studio
Main Tasks for Creating Process Flows
Starting SAS Data Integration Studio
Connecting to a SAS Metadata Server
Working with the Folders Tree
Selecting a Default SAS Application Server
Registering SAS Libraries
Working with User-Defined Formats
Registering Tables and Cubes
Overview of Transformations
Working with Stored Processes
Working with Web Services
Specifying Global Options in SAS Data Integration Studio
Working with Change Management
Search Metadata
Add a Note or Document to a Registered Object
View the Content of Notes or Documents
Importing, Exporting, and Copying Metadata
Metadata Import and Export in SAS Data Integration Studio
Working with SAS Package Metadata
Preparing to Import or Export SAS Package Metadata
Exporting SAS Package Metadata
Importing SAS Package Metadata
Copying and Pasting Metadata Objects
Working with SAS Metadata Bridges
Usage Notes for Importing or Exporting with a SAS Metadata Bridge
Preparing to Import or Export with a SAS Metadata Bridge
Importing New Metadata with a SAS Metadata Bridge
Importing Updated Metadata with a SAS Metadata Bridge
Exporting Metadata with a SAS Metadata Bridge
Working with Tables
About Tables
Registering Existing Tables with the Register Tables Wizard
Registering New Tables with the New Table Wizard
Viewing or Updating Table Metadata
Using a Physical Table to Update Table Metadata
Specifying Options for Tables
Supporting Case and Special Characters in Table and Column Names
Maintaining Column Metadata
Standardizing Columns
Maintaining Keys
Maintaining Indexes
Browsing Table Data
Editing SAS Table Data
Using the View Data Window to Create a SAS Table
Specifying Browse and Edit Options for Tables and External Files
Working with External Files
About External Files
Registering a Delimited External File
Registering a Fixed-Width External File
Registering an External File with User-Written Code
Viewing or Updating External File Metadata
Overriding the Code Generated by the External File Wizards
Specifying NLS Support for External Files
Accessing an External File with an FTP Server or an HTTP Server
Viewing Data in External Files
Registering a COBOL Data File That Uses a COBOL Copybook
Using an External File in the Process Flow for a Job
Using a Format File to Register a Fixed-Width External File
Creating Jobs
About Jobs
Creating an Empty Job
Creating a Process Flow for a Job
Creating a Job That Contains Jobs
Working with Default Temporary Output Tables
Specifying Options for Jobs
Documenting Process Flow Diagrams
Accessing Local and Remote Data
Viewing or Updating Job Metadata
Displaying the SAS Code for a Job
Common Code Generated for a Job
Managing Jobs
About Managing Jobs
Submitting a Job for Immediate Execution
Meeting Prerequisites for Collecting Job Statistics
Reviewing a Successful Job
Diagnosing and Correcting an Unsuccessful Job
Adding a Transformation to an Existing Job
Understanding the Job Has Changed Warning
Understanding the Crossed Versions in a Job Warning
Displaying Run-Time Statistics in SAS Job Monitor
Displaying Run-Time Statistics in SAS Web Report Studio or the SAS Stored Process Server
Maintaining Column Mappings
Managing the Scope of Column Changes in Jobs
Managing Connections in Job Editor Windows
Viewing the Code for a Transformation
Specifying Options for Transformations
Redirecting Temporary Output Tables
Pushing ELT Job Code Down to a Database
Using a Web Client to Orchestrate Jobs
Restarting Jobs From Checkpoints
About Restarting Jobs
Prerequisites for Restarting Jobs
Adding Checkpoints to a Job
Restarting a Job
Managing the Status of Jobs and Transformations
About Status Handling for Jobs and Transformations
Default Conditions, Actions, and Conditional Action Sets
Prerequisites for Actions
Perform Actions Based on the Status of a Job
Perform Actions Based on the Status of a Transformation
Macro Variables for Status Handling
Deploying Jobs
About Deploying Jobs
About Deploying Jobs for Scheduling
Prerequisites for Deploying a Job for Scheduling
Deploying Jobs for Scheduling
Using a Command Line to Deploy Jobs
Redeploying Jobs for Scheduling
Using Scheduling to Handle Complex Process Flows
Using Deploy for Scheduling to Execute Jobs on a Remote Host
About Deploying Jobs as Stored Processes
Prerequisites for Deploying a Job as a Stored Process
Deploying Jobs as Stored Processes
Redeploying Jobs to Stored Processes
Viewing or Updating Stored Process Metadata
About Deploying Jobs as Web Services
Prerequisites for Web Service Jobs
Requirements for Web Service Jobs
Creating a Web Service Job
Deploying a Web Service Job as a Stored Process
Deploying a Stored Process as a Web Service
Working with Versions
About Versions
Prerequisites for Version Control
Example Setup for an Apache Subversion (SVN) Server
Creating a Version
Reviewing and Managing Versions
Comparing Versions
Working with Generated Code
About Code Generated for Jobs
Displaying the Code Generated for a Job
Displaying the Code Generated for a Transformation
Specifying Options for Jobs
Specifying Options for a Transformation
Modifying Configuration Files or SAS Start Commands for Application Servers
Working with User-Written Code
About User-Written Code
Adding User-Written Code to the Precode and Postcode Tab
Adding a User Written Code Transformation to a Job
Creating and Using a Generated Transformation
Updating a Generated Transformation
Editing the Generated Code for a Job or Transformation
Replacing the Generated Code for a Job or Transformation
Converting a SAS Code File to a Job
Optimizing Process Flows
About Process Flow Optimization
Managing Process Data
Managing Columns
Streamlining Process Flow Components
Using Simple Debugging Techniques
Using SAS Logs
Reviewing Temporary Output Tables
Additional Performance Optimization Information
Working with Impact Analysis and Data Lineage
Impact Analysis and Data Lineage
Performing an Impact Analysis
Performing Impact Analysis on a Generated Transformation
Performing Reverse Impact Analysis
Using SAS Lineage
Working with Reports
About Metadata Reports
Opening the Reports Window
Selecting the Reports Perspective
Customizing the Tables Report
Customizing the Job Documentation Report
Running and Saving a Report
Saving a Report As a Document Object
Viewing a Report
Creating Your Own Report
Working with Data Management Platform
Integration with DataFlux Data Management Platform
General Prerequisites for Data Quality Transformations
Prerequisites for Running a DataFlux Job or Profile in a SAS Data Integration Studio Job
Analyzing the Quality of Data Sources
Standardizing Values with a Standardization Scheme
Standardizing Values with a Definition
Using Match Codes to Improve Record Matching
Using a DataFlux Data Service in a SAS Data Integration Studio Job
Using a DataFlux Job or Profile in a SAS Data Integration Studio Job
Working with Transformations
Working with Analysis Transformations
About Analysis Transformations
Creating a Correlation Analysis
Creating a Distribution Analysis
Generating Forecasts
Frequency of Eye Color By Hair Color Crosstabulation
One-Way Frequency of Eye Color By Region
Creating Summary Statistics for a Table
Creating a Summary Tables Report from Table Data
Working with Loader Transformations
About Loader Transformations
About the SPD Server Table Loader Transformation
Teradata Table Loader Transformation
About the Table Loader Transformation
About the Oracle Bulk Table Loader Transformation
About the DB2 Bulk Table Loader
Setting Table Loader Transformation Options
Selecting a Load Technique in the Table Loader
Removing Non-Essential Indexes and Constraints during a Load
Considering a Bulk Load
Working with SAS Sort Transformations
About Sort Transformations
Optimizing Sort Performance
Creating a Table That Contains the Sorted Contents of a Source
Working with SQL Join Transformations
About Join Transformations
Using the Designer Window
Reviewing and Modifying Clauses, Joins, and Tables in an SQL Query
Understanding Automatic Joins
Selecting the Join Type
Adding User-Written SQL Code
Debugging an SQL Query
Adding a Column to the Target Table
Adding a Join to an SQL Query on the Designer Tab
Creating a Simple SQL Query
Configuring a SELECT Clause
Adding a CASE Expression
Creating or Configuring a WHERE Clause
Adding a GROUP BY Clause and a HAVING Clause
Adding an ORDER BY Clause
Adding Subqueries
Validating or Submitting an SQL Query
Joining a Table to Itself
Using Parameters with an SQL Join
Constructing a SAS Scalable Performance Data Server Star Join
Optimizing SQL Processing Performance
Performing General Data Optimization
Influencing the Join Algorithm
Setting the Implicit Property for a Join
Enabling Explicit Pass-Through Processing for SQL Join Transformations
Using Properties Window Options to Optimize SQL Processing Performance
Working with Other SQL Transformations
About Other SQL Transformations
Inserting Rows into a Target Table
Using the SQL Set Operators Transformation
Enabling Explicit Pass-Through Processing for Other SQL Transformations
Working with Iterative Jobs and Parallel Processing
About Iterative Jobs
Creating and Running an Iterative Job
Creating a Parameterized Job
Creating a Control Table
About Parallel Processing
Setting Options for Parallel Processing
Working with Slowly Changing Dimensions
About Slowly Changing Dimensions
About Dimension Tables
About Fact Tables
Usage Notes for Slowly Changing Dimensions
Loading a Dimension Table with Type 1 Updates
Loading a Dimension Table with Type 1 and 2 Updates
Comparing Tables
Loading a Fact Table Using Dimension Table Lookup
Loading a Table and Adding a Surrogate Primary Key
Tracking Changes in Source Datetime Values
Closing Out Rows in Datetime Change Tracking
Working with Change Data Capture
About the Change Data Capture Transformations
About CDC Changed Data Tables
About CDC Control Tables
Capture Changed Data from Oracle
Working with Message Queues
About Message Queues
Prerequisites for Message Queues
Selecting Message Queue Transformations
Processing a WebSphere Queue
Polling a WebSphere Message Queue
Processing a Microsoft Queue
Working with SPD Server Cluster Tables
About SPD Server Cluster Tables
Creating an SPD Server Cluster Table
Maintaining an SPD Server Cluster
Working with Hadoop and SAS LASR Analytic Server
Overview of the Hadoop Transformations
Prerequisites for the Hadoop Transformations
Creating a Pig Job
Creating a Hive Job
Creating a Hadoop Container Job
Monitoring Hadoop Clusters
Overview of the High-Performance Analytics Transformations
Prerequisites for the High-Performance Analytics Transformations
Loading a Table on the SAS LASR Analytic Server
Usage Notes for HPA Software and Hadoop
Appendixes
Main Windows and Wizards
Analysis Window
Checkouts Tree
Code Editor
Comparison Results Window
Connection Profile Window
Desktop
Details Pane
Expression Builder
Folders Tree
Inventory Tree
Job Editor
Properties Windows
Reports Window
Tools-Options Window
Tree View
View Data Windows
Wizards
Usage Notes
General Usage Notes
Usage Notes for Register Tables Wizards and the New Table Wizard
Usage Notes for the View Data Window
Usage Notes for Iterative Jobs
Prerequisites for Running a Job When a DataFlux Server Is Used for Authentication
Usage Notes for Loaders
Miscellaneous Transformations
Using a Business Rule Flow in a Job
Creating a Table That Appends Two or More Source Tables
Creating a Publish to Archive Report from Table Data
Validating Product Data
Creating a Publish to Email Report from Table Data
Integrating a SAS Enterprise Miner Model with Existing SAS Data
Creating a Publish to Queue Report from Table Data
Extracting Data from a Source Table
Creating Reports from Table Data
Create a Table That Ranks the Contents of a Source
Create Two Tables That Are Subsets of a Source
Moving Data Directly from One Machine to Another Machine
Creating Standardized Statistics from Table Data
Creating Transposed Data from Table Data
Converting a SAS or DBMS Table to an XML Table
Using ODS to Specify Output from the XML Writer
Using SOAP to Access a Third-Party Web Service
Using REST to Access a Third-Party Web Service
Generating Enterprise Decision Management Output
Running Conditional Processes
Java Code and Methods for Report Plug-ins
Example Java Code for a Report Plug-in
Reporting Interface Methods
Glossary
Product: SAS Data Integration Studio
Release: 4.7_M2, 4.7_M1, 4.7, 4.6, 4.4_M1, 4.4, 4.3_M1, 4.3, 4.21_M2, 4.21_M1, 4.21, 4.2, 3.4, 3.3
Type: Usage and Reference
Copyright Date: August 2014
Last Updated: 05Aug2014