Profile Data

Introduction

Profile Data icon in the SAS Data Loader window
Use the Profile Data directive to generate profile reports for one or more tables. You can select a subset of the columns that you want to include in the profile report. The Profiles panel of the Configuration window enables you to change the default behavior of new profiles in order to improve performance. For example, you can limit the number of parallel processes that are used in new profile jobs. For more information, see Profiles Panel.

Create a Profile

To create a profile:
  1. On the SAS Data Loader directives page, click the Profile Data directive. The Source Table task is displayed:
    shows data sources.
  2. Click a data source to display its tables:
    shows selected tables inside the data source.
  3. Select the table or tables for the profile report.
    If a profile already exists for a table, PROFILED appears beneath the table name. You can view the existing profile by selecting the table and clicking View Profile.
    The Select menu (Select Menu icon) provides several options to make selecting tables easier:
    • Select All New Tables: Automatically selects all new tables in the current data source.
    • Select Recent Table: Enables you to choose from a list of recently used tables. If you select a table from a different data source, the source table information is adjusted accordingly.
    • Deselect All Tables: Deselects all tables that you have selected in the current data source.
    Tip
    To view sample data from a table, select the table, and then click data sample icon in the Source Table header to display the SAS Table Viewer.
    Click Next. The Columns task is displayed:
    select columns from source tables for the profile report
  4. The Columns task displays the total number of columns that are to be processed in the profile report. If you selected more than one table for your report, the tables are listed by name. Click expand button next to the tables to display the columns that are included in the profile report.
  5. The column names in the Selected columns pane appear in the report. Select an individual column name and click de-select all columns icon or select column icon to move the column name between the Available columns pane and the Selected columns pane until the correct list of names appears in the Selected columns pane. Click de-select column icon or select all columns icon to move all column names at once.
    When the column selection is complete, click Next. The Report task is displayed:
    specify profile name and location
  6. In the Report task, enter a name for the profile report in the Report name field.
    If you selected multiple tables and want a separate report for each table, click Create a separate report for each table profiled.
    Click Next to display the Result task:
    create profile report
  7. Click Create Profile Report. After successfully creating any profile reports, a screen similar to the following is displayed:
    display profile report creation success
    The following actions are available:
    View Profile Report
    enables you to view the Profile Report. See Saved Profile Reports for more information about the profile report.
    Log
    displays the SAS log that is generated during the creation of the profile.
    Code
    displays the SAS code that generates the profile.

Usage Notes

Hive tables have a maximum table name length of 132 characters. Many of the SAS Data Loader directives can create tables with names that exceed the SAS table name length limit of 32 characters. The tables that you submit for profiling in the Profile Data directive must conform to the 32-character name length limit. Table names that exceed 32 characters generate error messages.