Saved Profile Reports

Introduction

Saved Profile Reports icon in the SAS Data Loader window
Use the Saved Profile Reports directive to view the results of previously executed data profiles and to create notes about the results. The profiles are created with the Profile Data directive. The profile reports and notes are stored as XML documents on the file system. Saved Profile Reports displays these XML files in a readable format.

About Profile Reports

Profile reports can provide valuable information about a Hadoop table and help identify issues that might exist before you use the table for data management or analysis. A profile report includes a summary view with information about the table that was profiled and detail views with information about individual columns in the table.

Summary View

The summary view of a profile report includes the following information:
Count
the total number of rows in the table that was profiled.
Data Quality Metrics
measurements of data quality for the columns in the table. Measurements include information about the uniqueness of column values, pattern analysis results, and completeness information, including null or blank values.
Note: The measurement of percent null (Null (%)) is rounded to the nearest tenth of a percent. Percentages of null values that are smaller than 0.01 are rounded to zero. Refer to the number of null values (Null (n)) as needed.
Descriptive Measures
descriptive statistics for columns in the table, including information about the central tendency of the data and how it is dispersed. Depending on the data type of the column, these measures might not be available.
Metadata Measures
metadata for the columns in the table, including the data type, the column length, and whether the column is a primary key candidate.
Charts
summary graphics that provide information about the uniqueness and incompleteness of column values.
Screen capture of uniqueness and incompleteness graphics in Charts section of profile report

Column Detail Views

When you click on a column from the summary view in a profile report, another view is displayed that provides more detailed information about the selected column.
The detail view of a profile report includes the following information:
Count
the total number of rows in the table that was profiled.
Standard Metrics
a combined listing of the data quality metrics, the descriptive measures, and the metadata measures for the column that were displayed on the summary view.
Frequency Distribution
a listing of the unique values for the column, including information about how frequently a value occurs in the table. When you select a value from the list, the associated section of the pie chart is highlighted.
Screen capture of frequency distribution information for a column, with highlighted value and pie chart section.
Pattern Distribution
a listing of the distinct pattern values that were derived from performing pattern analysis on the values for the column. The content of the pattern value describes the content of the data and indicates whether each character is uppercase, lowercase, or numeric. When you select a value from the list, the associated section of the pie chart is highlighted.
Screen capture of pattern distribution information for a column, with highlighted value and pie chart section.
Outliers
a listing of extreme values for the column. By default, the 10 lowest values and the 10 highest values are saved, but you can change the number of outliers that are saved in the profile configuration settings. For more information, see Profiles Panel.
Data Type Analysis
a listing of possible types of data for the information in the column, as determined by data type analysis that is automatically performed by SAS Data Loader. Results for data type analysis are available only for columns that contain string characters (for example, contact information such as name, address, state, ZIP code, and so on).
Screen capture of data type analysis information for a column, with highlighted value and pie chart section.

Open Saved Profile Reports

To open a saved profile report:
  1. In the SAS Data Loader directives page, click the Saved Profile Reports directive to open a new browser tab. The Select a Profile Report page is displayed on the new tab:
    profile report list
    Note: Any profile job that runs longer than five days is deleted from the Select a Profile Report page.
  2. You can filter the list of reports using the following methods:
    • Click calendar icon and select a date. This filter displays profile reports that were generated on or after the selected date.
    • Enter a text string into the search field.
    • Click close filter to remove the filter and restore the full list.
  3. To delete profile reports, select one or more reports and click delete profile .
  4. To open a profile report, click its name.
    • If the report contains a single table, the table opens directly in the detail view shown in Step 6.
    • If the report contains multiple tables, the table opens in an overview:
      table overview
  5. You can click a table to go directly to a more detailed view or you can click show outline icon to open the outline view:
    table overview with outline
    The following actions are available:
    Go to Profile Report List
    returns you to the Profile Report List.
    Show or Hide Outline
    displays or hides the outline in the left pane.
    Show or Hide Trends
    displays or hides the trend graphs for data that is presented in the summary view. You can use trend graphs to quickly visualize changes in the data across multiple versions of the same report. When trend graphs are not displayed, the current value of the metric is shown. For example:
    Screen capture of data values in summary view, with trend graphs off.
    When trend graphs are on, each graph displays the 10 most recent values of a metric, as determined by the selected version of the report. For example:
    Screen capture of data values in summary view, with trend graphs on.
    To view the complete list of values for the metric, you can click the trend graph. A window is displayed:
    Screen capture of the pop-up window showing the complete list of values for the metric.
    Show or Hide Notes
    displays or hides notes in the right pane. You can filter the notes by entering a text string into the filter field.
    Add Note
    opens a dialog box in which you can add a note.
    Report Version
    enables you to select the version of the report by date.
  6. Select a table in the Overview pane or click directly on the table icon to display detailed information in the right pane. The Data Quality Metrics are displayed by default.
    Data Quality Metrics Display
  7. Click expand button next to a table name to display columns. Select a column to display detailed column information in the right pane:
    column detail standard metrics
  8. Click expand button in the gray header bars to display the metrics in those sections. For example, clicking on Frequency Distribution icon displays the following metrics.
    column detail frequency distribution
    Clicking links in the detail view opens SAS Table Viewer.