Removing Historical Data from the Job Statistics Library

Steps for Removing Historical Data from the Job Statistics Library

You might want to occasionally remove historical data from your SAS Data Integration Studio job statistics data sets. Otherwise, data will continue to be added to the data sets indefinitely. To remove historical data, follow these steps:
  1. First, you might want to copy the current contents of the job statistics library (for example, C:\SASDIStudioJobStats) to an archive location. This step ensures that the historical data will be available if it is needed in the future.
  2. Open one of the following scripts in a text editor:
    • On Windows: DIStudioJobStatsDataClean.bat
    • On UNIX: DIStudioJobStatsDataClean.sh
    The scripts are located in SASDIStudioJobReports/batch on the batch server host machine.
  3. Edit the options in the script to specify the data that you want to remove from the detailed data sets. Refer to Syntax for the CleanHistoricalDataApp Script and Syntax Description for the CleanHistoricalDataApp Script.
  4. Run the updated script.
  5. After the script has run successfully, execute joblogsFlow.bat or joblogsFlow.sh as described in Step 3: Run the Second Parsing Program to Create Summary Data Sets. This script creates new summary data sets from the detailed data.
    When you view the job statistics reports, the reports will reflect the removal of the historical data.

Syntax for the CleanHistoricalDataApp Script

CleanHistoricalDataApp <-data> <-log> <-duration YEAR|MONTH|DAY number>
<-timescope FromDate(MM/dd/yyyy) ToDate(MM/dd/yyyy)>

Syntax Description for the CleanHistoricalDataApp Script

-data
specifies that data is to be removed from the tables that contain data from logs.
-log
specifies that data is to be removed from the tables that contain log filenames.
-duration YEAR|MONTH|DAY number
specifies that data is to be removed if it is older than the specified number of years, months, or days.
-timescope FromDate(MM/dd/yyyy) ToDate(MM/dd/yyyy)
specifies that data that was collected during the specified period of time is to be removed.