Overview of SAS/Warehouse Administrator |
Many different approaches can be taken when building a data warehouse. Here are some general guidelines:
plan from the top down -- identify your business goals, then identify the outputs, sources, and data stores that are required to support these goals
implement from the bottom up -- from source data, to detail data stores in a subject area, to summary data stores in a subject area
define, load, and test warehouse data sources and data stores as you go.
Plan Your Data Warehouse |
Identify the subject areas to be included in your data warehouse (such as Sales, Products, Customers, and so on).
Choose a subject area to be developed. The first subject area should be important enough to illustrate the value of your project, but simple enough so that it can be developed quickly and successfully.
Draft the reports and other outputs that you expect to get from the chosen subject area.
Identify data sources required to produce the reports and other outputs that you expect to get from the chosen subject area.
Identify the main column mappings and data transformations between sources and targets in the chosen subject area.
Select the hardware and software required to access the data sources, execute the transformations, and store the refined data in the appropriate formats and locations.
Choose the SAS/Warehouse Administrator data stores that are appropriate for the chosen subject area.
Choose the appropriate exploitation tool(s) for the chosen subject area (such as SAS/EIS for multidimensional analysis; Enterprise Miner for data mining, and so on).
A detailed explanation of how to create a project plan for your data warehouse is beyond the scope of this document. However, the following chapters will help you with the SAS/Warehouse Administrator portion of such a plan:
Create a Data Warehouse Environment |
Create a directory structure for the Data Warehouse Environment.
If necessary, update the SAS configuration file and the SAS autoexec file that is used during the SAS/Warehouse Administrator session so that any required options or librefs are available.
For details about the tasks in this section, see the following chapters:
If you need to convert a Data Warehouse Environment created with an earlier version of SAS/Warehouse Administrator, see Converting Metadata for Environments and Warehouses.
Create a Data Warehouse |
Create a Subject within the Warehouse. Name it after a subject area identified in your project plan.
If required by your project plan, create appropriate summary data stores within the Subject.
If required by your project plan, create appropriate Information Marts in the Subject.
For details about the tasks in this section, see the following chapters:
Maintain a Data Warehouse |
Here are some typical maintenance tasks for a data warehouse.
Schedule Jobs to load or refresh your data warehouse. For details, see Scheduling Jobs.
Use the copy feature in the SAS/Warehouse Administrator Explorer to copy a given group or data store and paste the copy under a valid parent. Use this method to model a new group or data store after an old one rather than creating a completely new object. For details about the copy feature, display the Explorer. From the menu bar, select Help, then Using This Window, then Explorer Pop-Up Menus.
Use the Publish HTML add-in to document the metadata for a given object that is selected in the SAS/Warehouse Administrator Explorer. For details about add-ins, see Customizing SAS/Warehouse Administrator.
Use the Impact Analysis add-in to trace the impact of a proposed change to a data store. For example, if you wanted to change the name of a column in an ODD, you could find out how many data stores in a Warehouse Environment include that column. For details about add-ins, see Customizing SAS/Warehouse Administrator.
Use the appropriate software to back up the data and metadata for each Warehouse Environment. For details about the metadata repositories for an Environment, see Overview: Metadata Repositories.
Use the Metadata Copy Wizard to copy a Warehouse Environment to a new location. You can use this feature to model a new Environment after an old one rather than creating a completely new Environment. For details, see Metadata Copy Wizard.
Exploit a Data Warehouse |
Generally, SAS/Warehouse Administrator is used to build a data warehouse, not exploit it. Its main role is to create and manage warehouse data stores that are then accessed by other applications. In some cases, however, you can use SAS/Warehouse Administrator to prepare data or metadata for exploitation.
For example, here is one way to use SAS/EIS to analyze a summary data store.
Use SAS/Warehouse Administrator to export metadata from the relevant summary data store to a SAS/EIS metabase. For details about this task, see Example: Exporting Metadata to SAS/EIS Software.
Another example is to surface a copy of the metadata for a given Data Warehouse. This method could be used to support business analysts who need to answer such questions as What are the subject areas in this Data Warehouse? What data stores are available for analysis? What is the origin of the data in this column? What does this column mean? How is it derived?
Export metadata from the Data Warehouse. For details, see Exporting Metadata for Groups and Data Stores.
Download the MetaSpace Explorer from the SAS Web site, as described in MetaSpace Explorer.
Install and configure the MetaSpace Explorer to enable access to the exported metadata. For details about this task, see the documentation for the MetaSpace Explorer.
Copyright © 2012 by SAS Institute Inc., Cary, NC, USA. All rights reserved.