This course introduces DataFlux Data Management Studio and includes topics for data explorations and profiling, data jobs to perform data management tasks (such as data quality and entity resolution), data monitoring, usage of DataFlux Expression Engine Language, custom metrics, macro variables, and process jobs.
This course can help prepare you for the following certification exam(s): SAS Data Quality using DataFlux Data Management Studio.
Learn how to
- Understand data explorations.
- Create and review data profiles.
- Create data jobs to improve data quality.
- Create data jobs to perform entity resolution.
- Establish monitoring aspects for your data.
- Work with the DataFlux Expression Engine Language.
- Create and use custom metrics.
- Define and use macro variables.
- Create process jobs.
There currently are no prerequisites for this course.
This course addresses DataFlux Data Management Studio, DataFlux Data Management Server software.
Introduction and Course Flow- Introduction to DataFlux Data Management applications.
- Course flow.
DataFlux Data Management Studio: Getting Started- Introduction.
- Reviewing quality knowledge base and reference sources.
- Establish data connections.
Working through the PLAN Phase of the DataFlux Methodology- Reviewing data explorations.
- Creating and reviewing data profiles.
- Designing data standardization schemes.
ACT: Introduction to Data Jobs- Introduction to data jobs.
- Setting options for data jobs.
- Creating a simple data job.
ACT: Quality- Investigating standardization.
- Working with a Field Layout node.
- Investigating parsing and casing.
- Investigating right fielding and identification analysis.
- Working with Branch and Data Validation nodes.
- Investigating gender analysis.
ACT: Entity Resolution- Creating match codes.
- Clustering records.
- Adding survivorship to the entity resolution job.
- Adding field-level rules for the surviving record.
Working through the MONITOR Phase of the DataFlux Methodology- Defining business rules.
- Data profiling with business rules and alerts.
- Processing business rules in data jobs.
- Data jobs with business rules.
- Data jobs with monitoring tasks.
Staging- Overview of remaining course topics.
- Clean data, new repositories, default macro variables.
DataFlux Expression Engine Language- Introduction and overview of DataFlux Expression Engine Language (EEL).
- Creating dynamic fields for a profile using EEL.
Expression Node in Data Jobs- Working with the Expression node.
- Reviewing the IF/ELSE statement.
- Reviewing return status
Custom Metrics- Reviewing and working with custom metrics in a data profile.
- Using custom metrics in a business rule.
Parameterization with Macros- Creating a macro file.
- Using macros in a data profile.
- Using macros in a data job.
Essentials of Process Jobs- Introduction to process jobs.
- Examining source bindings in a simple process job.
Creating Advanced Process Jobs- Working with conditional processing.
- Working with work tables and events.
Tips, Tricks, and Other Topics- How data is processed in a data job.
- Job optimization.
- Tips for building and testing jobs.
- Working with the Data Management Server.
- Promote to production.