There is a new version of this course. Please see DataFlux Data Management Studio: Fast Track.
This course covers most of the content of both DataFlux Data Management Studio: Essentials and DataFlux Data Management Studio: Advanced. It introduces DataFlux Data Management Studio and builds on that knowledge, covering data explorations and profiling, data jobs that perform data management tasks (such as data quality and entity resolution), data monitoring, use of the DataFlux Expression Engine Language, custom metrics, macro variables, and process jobs.
Learn how to
- create and review data explorations
- create and review data profiles
- create data jobs for data improvement
- establish monitoring aspects for your data
- work with the DataFlux Expression Engine Language
- create and use custom metrics
- define and use macro variables
- create process jobs.
Formats available: Classroom
Standard duration: 5.0 days (duration can vary; see event schedule for details)
There are currently no prerequisites for this course.
This course addresses DataFlux Data Management Studio and DataFlux Data Management Server software.
Introduction and Course Flow
DataFlux Data Management Studio: Getting Started
- introduction
- quality knowledge base and reference sources
- data connections
Working Through the PLAN Phase of the DataFlux Methodology
- creating data collections
- designing data explorations
- creating data profiles
- designing data standardization schemes
Working Through the ACT Phase of the DataFlux Methodology
- introduction to data jobs
- data quality jobs
- data enrichment jobs (self-study)
- entity resolution jobs
Working Through the MONITOR Phase of the DataFlux Methodology
- defining business rules
- data profiling with business rules and alerts
- data jobs with business rules
- data jobs with monitoring tasks
Additional Topics
- multi-input/multi-output data jobs
- using data job references within a data job
- introduction to DataFlux Data Management Server
Staging
- overview of remaining course topics
- clean data, new repositories, default macro variables
DataFlux Expression Engine Language
- introduction to DataFlux Expression Engine Language (EEL)
- data profiling using EEL
- working with the expression node: IF/ELSE statements
- working with the expression node: RETURN statement
- working with the expression node: pushrow function (self-study)
- working with the expression node: grouping functionality (self-study)
Custom Metrics, Macros, and More
- working with custom metrics
- working with macro variables
- using multiple locales in a data job (self-study)
Process Jobs
- introduction to process jobs
- using variables in process jobs
- conditional processing, work tables, and events
- parallel processing in process jobs (self-study)
Tips, Tricks, and Other Topics
- how data is processed in a data job
- job optimization
- tips for building and testing jobs
- deploying jobs
- promoting jobs to production
- case study (self-study)
Case Studies
- case study for "essentials"
- case study for "advanced"