DataFlux Data Management Studio 2.7: User Guide
Options Dialog
You can use the Options dialog to configure options for the profile Properties tab. The dialog contains the following tabs:
General
Use the values on the General tab to set general parameters for profiles. The main section of the tab contains the following elements:
- Count all rows for frequency distribution; Maximum number of values to count for frequency distribution - When Count all rows is selected, every row is counted in the frequency distribution. When it is deselected, you can specify the maximum number of rows to consider in frequency distributions.
- Count all rows for pattern distribution; Maximum number of values to count for pattern distribution - When Count all rows is selected, every row is counted in the pattern distribution. When it is deselected, you can specify the maximum number of rows to consider in pattern distributions.
- Frequency Distribution memory cache size (MB) - Specifies the amount of memory to use during frequency distribution.
- Frequency Distribution memory size per table column (kB) - Specifies the amount of memory used per table column.
- Trim leading/trailing spaces from values when calculating frequency distribution and inferred metadata - When selected, specifies that leading and trailing spaces are removed from each field before frequency distributions and inferred metadata are calculated.
- Trim leading/trailing spaces from values when calculating pattern frequency distribution - When selected, specifies that leading and trailing spaces are removed from each field for pattern distributions.
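The effect of the row limit and trim options can be illustrated with a short Python sketch. This is not how Data Management Studio computes distributions internally. The function names, the `max_rows` and `trim` parameters, and the character-class mapping in `to_pattern` (9 for digits, A and a for letters) are assumptions made for illustration only.

```python
from collections import Counter
from itertools import islice


def to_pattern(value):
    """Map each character to a class symbol (hypothetical mapping)."""
    out = []
    for ch in value:
        if ch.isdigit():
            out.append("9")
        elif ch.isalpha():
            out.append("A" if ch.isupper() else "a")
        else:
            out.append(ch)  # spaces and punctuation are kept as-is
    return "".join(out)


def distribution(values, max_rows=None, trim=False, as_pattern=False):
    """Count values (or their patterns).

    max_rows=None plays the role of "Count all rows"; an integer
    limits how many rows are considered. trim=True removes leading
    and trailing spaces before counting.
    """
    rows = values if max_rows is None else islice(values, max_rows)
    counts = Counter()
    for value in rows:
        text = str(value)
        if trim:
            text = text.strip(" ")
        counts[to_pattern(text) if as_pattern else text] += 1
    return counts


zips = ["27513", " 27513", "27513 ", "2751A"]
print(distribution(zips))                             # untrimmed values stay distinct
print(distribution(zips, trim=True))                  # trimmed values collapse together
print(distribution(zips, trim=True, as_pattern=True)) # pattern counts
print(distribution(zips, max_rows=2, trim=True))      # only the first two rows
```

In this sketch, leaving trim off causes values that differ only by surrounding spaces to be counted as separate entries, which is the situation the trim options are meant to avoid.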
The Rule generation section of the tab contains the following elements:
- Number of rows to sample - Specifies the number of rows to sample in a table that is selected for automatic rule generation.
- Enable rule generation for a table when it is selected - Specifies that automatic rule generation is run for a selected table.
The Commit section of the tab contains the following elements:
- Commit every row - When selected, specifies that changes are committed as each row is written to the source table.
- Commit every [X] rows - When selected, specifies that changes are committed each time this number of rows has been written.
- Commit all rows in a single transaction - When selected, specifies that changes are committed after all rows have been written.
Note that you can now run multiple profiles in the same repository if the appropriate commit options are set for the profiles. If the commit option is set to the default value, Commit all rows in a single transaction, then no other client or server that accesses the same repository can execute a profile until the first profile is finished. If the commit option is set to Commit every row or Commit every [X] rows, then you can run multiple profiles in the same repository simultaneously.
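The difference between the three commit options can be sketched with Python's built-in `sqlite3` module. This is a generic illustration of commit batching, not the repository code used by Data Management Studio. The `results` table, the `write_rows` function, and the `commit_every` parameter are hypothetical names.

```python
import sqlite3


def write_rows(conn, rows, commit_every=None):
    """Insert rows and return the number of commits issued.

    commit_every=1    -> behaves like "Commit every row"
    commit_every=N    -> behaves like "Commit every [X] rows"
    commit_every=None -> behaves like "Commit all rows in a single transaction"
    """
    commits = 0
    pending = 0
    for row in rows:
        conn.execute("INSERT INTO results (col, value) VALUES (?, ?)", row)
        pending += 1
        if commit_every is not None and pending >= commit_every:
            conn.commit()  # releases the transaction so other writers can proceed
            commits += 1
            pending = 0
    if pending:
        conn.commit()  # commit whatever is left (or everything, if unbatched)
        commits += 1
    return commits


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE results (col TEXT, value TEXT)")
rows = [("name", f"v{i}") for i in range(5)]

print(write_rows(conn, rows, commit_every=1))     # one commit per row
print(write_rows(conn, rows, commit_every=2))     # commits after rows 2 and 4, plus the remainder
print(write_rows(conn, rows, commit_every=None))  # a single commit at the end
```

With a single transaction, the writer holds its transaction open until the last row is written, which in a shared database generally makes other writers wait. Committing per row or per batch shortens that window at the cost of more commits.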
See also the usage notes and performance tips in Jobs, Profiles, Data Explorations.
Charts
You can use the Charts tab to set the height and width of the hard copies of charts that are generated during visualization.
Quality Knowledge Base
You can use the Quality Knowledge Base tab to specify the locale in your Quality Knowledge Base that is most appropriate for your data. You can also enable data quality pattern analysis and select a definition.
Documentation Feedback: yourturn@sas.com
Note: Always include the Doc ID when providing documentation feedback.
Doc ID: dfU_Profile_Options.html