An Access Control Entry (ACE) is an entry of user information made to the Access Control Lists (ACLs) which is used to secure access to individual DataFlux Data Management Server objects.
Access Control Lists (ACLs) are used to secure access to individual DataFlux Data Management Server objects.
Address verification (validation) is the process of comparing a physical address to a reference database of known physical addresses so the original address can be standardized and corrected according to postal authority standards.
Analyze, Improve, Control (AIC) - DataFlux enables organizations to analyze, improve, and control their data from a single data quality integration platform. DataFlux tools and approaches can help you build a comprehensive set of business rules that can create a unified view of your enterprise data and enhance the effectiveness of CDI, CRM, ERP, legacy data migration, or compliance initiatives.
Address Matching Approval System (AMAS) is the program the Australia Post administers to certify address verification software.
Application Programming Interface (API) is a set of software protocols, routines, and/or tools used when building software applications.
Army/Air Force post office (APO) is an indication for the USPS.
ASCII (American Standard Code for Information Interchange) is a character set based on the English alphabet
A basic category is a category that represents a single word. Basic categories are the basic building blocks of Grammar rules. Every basic category in a Grammar corresponds to a category in an ordered word list. For this reason, you should design Grammar rules in parallel with word-analysis logic.
The application of data management routines to data source records in what are often very large groups, usually in processes that require no manual user intervention. Contrast with real-time processing.
These are expressions which are written in a generic manner so they can be reused from multiple rules or applications.
A conditional statement that tells a system running a business process how to react to a particular situation.
A set of logic used to accurately change the case of an input value, accounting for unique values that need to be case sensitive, such as abbreviations and business names.
Coding Accuracy Support System (CASS) is the program the United States Postal Service (USPS) administers to certify address verification software.
Census Bureau Statistical Areas (CBSA)
Cross-Environment Data Access (CEDA)
The census string is a US Census Bureau designation for the boundary area in which the centroid exists. The census string contains state, county, and other census-type information.
A centroid is the approximate mathematical center of the ZIP or ZIP+4 boundary.
These are built-in checks (expressions) that provide a template to the user to build common standard expressions.
A proprietary file type used by DataFlux as a lex table to separate characters in a subject value into more usable segments.
US Commercial Mail Receiving Agency (CMRA)
Consolidated Metropolitan Statistical Areas (CMSA)
Comments are text within a code segment that is not executed. Comments can be either C-style (starts with /* and ends with */) or C++ style (starts with // and continues to the end of a line).
Default logic to handle data such as name and address, which inform the identity management process.
Canadian Post Certification (CPC) is the SERP program administered by the Canadian Post. This is similar to the CASS certification administered by the USPS.
Customer Relationship Management (CRM)
Custom metrics may be used when the standard metrics do not contain the rules you need to accomplish the desired results.
The dashboard is a Web-based view of the task grid and graphs in the Monitor Viewer.
A discovery process that uncovers potential problem areas in large amounts of structured data.
Not used in the sense of a database data type ("varchar" for instance) but used to describe sets of data values that follow certain rules and conventions. "Name" and "Address" are two examples of data types.
A collection of tables containing data that can be accessed easily by a computer system.
An algorithm available to a DataFlux application.
A derived category is a category composed of one or more other categories. The makeup of a derived category is described using rules.
dfIntelliServer provides a real-time or transactional mechanism for communicating with the MCRD through the Architect API. dfIntelliServer has several client libraries (including a Web services client) that can be called from a number of different applications in many different computing environments. dfIntelliServer allows one at a time queries and modifications to the MCRD. dfIntelliServer allows organizations to access Architect jobs through an API that can accept one group of data elements at a time rather than a complete table. This functionality takes advantage of the power of encapsulation of discreet chunks of work in Architect, so a programmer need only make one call to the client API to perform a related set of activities.
Delivery Point Validation (DPV) specifies if the given address is a confirmed delivery point as opposed to being within a valid range of house numbers on the street.
Expression Engine Language (EEL)
Enterprise Resource Planning (ERP)
Extraction, Transformation, and Loading
An event represents an action which should be taken when a rule fails. Actions can include sending email messages, storing the offending row in the repository, or executing an external process.
This is the DataFlux syntax used in the Business Rule Manager to build business rules.
Also known as a "variable" or a "column," a single piece of data in a database table. Database tables can have many fields. The user defines the fields. Each field has a unique identifier in the repository. From a data monitoring standpoint, the fields are not tied to any specific database or table but are bound at the time of execution to the current data set or row.
A field set is a collection of fields that belong together. These usually represent a table of data and are used to aid in building rules and viewing results.
Federal Information Processing Standards (FIPS) - A 5-digit number assigned to each county in the U.S. by the Census Bureau. The first 2 digits are the state code, and the last 3 digits are the county number.
Fleet post office (FPO) indication for USPS used for military personnel.
An algorithm that can determine the gender of persons by their names.
A set of logic used to determine the probable gender of a name or identity-type input string.
A proprietary file type used to store hierarchical patterns pertinent to a specific subject area.
A group rule evaluates and applies all rules to groups of data (for example, data grouped by state and the rules evaluated for each state).
A historical metric is available when a business rule is run a second time under the same report name. You can view and compare the last two reports.
An algorithm that can determine from a known set of options what type of data is represented by a particular subject value.
A set of logic used to identify an input string as a member of a redefined or user-defined value group or category.
Input fields are the fields where you apply the checks specified in the Rule Manager. This list includes all the fields you have defined in the Business Rule Manager, including the Output fields from custom metrics and any grouped by field.
The saved configuration settings for a particular task in a Data Management Studio application. You can run jobs interactively or combine them with other jobs and schedule the set of jobs to run on a particular date or time.
US Locatable Address Conversion Service (LACS) is a product/system in a different USPS product line that allows mailers to identify and convert a rural route address to a "city-style" address.
The country of origin based on an address or country code.
A process that attempts to identify the country of origin of a particular piece of data based on an address, country code, or other field.
MDM focuses on managing master data that is shared by several different systems and groups.
The process of identifying data strings that can be different representations of the same semantic information. For example, the strings Mr. Bob Brauer, Robert J., and Brauer can be considered to match each other.
A set of records grouped together based on some commonality. Cluster IDs are numeric values used to refer to these clusters. You can append cluster IDs to records in a database to document matches.
The end result of passing data through a match definition. A normalized, encrypted string that represents portions of a data string that are considered to be significant with regard to the semantic identity of the data. Two data strings are said to "match" if the same match code is generated for each.
A set of logic used to generate a match code for a data string of a specific data type.
A string representing the value of a single token after match processing.
Minor Civil Division (MCD)
Master Data Management (MDM) focuses on master data shared by several different systems and groups.
The process of joining records and eliminating duplicate records from a table based on user-specified conditions and rules.
Information that describes the properties of data, for example when was last accessed or the size of the data value.
This term is used in US Census data and refers to a population area including a city with 10,000 to 50,000 residents and surrounding areas.
Metropolitan Statistical Areas (MSA) - The MSA code assigned by the Office of Management and Budget. Use this code as an index key in the MSA file.
A namespace is a unique container created to hold a logical grouping of identifiers.
An object is anything that can be stored in the Data Management Studio Navigator and accessed by the Data Management Studio applications.
Objects are individual jobs and services.
Open Database Connectivity (ODBC) - an open standard application programming interface (API) for accessing databases.
Office of Foreign Assets Control (OFAC) - Federal regulations related to the Patriot Act.
Online Analytical Processing (OLAP)
A company, university, or other type of institution. For example: IBM Corporation, University of Connecticut, or St. Joseph's Hospital
The output field is the field(s) used to apply the rule in the custom metric. Set your output field to serve as the field where the results from your custom metric are collected.
The process of dividing a data string into a set of token values. For example: Mr. Bob Brauer, Mr. = Prefix, Bob = Given, Brauer = Family
A name for a context-specific parsing algorithm. A parse definition determines the names and contents of the sub-strings that will hold the results of a parse operation.
A regular expression library that forms the basis of a pattern recognition algorithm.
An algorithm applied to a data string to reduce it to a value that will match other data strings with similar pronunciations.
A private mailbox (PMB) is categorized as a mailbox located at a mail center other than the post office or home.
Principal Metropolitan Statistical Areas (PMSA)
Primary key is a unique identifier assigned to a database field. Social Security Numbers or a ISBNs are examples of possible primary keys.
QuickAddress Software (QAS)
The Quality Knowledge Base (QKB) is a collection of files and configuration settings that contain all DataFlux data management algorithms. The QKB is directly editable using Data Management Studio.
Quality Master Data Management (qMDM)
The Quality Knowledge Base (QKB) locales contain the files, file relationships, and metadata needed to correctly parse, match, standardize, and otherwise process data.
Relational Database Management System (RDBMS) allows you to access data in a database in unique ways, such as adding tables and records, and joining tables.
Residential Delivery Indicator (RDI)
Processing a record or data one piece at a time as it enters a computer system, for financial transactions, for example. Contrast with batch processing.
Also called a "row" or "observation," one complete set of fields in a database table.
A mini-language composed of symbols and operators that enables you to express how a computer application should search for a specified pattern in text.ï¿½A pattern may then be replaced with another pattern, also described using the regular expression language.
A Data Management Studio repository is a hierarchical data storage mechanism.
A row rule evaluates every row of data passed into the Monitoring node.
Software Evaluation and Recognition Program is a program the Canada Post administers to certify address verification software.
A single rule can be either a row level rule or a data set level rule. A row level rule is applied to each row which enters the system while a data set level rule is applied to an entire data set or a portion of a data set.
A rule set is a set of one or more rules which are applied together as a group. Use a rule set when you find you are using a few rules together frequently.
Software Development Kit (SDK)
Regarding matching procedures, sensitivity refers to the relative tightness or looseness of the expected match results. A higher sensitivity indicates you want the values in your match results to be very similar to each other. A lower sensitivity setting indicates that you would like the match results to be "fuzzier" in nature.
The Software Evaluation and Recognition Program (SERP) is a program the Canadian Post administers to certify address verification software.
Service Oriented Architecture (SOA) - All of the interaction with the master customer reference database is through a service-oriented architecture that enables any system to talk to the customer database and request or update information.
A set rule evaluates and applies rules to all of the input data completely (for example, it will evaluate all 1000 rows of data as a set).
Structured Query Language (SQL) is a language used to request information from database systems.
Standard metrics are pre-defined rules (expressions) set in Data Management Studio. Most of the time, this is enough to achieve the results for your job.
A set of logic used to standardize a string.
A collection of transformation rules that typically apply to one subject area, like company name standardization or province code standardization.
The process of transforming a data string so each of the string's token values conforms to a preferred standard representation: IBM Corporation = IBM CORP; Mister Bob Brauer, Junior = MR BOB BRAUER JR.
Statement of Accuracy (SoA) is the form used for Canadian Post Certification (CPC) standards.
A table is a collection of records in a database.
Tasks contain the rules and the events that go with your individual rule. Tasks associate alert events with a rule that are triggered after a rule fails.
Used by DataFlux to designate the output strings of a parse process. The output string of a parse process. A word or atomic group of words with semantic meaning in a data string. A set of expected tokens is defined for each data type.
An industry standard used to allow text and symbols from languages around the world.
This is the version of the repository you are using. The term "unified" means the repository contains data for Data Management Studio Profile reports, Business Rules, and Data Monitoring results.
Uniform Resource Identifier (URI) is a string of characters identifying a resource or file path.
United States Postal Service (USPS) provides postal services in the United States. The USPS offers address verification and standardization tools.
A proprietary file type used for categorizing data look-ups pertinent to a specific subject area.