Distributed Address Verification Node
NOTE: Beginning July 31, 2023, SAS will no longer support or provide updates the following data packs:
- US Address Verification (USPS)
- North America Postal Level Geocode (includes PhonePlus)
- Canada Address Verification (Canada Post SERP (Software Evaluation and Recognition System) version 2, release 2 (V2R2))
- US Street Level Geocode (except for Loqate geocode data)
Users that want to perform US/Canada address verification and geocode jobs, must migrate to the Loqate data packs listed below, prior to July 31, 2023.
- Loqate US Data
- Loqate Data (includes Canada)
After July 31, 2023, the SAS Data Management Studio jobs that rely on the discontinued data packs fail and you will receive license errors. See the SAS Data Management Studio and Server 2.9: Migration Guide for Data Packs for additional information.
You can add a Distributed Address Verification node to a data job to offload address verification to a machine other than the one running the current SAS Data Management Studio job. This function is particularly useful when you are running on a SAS Data Management Server. It offers a large performance increase by eliminating the overhead of connecting to the United States Postal Service (USPS) database when you start a real-time service. The USPS, which provides postal services in the United States, offers address verification and standardization tools. This node also allows many address verification services to be open at once. Previously, a memory limitation prevented multiple services from being opened.
Use the Distributed Address Verification node to verify, correct, and enhance US, Canadian, or World addresses in your existing data. Distributed address verification uses geographic information from various reference databases to match and standardize addresses. This node also consolidates the three traditional address verification nodes (Address Verification - US/Canada, Address Verification - QAS, and Address Verification - World) into one node that is capable of performing all three types of address verification.
Note: To use this node you must have SAS Data Management Server installed and licensed. For additional information regarding SAS Data Management Server installation and configuration please refer to the dfIntelliServer Reference Guide. After you have installed and configured SAS Data Management Server you must add the following key and value to the app.cfg file:
DFCLIENT/CFG = <INTELLISERVER CLIENT CONFIGURATION FILE>
For example:
DFCLIENT/CFG = C:\Program Files\SAS\dfIntelliServer\etc\dfclient.cfg
Once you have added the Distributed Address Verification node, you can double-click it to open its properties window. The properties window includes the following elements:
Name - Specifies a name for the node.
Notes - Enables you to open the Notes window. You use the window to enter optional details or any other relevant information for the input.
The Input section of the window contains following element:
Input Fields - Contains the fields you can use for distributed address verification processing. Items displayed in this list are dependent on your data source(s) and any steps preceding the Distributed Address Verification node in your job flow. For each field you are running address verification on, select the type of address information from the Field Type column. Refer to Distributed Address Verification - Available Fields for a list of available field types.
Note: Specifying a field type for a field does not mean the field will be included in distributed address verification results. To specify fields to be included in the results, use the Available and Selected lists in the Output fields group box.
The Country section of the window contains following elements:
Use input field or system default - When selected, enables you to determine the country by either specifying a country field or using the default system locale. If you know your data set contains only data for a specific country, you can improve performance and accuracy by selecting this option. The data in the country field should be the two character ISO standard for that country.
Important - SAS Data Management Studio does not support multi-byte character sets. If the data returned from the Address Verification job contains multi-byte data, you must force the output to Latin format. For additional information, refer to the ADDR_OUTPUT_LATIN advanced property.
Guess - When selected, prompts the system to guess a locale.
Options - Displays the Options window, where you can set the following address verification options:
- Proper case results - When selected, displays results in proper case. By default, results are all uppercase.
- Output using Latin script only - When selected, the output is forced to Latin instead of the native language (for example, Kanji). Data in non-European languages will be transliterated.
- CASS compliance (United States) - When selected, the Address Verification (US/Canada) node should apply USPS Coding Accuracy Support System (CASS). CASS is the program that USPS administers to certify address verification software standards when modifying address data.
- Correct rural addresses (Canada) - When selected, rural addresses are corrected during the address verification process. If this option is not selected, the rural addresses are not changed during the process. This option is on by default.
- Correct Large Volume Receivers (Canada) - When selected, the process corrects Large Volume Receiver (LVR) addresses. If this option is not selected, when the process identifies an LVR address, the address is recognized as valid and no corrections are made to any of the address components. This option is on by default.
- Pass valid addresses through (Canada) - When selected, valid addresses are passed through the system without modification. When this option is selected, changes like street or state abbreviations are not changed. This option is off by default.
The Output fields section of the window contains following elements:
Available - Displays the fields that you can make available for the next step in your data job. Items displayed in this list are dependent on your data sources and any preceding steps in your data job.
Selected - Displays the fields that will be made available to the next node in your data job.
Additional Outputs - Displays the Additional Outputs window. This window enables you to specify the fields that you can make available to the next node in your data job.
Override Server - Enables you to override the default connection to the dfIntelliServer specified in the dfclient.cfg file. (See the dfIntelliServer Reference Guide for more information about configuration settings). If you want to override the default server setting and perform the area code lookup on a different dfIntelliServer instance, you can specify the server name in the Override Server field and port number on this window.
You can access the following advanced properties by right-clicking the Distributed Address Verification node:
- ADDR_CA_LVR
- ADDR_CA_PASS_VALID
- ADDR_CA_RURAL
- ADDR_GUESS_COUNTRY
- ADDR_OUTPUT_LATIN
- ADDR_PROPERCASE
- ADDR_RETURN_INVALID
- ADDR_US_CASS
- ADDR_US_DPV
- ADDR_US_ELOT
- ADDR_US_LACS
- ADDR_US_RDI
- INPUT_COUNTRY
- INPUT_FIRM
- INPUT_LAST_LINE
- INPUT_LINE1
- INPUT_LINE2
- OUTPUTS
- OVERRIDE_SERVER
- PASSTHRU