The geocoding facility
uses these data sets to match the addresses in the address data set.
As it is processing the address data set, the geocoding facility provides
a progress indicator. For every 10% of the addresses that are geocoded,
a message is written to the SAS log.
When a match is found,
the coordinates of the address location are added to the address data
set, along with any other composite values for the specified address.
For example, if the spatial data has a composite named TRACT that
contains census tract numbers, you can use the geocoding process to
add a TRACT variable to your address data set. The resulting geocoded
address data set can be used as attribute data for the map, or it
can be imported to add point data to the map by using a generic import.
If an address cannot
be matched to the spatial data but the address includes a ZIP code,
then the X and Y coordinates of the center of the ZIP code centroid
for the zone are returned instead of the exact coordinates of the
address. The centroid coordinates are read from the SASHELP.ZIPCODE
data set.
For matching purposes,
the geocoding process converts the address components to uppercase
and attempts to convert direction and street type values to standard
forms. The standardized versions of the address components are also
added to the address data set. The M_ADDR, M_CITY, M_STATE, M_ZIP,
and M_ZIP4 variables that are added to the address data set reflect
the address values that were actually matched during the geocoding
process. If a matching observation was found in the sorted chains
data set, that row number is placed in the M_OBS variable.