SAS Quality Knowledge Base for Contact Information 27

Tax ID

Standardization Definition

Tax ID
Description

The Tax ID standardization definition standardizes Tax Identification Number information.

Examples Input Output
012345678332 012-345-678-332
012345678 012-345-678-000
228 075 409 008 228-075-409-008
28 075 409 008 028-075-409-008
28075409008 028-075-409-008
28 075 409 028-075-409-000
8075409 008-075-409-000
1234567891111 123-456-789-1111
Remarks

If the input contains 9 digits, the default branch code 000 is appended to the end.

If the input contains less than 9 digits, the default branch code 000 is appended to the end and leading zeroes are prepended to the front to produce 12 total digits.

If the data contains 10 or 11 digits, leading zeroes are prepended to the front to produce 12 total digits.

This definition supports 13-digit Tax Identification Numbers which will be used by the government in the future.

If this definition is applied to pre-parsed data, the following input tokens are available:

Tax ID Proper
Branch Code

It is recommended that you map a correlating data field to each available token whenever possible.