SAS Quality Knowledge Base for Contact Information 27

Birth Number

Parse Definition

Birth Number
Description The Birth Number parse definition parses birth numbers into a set of tokens.
Output Tokens Year of Birth
Month of Birth
Day of Birth
Order Number
Additional Info
Examples
Input            Output Token      Output
936223 / 6147    Year of Birth     93
                 Month of Birth    62
                 Day of Birth      23
                 Order Number      6147
                 Additional Info

530731/456       Year of Birth     53
                 Month of Birth    07
                 Day of Birth      31
                 Order Number      456
                 Additional Info

128399/5616      Year of Birth     1 2
                 Month of Birth    8 3
                 Day of Birth      9 9
                 Order Number      5 6 1 6
                 Additional Info
Remarks This parse definition requires the input data to be in the form YYMMDDOOO, where YY is the Year of Birth, MM is the Month of Birth, DD is the Day of Birth, and OOO is the Order Number, which consists of 3 or 4 digits. Month of Birth has values 01 through 12 (male) or 51 through 62 (female). The Order Number has 3 digits if the person was born before 1954 and 4 digits if the person was born in 1954 or later.

If the input string does not contain a correctly formatted birth number, the parse result is NO SOLUTION, and the output tokens are filled according to the order of the input digits, with spaces between the digits, as in the final example.
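The format rules above can be illustrated outside the QKB with a short Python sketch. This is not the parse definition's implementation; the function name parse_birth_number is hypothetical, and the optional slash (with optional surrounding spaces) between the date portion and the Order Number is an assumption inferred from the examples. Returning None here stands in for a NO SOLUTION result.

```python
import re

# Assumed layout, inferred from the examples: six digits (YYMMDD), an optional
# "/" with optional surrounding spaces, then a 3- or 4-digit Order Number.
_BIRTH_NUMBER = re.compile(r"(\d{2})(\d{2})(\d{2})\s*/?\s*(\d{3,4})")


def parse_birth_number(text):
    """Return a dict of tokens, or None when the input is not well formed."""
    match = _BIRTH_NUMBER.fullmatch(text.strip())
    if match is None:
        return None
    year, month, day, order = match.groups()
    # Month of Birth must be 01-12 (male) or 51-62 (female).
    if not (1 <= int(month) <= 12 or 51 <= int(month) <= 62):
        return None
    return {
        "Year of Birth": year,
        "Month of Birth": month,
        "Day of Birth": day,
        "Order Number": order,
    }
```

The sketch checks only the layout and the Month of Birth range. It does not check the day value or the relation between birth year and Order Number length.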

The recommended procedure is first to determine if the input string can be parsed correctly, then apply parsing to correctly formatted input data. To determine if the formatting is correct, you can do either of the following:

1. Assign a Result Code Field in the Parse node of DataFlux Data Management Studio, and handle the output that has NO SOLUTION results separately.

2. Run a gender analysis on the input string using the Birth Number gender analysis definition. Input strings that get the U (Unknown) result cannot be parsed as formally correct birth numbers.
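The check-first, parse-second procedure can be sketched in Python as follows. This is an illustration only and does not call the QKB or DataFlux Data Management Studio. The function names gender_code and split_by_parseability are hypothetical, the M and F codes are assumed (only U is named above), and the optional slash in the layout is inferred from the examples.

```python
import re

# Assumed layout: YYMMDD, optional "/" with optional spaces, 3 or 4 digits.
# Only the month digits are captured.
_LAYOUT = re.compile(r"\d{2}(\d{2})\d{2}\s*/?\s*\d{3,4}")


def gender_code(text):
    """Return "M", "F", or "U" based on the Month of Birth digits."""
    match = _LAYOUT.fullmatch(text.strip())
    if match is None:
        return "U"
    month = int(match.group(1))
    if 1 <= month <= 12:
        return "M"  # assumed code for male
    if 51 <= month <= 62:
        return "F"  # assumed code for female
    return "U"


def split_by_parseability(values):
    """Separate inputs that look parseable from those that get a U result."""
    parseable, unparseable = [], []
    for value in values:
        if gender_code(value) == "U":
            unparseable.append(value)
        else:
            parseable.append(value)
    return parseable, unparseable
```

Only the first list would then be passed on to parsing; the second list corresponds to the inputs that need separate handling.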