SAS Quality Knowledge Base for Contact Information 27

Character (Script Identification)

Pattern Analysis Definition

Character (Script Identification)
Description The Character (Script Identification) pattern analysis definition determines the Unicode script of each character in the input and outputs one symbol per character to represent that script.
Output Symbols
Symbol    Meaning
L    Uppercase Latin character
l    Lowercase Latin character
漢    Kanji/Han
ア    Katakana
     Hiragana
     Hangul
Я    Uppercase Cyrillic character
я    Lowercase Cyrillic character
Θ    Uppercase Greek character
θ    Lowercase Greek character
     Thai
أ    Arabic character
א    Hebrew character
9    Numeric digit
*    Other (punctuation, and so on)
Examples
Input    Output
1ー13ー1 イヌイビル・カチドキ8F 501号室    9*99*9 アアアアア*アアアア9L 999漢漢
JOHN DOE    LLLL LLL
(7F, SAS Institute)スズキイチロウ    *9L* LLL Lllllllll*アアアアアアア
李大伟 赛仕(北京)    漢漢漢 漢漢*漢漢*
爱新觉罗·溥仪    漢漢漢漢*漢漢
陈耀昌(Chan,Ed Yiu-Cheong)    漢漢漢*Llll*Ll Lll*Llllll*
星光大道62号海王星科技大厦A座6楼    漢漢漢漢99漢漢漢漢漢漢漢漢L漢9漢
珠海市 245400(玫瑰楼)    漢漢漢 999999*漢漢漢*
二零零九年十月二十一日    漢漢漢漢漢漢漢漢漢漢漢
14Mar, 2001    99Lll* 9999
2009/10/21    9999*99*99
H134981(5)------    L999999*9*******
0174685503(D)    9999999999*L*
22020319691106184X    99999999999999999L
碧丽服装(北京)有限公司    漢漢漢漢*漢漢*漢漢漢漢
电话(+86)10-12345678    漢漢**99*99*99999999
Fax:01082741510    Lll*99999999999
(010)82741510-345    *999*99999999*999
Αθήνα    Θθθθθ
Банк    Яяяя
רודיה סקאלה כשאני אוהב (הערות Liner) Sonotone (1990)    אאאאא אאאאא אאאאא אאאא *אאאאא Lllll* Llllllll *9999*
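The per-character mapping shown in these examples can be approximated outside the QKB. The following is a minimal Python sketch, not the SAS implementation. The function name `script_pattern` is hypothetical. Python's `unicodedata` module does not expose the Unicode Script property, so the sketch infers the script from the first word of each character's Unicode name, which is a heuristic. The symbols used for Hiragana, Hangul, and Thai are placeholders chosen for illustration, because the symbol table above does not show them. Passing spaces through unchanged and treating only letter categories `Lu`, `Ll`, and `Lo` as script characters are assumptions drawn from the examples.

```python
import unicodedata

# Symbols for cased scripts: (uppercase symbol, lowercase symbol).
CASED = {
    "LATIN": ("L", "l"),
    "CYRILLIC": ("Я", "я"),
    "GREEK": ("Θ", "θ"),
}

# Symbols for uncased scripts, as they appear in the documented examples.
UNCASED = {
    "CJK": "漢",
    "KATAKANA": "ア",
    "ARABIC": "أ",
    "HEBREW": "א",
}

# ASSUMPTION: placeholder symbols; the source table does not show these.
UNCASED.update({"HIRAGANA": "あ", "HANGUL": "한", "THAI": "ก"})


def script_pattern(text: str) -> str:
    """Return one symbol per input character (hypothetical helper)."""
    out = []
    for ch in text:
        category = unicodedata.category(ch)
        if ch == " ":
            # Assumption from the examples: spaces are passed through.
            out.append(" ")
        elif category == "Nd":
            out.append("9")
        elif category in ("Lu", "Ll", "Lo"):
            # Heuristic: the first word of the Unicode name names the script.
            script = unicodedata.name(ch, "").split(" ")[0]
            if script in CASED:
                upper, lower = CASED[script]
                out.append(upper if category == "Lu" else lower)
            else:
                out.append(UNCASED.get(script, "*"))
        else:
            # Punctuation, modifier letters, combining marks, and so on.
            out.append("*")
    return "".join(out)


print(script_pattern("JOHN DOE"))         # LLLL LLL
print(script_pattern("14Mar, 2001"))      # 99Lll* 9999
print(script_pattern("Fax:01082741510"))  # Lll*99999999999
```

Restricting Katakana to category `Lo` is what makes the prolonged sound mark (ー) and the middle dot (・) fall through to `*`, which matches the first example. Fullwidth and halfwidth forms, combining marks, and non-decimal numerals are not handled and may differ from the actual definition.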
Remarks