SAS Quality Knowledge Base for Contact Information 27
Character (Script Identification) | ||
---|---|---|
Description | The Character (Script Identification) pattern analysis definition determines the Unicode script of each character in the input and outputs one symbol per character representing that script. | |
Output Symbols | Symbol | Meaning |
| | L | Uppercase Latin character |
| | l | Lowercase Latin character |
| | 漢 | Kanji/Han |
| | ア | Katakana |
| | あ | Hiragana |
| | 가 | Hangul |
| | Я | Uppercase Cyrillic character |
| | я | Lowercase Cyrillic character |
| | Θ | Uppercase Greek character |
| | θ | Lowercase Greek character |
| | ก | Thai |
| | أ | Arabic character |
| | א | Hebrew character |
| | 9 | Numeric digit |
| | * | Other (punctuation, and so on) |
Examples | Input | Output |
| | 1ー13ー1 イヌイビル・カチドキ8F 501号室 | 9*99*9 アアアアア*アアアア9L 999漢漢 |
| | JOHN DOE | LLLL LLL |
| | (7F, SAS Institute)スズキイチロウ | *9L* LLL Lllllllll*アアアアアアア |
| | 李大伟 赛仕(北京) | 漢漢漢 漢漢*漢漢* |
| | 爱新觉罗·溥仪 | 漢漢漢漢*漢漢 |
| | 陈耀昌(Chan,Ed Yiu-Cheong) | 漢漢漢*Llll*Ll Lll*Llllll* |
| | 星光大道62号海王星科技大厦A座6楼 | 漢漢漢漢99漢漢漢漢漢漢漢漢L漢9漢 |
| | 珠海市 245400(玫瑰楼) | 漢漢漢 999999*漢漢漢* |
| | 二零零九年十月二十一日 | 漢漢漢漢漢漢漢漢漢漢漢 |
| | 14Mar, 2001 | 99Lll* 9999 |
| | 2009/10/21 | 9999*99*99 |
| | H134981(5)------ | L999999*9******* |
| | 0174685503(D) | 9999999999*L* |
| | 22020319691106184X | 99999999999999999L |
| | 碧丽服装(北京)有限公司 | 漢漢漢漢*漢漢*漢漢漢漢 |
| | 电话(+86)10-12345678 | 漢漢**99*99*99999999 |
| | Fax:01082741510 | Lll*99999999999 |
| | (010)82741510-345 | *999*99999999*999 |
| | Αθήνα | Θθθθθ |
| | Банк | Яяяя |
| | רודיה סקאלה כשאני אוהב (הערות Liner) Sonotone (1990) | אאאאא אאאאא אאאאא אאאא *אאאאא Lllll* Llllllll *9999* |
Remarks |
Doc ID: QKBCI_GB_Pattern_Character-ScriptID.html |
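
The character-to-symbol mapping described in the table can be approximated outside the QKB for illustration. The following is a minimal Python sketch, not the QKB's implementation. The names `script_pattern` and `SCRIPT_SYMBOLS` are hypothetical. Python's standard `unicodedata` module does not expose the Unicode script property, so the sketch assumes that the first word of each character's Unicode name is an acceptable stand-in for its script. Spaces are passed through unchanged, as they appear to be in the examples. Characters such as full-width Latin letters or half-width Katakana are not handled and fall through to `*`.

```python
import unicodedata

# Output symbols copied from the Output Symbols table, keyed by the first
# word of the Unicode character name (an assumed proxy for the script).
# A tuple holds the (uppercase, lowercase) symbols for cased scripts.
SCRIPT_SYMBOLS = {
    "LATIN": ("L", "l"),
    "CYRILLIC": ("Я", "я"),
    "GREEK": ("Θ", "θ"),
    "CJK": "漢",
    "KATAKANA": "ア",
    "HIRAGANA": "あ",
    "HANGUL": "가",
    "THAI": "ก",
    "ARABIC": "أ",
    "HEBREW": "א",
}


def script_pattern(text: str) -> str:
    """Return one output symbol per character (approximation only)."""
    out = []
    # NFC composes base letters with combining marks where possible,
    # so an accented letter counts as a single character.
    for ch in unicodedata.normalize("NFC", text):
        category = unicodedata.category(ch)
        if ch.isspace():
            out.append(ch)    # whitespace is kept as-is
        elif category == "Nd":
            out.append("9")   # decimal digit in any script
        elif category[0] in "LM":
            # Letters and marks: look up the first word of the name.
            # Unknown keywords, for example "KATAKANA-HIRAGANA" for the
            # prolonged sound mark, fall through to "*".
            keyword = unicodedata.name(ch, "").split(" ", 1)[0]
            symbol = SCRIPT_SYMBOLS.get(keyword, "*")
            if isinstance(symbol, tuple):
                symbol = symbol[0] if ch.isupper() else symbol[1]
            out.append(symbol)
        else:
            out.append("*")   # punctuation, symbols, and so on
    return "".join(out)


print(script_pattern("Fax:01082741510"))  # Lll*99999999999
```

The sketch reproduces several rows of the Examples table, but agreement with the QKB definition on other inputs is not guaranteed.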