Character (Script Identification)
Pattern Analysis Definition
| Character (Script Identification) | ||
|---|---|---|
| Description | The Character (Script Identification) pattern analysis definition determines the Unicode script of each character in the input, and outputs a character representing that script. | |
| Output Symbols | Symbol | Meaning |
| L | Uppercase Latin character | |
| l | Lowercase Latin character | |
| 漢 | Kanji/Han | |
| ア | Katakana | |
| あ | Hiragana | |
| 가 | Hangul | |
| Я | Uppercase Cyrillic character | |
| я | Lowercase Cyrillic character | |
| Θ | Uppercase Greek character | |
| θ | Lowercase Greek character | |
| ก | Thai | |
| أ | Arabic character | |
| א | Hebrew character | |
| 9 | Numeric digit | |
| * | other (punctuation, and so on) | |
| Examples | Input | Output |
| 1ー13ー1 イヌイビル・カチドキ8F 501号室 | 9*99*9 アアアアア*アアアア9L 999漢漢 | |
| JOHN DOE | LLLL LLL | |
| (7F, SAS Institute)スズキイチロウ | *9L* LLL Lllllllll*アアアアアアア | |
| 李大伟 赛仕(北京) | 漢漢漢 漢漢*漢漢* | |
| 爱新觉罗·溥仪 | 漢漢漢漢*漢漢 | |
| 陈耀昌(Chan,Ed Yiu-Cheong) | 漢漢漢*Llll*Ll Lll*Llllll* | |
| 星光大道62号海王星科技大厦A座6楼 | 漢漢漢漢99漢漢漢漢漢漢漢漢L漢9漢 | |
| 珠海市 245400(玫瑰楼) | 漢漢漢 999999*漢漢漢* | |
| 二零零九年十月二十一日 | 漢漢漢漢漢漢漢漢漢漢漢 | |
| 14Mar, 2001 | 99Lll* 9999 | |
| 2009/10/21 | 9999*99*99 | |
| H134981(5)------ | L999999*9******* | |
| 0174685503(D) | 9999999999*L* | |
| 22020319691106184X | 99999999999999999L | |
| 碧丽服装(北京)有限公司 | 漢漢漢漢*漢漢*漢漢漢漢 | |
| 电话(+86)10-12345678 | 漢漢**99*99*99999999 | |
| Fax:01082741510 | Lll*99999999999 | |
| (010)82741510-345 | *999*99999999*999 | |
| Αθήνα | Θθθθθ | |
| Банк | Яяяя | |
| רודיה סקאלה כשאני אוהב (הערות Liner) Sonotone (1990) | אאאאא אאאאא אאאאא אאאא *אאאאא Lllll* Llllllll *9999* | |
| Remarks | ||