Unicode Character Encoding Table

Posted by 振长策而御宇内

Preface: This article was compiled by the editors of 小常识网 (cha138.com). It presents the Unicode character encoding table and is intended as a reference.

Decimal      Hexadecimal   Count  Category (Chinese)  Category (English)
Start  End   Start  End
0      127   0000   007F   128    C0控制符及基本拉丁文  C0 Controls and Basic Latin
128    255   0080   00FF   128    C1控制符及拉丁文补充-1  C1 Controls and Latin-1 Supplement
256    383   0100   017F   128    拉丁文扩展-A  Latin Extended-A
384    591   0180   024F   208    拉丁文扩展-B  Latin Extended-B
592    687   0250   02AF   96     国际音标扩展  IPA Extensions
688    767   02B0   02FF   80     空白修饰字母  Spacing Modifier Letters
768    879   0300   036F   112    结合用读音符号  Combining Diacritical Marks
880    1023  0370   03FF   144    希腊文及科普特文  Greek and Coptic
1024   1279  0400   04FF   256    西里尔字母  Cyrillic
1280   1327  0500   052F   48     西里尔字母补充  Cyrillic Supplement
1328   1423  0530   058F   96     亚美尼亚语  Armenian
1424   1535  0590   05FF   112    希伯来文  Hebrew
1536   1791  0600   06FF   256    阿拉伯文  Arabic
1792   1871  0700   074F   80     叙利亚文  Syriac
1872   1919  0750   077F   48     阿拉伯文补充  Arabic Supplement
1920   1983  0780   07BF   64     马尔代夫语  Thaana
1984   2047  07C0   07FF   64     西非書面語言  N'Ko
2048   2143  0800   085F   96     阿维斯塔语及巴列维语  Avestan and Pahlavi
2144   2175  0860   087F   32     Mandaic  Mandaic
2176   2223  0880   08AF   48     撒马利亚语  Samaritan
2304   2431  0900   097F   128    天城文书  Devanagari
2432   2559  0980   09FF   128    孟加拉语  Bengali
2560   2687  0A00   0A7F   128    锡克教文  Gurmukhi
2688   2815  0A80   0AFF   128    古吉拉特文  Gujarati
2816   2943  0B00   0B7F   128    奥里亚文  Oriya
2944   3071  0B80   0BFF   128    泰米尔文  Tamil
3072   3199  0C00   0C7F   128    泰卢固文  Telugu
3200   3327  0C80   0CFF   128    卡纳达文  Kannada
3328   3455  0D00   0D7F   128    德拉维族语  Malayalam
3456   3583  0D80   0DFF   128    僧伽罗语  Sinhala
3584   3711  0E00   0E7F   128    泰文  Thai
3712   3839  0E80   0EFF   128    老挝文  Lao
3840   4095  0F00   0FFF   256    藏文  Tibetan
4096   4255  1000   109F   160    缅甸语  Myanmar
4256   4351  10A0   10FF   96     格鲁吉亚语  Georgian
4352   4607  1100   11FF   256    朝鲜文  Hangul Jamo
4608   4991  1200   137F   384    埃塞俄比亚语  Ethiopic
4992   5023  1380   139F   32     埃塞俄比亚语补充  Ethiopic Supplement
5024   5119  13A0   13FF   96     切罗基语  Cherokee
5120   5759  1400   167F   640    统一加拿大土著语音节  Unified Canadian Aboriginal Syllabics
5760   5791  1680   169F   32     欧甘字母  Ogham
5792   5887  16A0   16FF   96     如尼文  Runic
5888   5919  1700   171F   32     塔加拉语  Tagalog
5920   5951  1720   173F   32     Hanunóo  Hanunóo
5952   5983  1740   175F   32     Buhid  Buhid
5984   6015  1760   177F   32     Tagbanwa  Tagbanwa
6016   6143  1780   17FF   128    高棉语  Khmer
6144   6319  1800   18AF   176    蒙古文  Mongolian
6320   6399  18B0   18FF   80     Cham  Cham
6400   6479  1900   194F   80     Limbu  Limbu
6480   6527  1950   197F   48     德宏泰语  Tai Le
6528   6623  1980   19DF   96     新傣仂语  New Tai Lue
6624   6655  19E0   19FF   32     高棉语记号  Khmer Symbols
6656   6687  1A00   1A1F   32     Buginese  Buginese
6688   6751  1A20   1A5F   64     Batak  Batak
6784   6895  1A80   1AEF   112    Lanna  Lanna
6912   7039  1B00   1B7F   128    巴厘语  Balinese
7040   7088  1B80   1BB0   49     巽他语  Sundanese
7104   7167  1BC0   1BFF   64     Pahawh Hmong  Pahawh Hmong
7168   7247  1C00   1C4F   80     雷布查语  Lepcha
7248   7295  1C50   1C7F   48     Ol Chiki  Ol Chiki
7296   7391  1C80   1CDF   96     曼尼普尔语  Meithei/Manipuri
7424   7551  1D00   1D7F   128    语音学扩展  Phonetic Extensions
7552   7615  1D80   1DBF   64     语音学扩展补充  Phonetic Extensions Supplement
7616   7679  1DC0   1DFF   64     结合用读音符号补充  Combining Diacritical Marks Supplement
7680   7935  1E00   1EFF   256    拉丁文扩充附加  Latin Extended Additional
7936   8191  1F00   1FFF   256    希腊语扩充  Greek Extended
8192   8303  2000   206F   112    常用标点  General Punctuation
8304   8351  2070   209F   48     上标及下标  Superscripts and Subscripts
8352   8399  20A0   20CF   48     货币符号  Currency Symbols
8400   8447  20D0   20FF   48     组合用记号  Combining Diacritical Marks for Symbols
8448   8527  2100   214F   80     字母式符号  Letterlike Symbols
8528   8591  2150   218F   64     数字形式  Number Forms
8592   8703  2190   21FF   112    箭头  Arrows
8704   8959  2200   22FF   256    数学运算符  Mathematical Operators
8960   9215  2300   23FF   256    杂项工业符号  Miscellaneous Technical
9216   9279  2400   243F   64     控制图片  Control Pictures
9280   9311  2440   245F   32     光学识别符  Optical Character Recognition
9312   9471  2460   24FF   160    封闭式字母数字  Enclosed Alphanumerics
9472   ...
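
As a rough illustration of how the columns relate, the sketch below copies a handful of rows from the table into a Python list, recomputes the count column as end - start + 1, and looks up which range a character falls in. The names `BLOCKS`, `STARTS`, `block_size`, and `find_block` are made up for this example and are not part of any library. Only a few rows are included, so a character outside them returns `None`.

```python
from bisect import bisect_right

# A few rows copied from the table: (hex start, hex end, English category).
# This is only a sample, sorted by start value; extend it as needed.
BLOCKS = [
    (0x0100, 0x017F, "Latin Extended-A"),
    (0x0370, 0x03FF, "Greek and Coptic"),
    (0x0400, 0x04FF, "Cyrillic"),
    (0x0590, 0x05FF, "Hebrew"),
    (0x0E00, 0x0E7F, "Thai"),
    (0x2190, 0x21FF, "Arrows"),
]
STARTS = [start for start, _end, _name in BLOCKS]


def block_size(start, end):
    """Number of code points in an inclusive range (the count column)."""
    return end - start + 1


def find_block(ch):
    """Return the category of a single character, or None if not listed."""
    cp = ord(ch)
    i = bisect_right(STARTS, cp) - 1  # index of the last row whose start <= cp
    if i >= 0 and cp <= BLOCKS[i][1]:
        return BLOCKS[i][2]
    return None


# The decimal and hexadecimal columns are the same numbers in two bases.
print(int("0370", 16), int("03FF", 16))   # 880 1023
print(block_size(0x0370, 0x03FF))         # 144
print(find_block("α"))                    # Greek and Coptic
print(find_block("→"))                    # Arrows
print(find_block("A"))                    # None (row not in the sample)
```

A sorted list with `bisect_right` keeps the lookup simple for non-overlapping ranges like these; a plain linear scan would work just as well for a table of this size.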
