Template:Unicode blocks/testcases

From Wikipedia, the free encyclopedia


Plane Block range Block name Code points[a] Assigned characters Scripts[b][c][d][e][f]
 


 0 BMP U+0000..U+007F Basic Latin[g] 128 128 Latin (52 characters)128, Common (76 characters)
 0 BMP U+0080..U+00FF Latin-1 Supplement[h] 128 128 Latin (64 characters), Common (64 characters)
 0 BMP U+0100..U+017F Latin Extended-A 128 128 Latin
 0 BMP U+0180..U+024F Latin Extended-B 208 208 Latin
 0 BMP U+0250..U+02AF IPA Extensions 96 96 Latin
 0 BMP U+02B0..U+02FF Spacing Modifier Letters 80 80 Bopomofo (2 characters), Latin (14 characters), Common (64 characters)
  1. ^ Code point count includes unassigned code points: non-character, reserved
  2. ^ The script has one or multiple characters in the block, as defined by the Script Property. This is independent of the block name
  3. ^ "Common" and "Unknown" (Zyyy) and "Inherited" (Zinh or Qaai) refer to Scripts in ISO 15924
  4. ^ Unicode Blocks data file. As of Unicode version 13.0
  5. ^ UAX 24: Unicode Script Property (4 alpha code)
  6. ^ UAX 24: Script data file
  7. ^ Called "C0 Controls and Basic Latin" in ISO/IEC 10646
  8. ^ Called "C1 Controls and Latin-1 Supplement" in ISO/IEC 10646


planes[edit]

Plane Block range Block name Code points[a] Assigned characters Scripts[b][c][d][e][f]
 0 BMP U+0000..U+007F Basic Latin[g] 128 128 Latin (52 characters), Common (76 characters)
 0 BMP U+DC00..U+DFFF Low Surrogates 1,024 0 Unknown
 1 SMP U+10000..U+1007F Linear B Syllabary 128 88 Linear B
 1 SMP U+16B00..U+16B8F Pahawh Hmong 144 127 Pahawh Hmong
 2 SIP U+20000..U+2A6DF CJK Unified Ideographs Extension B 42,720 42,718 Han
 2 SIP U+2F800..U+2FA1F CJK Compatibility Ideographs Supplement 544 542 Han
 3 TIP U+30000..U+3134F CJK Unified Ideographs Extension G 4,944 4,939 Han
14 SSP U+E0000..U+E007F Tags 128 97 Common
14 SSP U+E0100..U+E01EF Variation Selectors Supplement 240 240 Inherited
15 PUA-A U+F0000..U+FFFFF Supplementary Private Use Area-A 65,536 65,534 Unknown
16 PUA-B U+100000..U+10FFFF Supplementary Private Use Area-B 65,536 65,534 Unknown
  1. ^ Code point count includes unassigned code points: non-character, reserved
  2. ^ The script has one or multiple characters in the block, as defined by the Script Property. This is independent of the block name
  3. ^ "Common" and "Unknown" (Zyyy) and "Inherited" (Zinh or Qaai) refer to Scripts in ISO 15924
  4. ^ Unicode Blocks data file. As of Unicode version 13.0
  5. ^ UAX 24: Unicode Script Property (4 alpha code)
  6. ^ UAX 24: Script data file
  7. ^ Called "C0 Controls and Basic Latin" in ISO/IEC 10646