ARIB STD B24 character set

Volume 1 of the Association of Radio Industries and Businesses (ARIB) STD-B24 standard for Broadcast Markup Language[2] specifies, amongst other details, a character encoding for use in Japanese-language broadcasting. It was introduced on 1999-10-26.[2] The latest revision is version 6.3 as of 2016-07-06.

ARIB STB-B24 encoding
StandardARIB STB-B24 Volume 1
ClassificationISO 2022 profile/extension
Transforms / EncodesARIB STB-B24 Kanji, Kana and mosaic sets,
JIS X 0201
ARIB STB-B24 Kanji set
Weather symbols: a few of the extended symbols included.
LanguagesJapanese, English, Russian
Partial support: Greek, Chinese
StandardARIB STB-B24 Volume 1
ClassificationISO-2022-structured CJK DBCS
ExtendsJIS X 0208
Encoding formats
  • ARIB STB-B24 encoding (ISO 2022 based)
  • Shift JIS (ARIB variant)[1]

It includes a number of ARIB extended characters (ARIB外字, ARIB gaiji) not found in the base standards (JIS X 0208 and JIS X 0201). It was the source standard for many symbol characters which were added to Unicode, including portions of the Miscellaneous Symbols, Enclosed Alphanumeric Supplement and Enclosed Ideographic Supplement blocks.[3] Its contributions partially overlap the Unicode emoji, but were added a year earlier, in Unicode 5.2.[4]

Fascicle 1 of the ARIB STD-B62 standard, published in 2014, defines Unicode mappings for a selection of the B24 extended characters (excluding, for example, those duplicated by JIS X 0213), as well as a few extended Kanji.[5] It also includes a mapping of utilised characters outside the Basic Multilingual Plane to the BMP's private use area.

Sets and codes

edit

The ARIB STD B24 standard defines multiple character sets and a method of switching between them. These include a Kanji set (an extension of JIS X 0208), an Alphanumeric set, a Hiragana set, Katakana sets of two distinct layouts and four mosaic sets.[6] The sets are selected using ISO 2022 mechanisms for 94-sets, using the following codes (proportional sets use the same layout as the corresponding non-proportional ones):[7]

SetTypeCode (column/line)Code (hexadecimal)Code (ASCII character)Comments
Kanji2-byte4/242BThe escape code B used for the ARIB Kanji set[7] is used for the 1983 version of JIS C 6226 (JIS X 0208, of which the ARIB Kanji set is an extension) in ISO-2022-JP.[8][9]
Alphanumeric1-byte4/104AJJIS_C6220-ro (ISO646-JP, JIS X 0201 Roman set). Similar to ASCII, with two assignments differing. Escape code J matches usage in ISO-2022-JP.[9]
Proportional alphanumeric1-byte3/6366
Hiragana1-byte3/0300Hiragana themselves follow the same layout as row 4 of JIS X 0208, but without a lead byte. Also adds several additional assignments for punctuation.
Proportional Hiragana1-byte3/7377
Katakana1-byte3/1311Katakana themselves follow the same layout as row 5 of JIS X 0208, but without a lead byte. Also adds several additional assignments for punctuation.
Proportional Katakana1-byte3/8388
JIS X 0201 Katakana1-byte4/949IJIS_C6220-jp (JIS X 0201 Kana set). Escape code matches usage in ISO-2022-JP-3.
Mosaic A1-byte3/2322Pseudographics (ISO-IR-71)
Mosaic B1-byte3/3333Pseudographics (ISO-IR-137)
Mosaic C1-byte3/4344Non-spacing pseudographics (ISO-IR-71 subset with separated mosaic blocks)
Mosaic D1-byte3/5355Non-spacing pseudographics

Code charts

edit

Kanji (double-byte) set

edit

This is a double-byte character set extending JIS X 0208.

Lead byte

edit

The encoding bytes correspond to the row or cell number plus 0x20, or 32 in decimal (see below). Hence, the code set starting with 0x21 has a row number of 1, and its cell 1 has a continuation byte of 0x21 (or 33), and so forth. Most of the code corresponds to JIS X 0208.

ARIB STD-B24 Kanji (double-byte) set (lead bytes)
0 1 2 3 4 5 6 7 8 9 A B C D E F
2x  SP  1-_ 2-_ 3-_ 4-_ 5-_ 6-_ 7-_ 8-_ 9-_ 10-_ 11-_ 12-_ 13-_ 14-_ 15-_
3x 16-_ 17-_ 18-_ 19-_ 20-_ 21-_ 22-_ 23-_ 24-_ 25-_ 26-_ 27-_ 28-_ 29-_ 30-_ 31-_
4x 32-_ 33-_ 34-_ 35-_ 36-_ 37-_ 38-_ 39-_ 40-_ 41-_ 42-_ 43-_ 44-_ 45-_ 46-_ 47-_
5x 48-_ 49-_ 50-_ 51-_ 52-_ 53-_ 54-_ 55-_ 56-_ 57-_ 58-_ 59-_ 60-_ 61-_ 62-_ 63-_
6x 64-_ 65-_ 66-_ 67-_ 68-_ 69-_ 70-_ 71-_ 72-_ 73-_ 74-_ 75-_ 76-_ 77-_ 78-_ 79-_
7x 80-_ 81-_ 82-_ 83-_ 84-_ 85-_ 86-_ 87-_ 88-_ 89-_ 90-_ 91-_ 92-_ 93-_ 94-_ DEL
  Unused lead byte
  Lead byte
  Differences from JIS X 0208

Character sets 0x21-0x74 (row numbers 1-84: punctuation, alphabets, numbers, Kana, Kanji)

edit

Character set 0x75–0x76 (row numbers 85–86, additional kanji)

edit

This part is the source standard for a small number of CJK Unified Ideographs in Unicode, where it is designated with the JARIB- source prefix in the Unihan database.[10]

ARIB STD-B24 Kanji (double-byte) set (prefixed with 0x75)[11]
0 1 2 3 4 5 6 7 8 9 A B C D E F
2x
3402
𠅘
20158
4EFD
仿
4EFF
4F9A
4FC9
509C
511E
51BC
351F
5307
5361
536C
8A79
𠮷
20BB7
3x
544D
5496
549C
54A9
550E
554A
5672
56E4
5733
5734
FA10
5880
59E4
5A23
5A55
5BEC
4x
FA11
37E2
5EAC
5F34
5F45
5FB7
6017
FA6B
6130
6624
66C8
66D9
66FA
66FB
6852
9FC4
5x
6911
693B
6A45
6A91
6ADB
𣏌
233CC
𣏾
233FE
𣗄
235C4
6BF1
6CE0
6D2E
FA45
涿
6DBF
6DCA
6DF8
FA46
6x
6F5E
6FF9
7064
𤋮
FA6C
𤋮
242EE
7147
71C1
7200
739F
73A8
73C9
73D6
741B
7421
FA4A
7426
7x
742A
742C
7439
744B
3EDA
7575
7581
7772
4093
78C8
78E0
7947
79AE
9FC6
4103
ARIB STD-B24 Kanji (double-byte) set (prefixed with 0x76)[11]
0 1 2 3 4 5 6 7 8 9 A B C D E F
2x
9FC5
79DA
7A1E
筿
7B7F
7C31
4264
7D8B
7FA1
8118
813A
FA6D
82AE
845B
84DC
84EC
3x
8559
85CE
8755
87EC
880B
88F5
89D2
8AF6
8DCE
8FBB
8FF6
90DD
9127
912D
91B2
9233
4x
9288
9321
9348
9592
96DE
9903
9940
9AD9
9BD6
9DD7
9EB4
9EB5
5x
6x
7x

Character set 0x7A (row number 90, traffic symbols)

edit

Characters 90-45 through 90-63 and 90-66 through 90-84 (shown below shaded) are listed in the B24 standard only in table 7-10 (the list of extension characters), and are also the only characters in rows 90 through 91 which are not transport-related symbols; this is noted in the B24 standard in an endnote to table 7-10.[12] The remainder of the extensions are listed in both table 7-4 (the double-byte code chart) and table 7-10.[12]

ARIB STD-B24 Kanji (double-byte) set (prefixed with 0x7A)[5][13]
0 1 2 3 4 5 6 7 8 9 A B C D E F
2x
26CC
26CD
2757
26CF
26D0
26D1
26D2
26D5
26D3
26D4
3x 🅿
1F17F
🆊
1F18A
26D6
26D7
26D8
26D9
26DA
26DB
26DC
26DD
26DE
26DF
26E0
26E1
4x
2B55
3248
3249
324A
324B
324C
324D
324E
324F
2491
2492
2493
5x 🅊
1F14A
🅌
1F14C
🄿
1F13F
🅆
1F146
🅋
1F14B
🈐
1F210
🈑
1F211
🈒
1F212
🈓
1F213
🅂
1F142
🈔
1F214
🈕
1F215
🈖
1F216
🅍
1F14D
🄱
1F131
🄽
1F13D
6x
2B1B
2B24
🈗
1F217
🈘
1F218
🈙
1F219
🈚
1F21A
🈛
1F21B
26BF
🈜
1F21C
🈝
1F21D
🈞
1F21E
🈟
1F21F
🈠
1F220
🈡
1F221
🈢
1F222
🈣
1F223
7x 🈤
1F224
🈥
1F225
🅎
1F14E
3299
🈀
1F200
  Additions from table 7-10 not in table 7-4.

Character set 0x7B (row number 91, map symbols)

edit

Characters from ARIB STD-B24 which were not retained in ARIB STD-B62 are shown shaded.

ARIB STD-B24 Kanji (double-byte) set (prefixed with 0x7B)[5][13][14]
0 1 2 3 4 5 6 7 8 9 A B C D E F
2x
26E3
2B56
2B57
2B58
2B59
2613
328B
3012
26E8
3246
3245
26E9
[a]
0FD6
26EA
26EB
3x
26EC
2668
26ED
26EE
26EF
2693
2708
26F0
26F1
26F2
26F3
26F4
26F5
🅗
1F157
24B9
24C8
4x
26F6
🅟
1F15F
🆋
1F18B
🆍
1F18D
🆌
1F18C
🅹
1F179
26F7
26F8
26F9
26FA
🅻
1F17B
260E
26FB
26FC
26FD
26FE
5x 🅼
1F17C
26FF
6x
7x
  Not in ARIB STD-B62

Character set 0x7C (row number 92, units, enclosed forms, list markers, arrows)

edit

Characters from ARIB STD-B24 which were not retained in ARIB STD-B62 are shown shaded.

ARIB STD-B24 Kanji (double-byte) set (prefixed with 0x7C)[5][13][14]
0 1 2 3 4 5 6 7 8 9 A B C D E F
2x
27A1
2B05
2B06
2B07
2B2F
2B2E
5E74
6708
65E5
5186
33A1
33A5
339D
33A0
33A4
3x 🄀
1F100
2488
2489
248A
248B
248C
248D
248E
248F
2490
[b] [b] [b] [b] [b] [b]
4x 🄁
1F101
🄂
1F102
🄃
1F103
🄄
1F104
🄅
1F105
🄆
1F106
🄇
1F107
🄈
1F108
🄉
1F109
🄊
1F10A
3233
3236
3232
3231
3239
3244
5x
25B6
25C0
3016
3017
27D0
²
00B2
³
00B3
🄭
1F12D
(vn)[c] (ob)[c] (cb)[c] (ce[c] mb)[c] (hp)[c] (br)[c] (p)[c]
6x (s)[c] (ms)[c] (t)[c] (bs)[c] (b)[c] (tb)[c] (tp)[c] (ds)[c] (ag)[c] (eg)[c] (vo)[c] (fl)[c] (ke[c] y)[c] (sa[c] x)[c]
7x (sy[c] n)[c] (or[c] g)[c] (pe[c] r)[c] 🄬
1F12C
🄫
1F12B
3247
🆐
1F190
🈦
1F226
213B
  Not in ARIB STD-B62

Character set 0x7D (row number 93, game and weather symbols, fractions, units, enclosed forms)

edit

Characters from ARIB STD-B24 which were not retained in ARIB STD-B62 are shown shaded.

ARIB STD-B24 Kanji (double-byte) set (prefixed with 0x7D)[5][13][14]
0 1 2 3 4 5 6 7 8 9 A B C D E F
2x
322A
322B
322C
322D
322E
322F
3230
3237
337E
337D
337C
337B
2116
2121
3036
3x
26BE
🉀
1F240
🉁
1F241
🉂
1F242
🉃
1F243
🉄
1F244
🉅
1F245
🉆
1F246
🉇
1F247
🉈
1F248
🄪
1F12A
🈧
1F227
🈨
1F228
🈩
1F229
🈔
1F214
🈪
1F22A
4x 🈫
1F22B
🈬
1F22C
🈭
1F22D
🈮
1F22E
🈯
1F22F
🈰
1F230
🈱
1F231
2113
338F
3390
33CA
339E
33A2
3371
5x ½
00BD
2189
2153
2154
¼
00BC
¾
00BE
2155
2156
2157
2158
2159
215A
2150
215B
2151
2152
6x
2600
2601
2602
26C4
2616
2617
26C9
26CA
2666
2665
2663
2660
26CB
2A00
203C
2049
7x
26C5
2614
26C6
2603
26C7
26A1
26C8
269E
269F
266C
260E
  Not in ARIB STD-B62

Character set 0x7E (row number 94, list markers)

edit

Characters from ARIB STD-B24 which were not retained in ARIB STD-B62 are shown shaded.

ARIB STD-B24 Kanji (double-byte) set (prefixed with 0x7E)[5][13][14]
0 1 2 3 4 5 6 7 8 9 A B C D E F
2x
2160
2161
2162
2163
2164
2165
2166
2167
2168
2169
216A
216B
2470
2471
2472
3x
2473
2474
2475
2476
2477
2478
2479
247A
247B
247C
247D
247E
247F
3251
3252
3253
4x
3254
🄐
1F110
🄑
1F111
🄒
1F112
🄓
1F113
🄔
1F114
🄕
1F115
🄖
1F116
🄗
1F117
🄘
1F118
🄙
1F119
🄚
1F11A
🄛
1F11B
🄜
1F11C
🄝
1F11D
🄞
1F11E
5x 🄟
1F11F
🄠
1F120
🄡
1F121
🄢
1F122
🄣
1F123
🄤
1F124
🄥
1F125
🄦
1F126
🄧
1F127
🄨
1F128
🄩
1F129
3255
3256
3257
3258
3259
6x
325A
2460
2461
2462
2463
2464
2465
2466
2467
2468
2469
246A
246B
246C
246D
246E
7x
246F
2776
2777
2778
2779
277A
277B
277C
277D
277E
277F
24EB
24EC
325B
  Not in ARIB STD-B62

Single-byte sets

edit

Alphanumeric set

edit
ARIB STD-B24 Alphanumeric set[16]
0 1 2 3 4 5 6 7 8 9 A B C D E F
2x !
0021
"
0022
#
0023
$
0024
%
0025
&
0026
'
0027
(
0028
)
0029
*
002A
+
002B
,
002C
-
002D
.
002E
/
002F
3x 0
0030
1
0031
2
0032
3
0033
4
0034
5
0035
6
0036
7
0037
8
0038
9
0039
:
003A
;
003B
<
003C
=
003D
>
003E
?
003F
4x @
0040
A
0041
B
0042
C
0043
D
0044
E
0045
F
0046
G
0047
H
0048
I
0049
J
004A
K
004B
L
004C
M
004D
N
004E
O
004F
5x P
0050
Q
0051
R
0052
S
0053
T
0054
U
0055
V
0056
W
0057
X
0058
Y
0059
Z
005A
[
005B
¥
00A5
]
005D
^
005E
_
005F
6x `
0060
a
0061
b
0062
c
0063
d
0064
e
0065
f
0066
g
0067
h
0068
i
0069
j
006A
k
006B
l
006C
m
006D
n
006E
o
006F
7x p
0070
q
0071
r
0072
s
0073
t
0074
u
0075
v
0076
w
0077
x
0078
y
0079
z
007A
{
007B
|
007C
}
007D
203E
  Differences from US-ASCII

Hiragana set

edit
ARIB STD-B24 Hiragana set[17]
0 1 2 3 4 5 6 7 8 9 A B C D E F
2x
3041
3042
3043
3044
3045
3046
3047
3048
3049
304A
304B
304C
304D
304E
304F
3x
3050
3051
3052
3053
3054
3055
3056
3057
3058
3059
305A
305B
305C
305D
305E
305F
4x
3060
3061
3062
3063
3064
3065
3066
3067
3068
3069
306A
306B
306C
306D
306E
306F
5x
3070
3071
3072
3073
3074
3075
3076
3077
3078
3079
307A
307B
307C
307D
307E
307F
6x
3080
3081
3082
3083
3084
3085
3086
3087
3088
3089
308A
308B
308C
308D
308E
308F
7x
3090
3091
3092
3093
309D
309E
30FC
3002
300C
300D
3001
30FB
  Character allocations not following row 4 of JIS X 0208

Katakana set

edit
ARIB STD-B24 Katakana set[18]
0 1 2 3 4 5 6 7 8 9 A B C D E F
2x
30A1
30A2
30A3
30A4
30A5
30A6
30A7
30A8
30A9
30AA
30AB
30AC
30AD
30AE
30AF
3x
30B0
30B1
30B2
30B3
30B4
30B5
30B6
30B7
30B8
30B9
30BA
30BB
30BC
30BD
30BE
30BF
4x
30C0
30C1
30C2
30C3
30C4
30C5
30C6
30C7
30C8
30C9
30CA
30CB
30CC
30CD
30CE
30CF
5x
30D0
30D1
30D2
30D3
30D4
30D5
30D6
30D7
30D8
30D9
30DA
30DB
30DC
30DD
30DE
30DF
6x
30E0
30E1
30E2
30E3
30E4
30E5
30E6
30E7
30E8
30E9
30EA
30EB
30EC
30ED
30EE
30EF
7x
30F0
30F1
30F2
30F3
30F4
30F5
30F6
30FD
30FE
30FC
3002
300C
300D
3001
30FB
  Character allocations not following row 5 of JIS X 0208

JIS X 0201 Katakana set

edit
ARIB STD-B24 JIS X 0201 Katakana set[19]
0 1 2 3 4 5 6 7 8 9 A B C D E F
2x
FF61
FF62
FF63
FF64
FF65
FF66
FF67
FF68
FF69
FF6A
FF6B
FF6C
FF6D
FF6E
FF6F
3x
FF70
FF71
FF72
FF73
FF74
FF75
FF76
FF77
FF78
FF79
FF7A
FF7B
FF7C
FF7D
FF7E
ソ
FF7F
4x
FF80
FF81
FF82
FF83
FF84
FF85
FF86
FF87
FF88
FF89
FF8A
FF8B
FF8C
FF8D
FF8E
FF8F
5x
FF90
FF91
FF92
FF93
FF94
FF95
FF96
FF97
FF98
FF99
FF9A
FF9B
FF9C
FF9D
FF9E
FF9F
6x
7x

Mosaic sets

edit
ARIB STD-B24 Mosaic Set A[20] (ISO-IR-71)[21]
0 1 2 3 4 5 6 7 8 9 A B C D E F
2x 🬀
1FB00
🬁
1FB01
🬂
1FB02
🬃
1FB03
🬄
1FB04
🬅
1FB05
🬆
1FB06
🬇
1FB07
🬈
1FB08
🬉
1FB09
🬊
1FB0A
🬋
1FB0B
🬌
1FB0C
🬍
1FB0D
🬎
1FB0E
3x 🬏
1FB0F
🬐
1FB10
🬑
1FB11
🬒
1FB12
🬓
1FB13
258C
🬔
1FB14
🬕
1FB15
🬖
1FB16
🬗
1FB17
🬘
1FB18
🬙
1FB19
🬚
1FB1A
🬛
1FB1B
🬜
1FB1C
🬝
1FB1D
4x 🬼
1FB3C
🬽
1FB3D
🬾
1FB3E
🬿
1FB3F
🭀
1FB40
25E3
🭁
1FB41
🭂
1FB42
🭃
1FB43
🭄
1FB44
🭅
1FB45
🭆
1FB46
🭨
1FB68
🭩
1FB69
🭰
1FB70
🮕
1FB95
5x 🭇
1FB47
🭈
1FB48
🭉
1FB49
🭊
1FB4A
🭋
1FB4B
25E2
🭌
1FB4C
🭍
1FB4D
🭎
1FB4E
🭏
1FB4F
🭐
1FB50
🭑
1FB51
🭪
1FB6A
🭫
1FB6B
🭵
1FB75
2588
6x 🬞
1FB1E
🬟
1FB1F
🬠
1FB20
🬡
1FB21
🬢
1FB22
🬣
1FB23
🬤
1FB24
🬥
1FB25
🬦
1FB26
🬧
1FB27
2590
🬨
1FB28
🬩
1FB29
🬪
1FB2A
🬫
1FB2B
🬬
1FB2C
7x 🬭
1FB2D
🬮
1FB2E
🬯
1FB2F
🬰
1FB30
🬱
1FB31
🬲
1FB32
🬳
1FB33
🬴
1FB34
🬵
1FB35
🬶
1FB36
🬷
1FB37
🬸
1FB38
🬹
1FB39
🬺
1FB3A
🬻
1FB3B
ARIB STD-B24 Mosaic Set B[20] (ISO-IR-137)[22]
0 1 2 3 4 5 6 7 8 9 A B C D E F
2x
2596
25AA
𜹇
1CE47
259F
25B6
🠷
1F837
🮛
1FB9B
🯣
1FBE3
🯫
1FBEB
3x
2584
2597
25AC
𜹐
1CE50
2599
25C0
🠵
1F835
🮚
1FB9A
🯡
1FBE1
🯩
1FBE9
4x
5x
6x 🭒
1FB52
🭓
1FB53
🭔
1FB54
🭕
1FB55
🭖
1FB56
25E5
🭗
1FB57
🭘
1FB58
🭙
1FB59
🭚
1FB5A
🭛
1FB5B
🭜
1FB5C
🭬
1FB6C
🭭
1FB6D
7x 🭝
1FB5D
🭞
1FB5E
🭟
1FB5F
🭠
1FB60
🭡
1FB61
25E4
🭢
1FB62
🭣
1FB63
🭤
1FB64
🭥
1FB65
🭦
1FB66
🭧
1FB67
🭮
1FB6E
🭯
1FB6F
� Not in Unicode
ARIB STD-B24 Mosaic Set C[20]
0 1 2 3 4 5 6 7 8 9 A B C D E F
2x 𜹑
1CE51
𜹒
1CE52
𜹓
1CE53
𜹔
1CE54
𜹕
1CE55
𜹖
1CE56
𜹗
1CE57
𜹘
1CE58
𜹙
1CE59
𜹚
1CE5A
𜹛
1CE5B
𜹜
1CE5C
𜹝
1CE5D
𜹞
1CE5E
𜹟
1CE5F
3x 𜹠
1CE60
𜹡
1CE61
𜹢
1CE62
𜹣
1CE63
𜹤
1CE64
𜹥
1CE65
𜹦
1CE66
𜹧
1CE67
𜹨
1CE68
𜹩
1CE69
𜹪
1CE6A
𜹫
1CE6B
𜹬
1CE6C
𜹭
1CE6D
𜹮
1CE6E
𜹯
1CE6F
4x
5x 𜺏
1CE8F
6x 𜹰
1CE70
𜹱
1CE71
𜹲
1CE72
𜹳
1CE73
𜹴
1CE74
𜹵
1CE75
𜹶
1CE76
𜹷
1CE77
𜹸
1CE78
𜹹
1CE79
𜹺
1CE7A
𜹻
1CE7B
𜹼
1CE7C
𜹽
1CE7D
𜹾
1CE7E
𜹿
1CE7F
7x 𜺀
1CE80
𜺁
1CE81
𜺂
1CE82
𜺃
1CE83
𜺄
1CE84
𜺅
1CE85
𜺆
1CE86
𜺇
1CE87
𜺈
1CE88
𜺉
1CE89
𜺊
1CE8A
𜺋
1CE8B
𜺌
1CE8C
𜺍
1CE8D
𜺎
1CE8E

Most of ARIB STD-B24 Mosaic Set D does not exist in Unicode.

Shift_JIS variant

edit

In addition to the modified ISO 2022 encoding, the B24 standard also specifies a Shift JIS encoding following JIS X 0208:1997, but with the addition of the extended characters in the kanji set.[1]

First byte
0 1 2 3 4 5 6 7 8 9 A B C D E F
0
1
2 ! " # $ % & ' ( ) * + , - . /
3 0 1 2 3 4 5 6 7 8 9 : ; < = > ?
4 @ A B C D E F G H I J K L M N O
5 P Q R S T U V W X Y Z [ ¥ ] ^ _
6 ` a b c d e f g h i j k l m n o
7 p q r s t u v w x y z { | }
8
9
A
B ソ
C
D
E
F
Second byte
0 1 2 3 4 5 6 7 8 9 A B C D E F
0
1
2
3
4
5
6
7
8
9
A
B
C
D
E
F
 
Non printable ASCII character
Unaltered ASCII character
Modified ASCII character
Single-byte half-width katakana
First byte of a double-byte character, used by JIS X 0208
First byte of an ARIB extended character
Not used as first byte, unallocated space in JIS X 0208
Not used as first byte
Second byte of a double-byte character whose first half of the JIS sequence was odd
Second byte of a double-byte character whose first half of the JIS sequence was even
Unused as second byte of a double-byte character

See also

edit

Footnotes

edit
  1. Glossed as "temple" (i.e. Buddhist temple) in B24 table 7-10 (the list of extension characters).
  2. 1 2 3 4 5 6 Small form (70% size per code chart / table 7-10) of a kanji character. Shown here simulated. Private Use Area code points shown are those used by the Nishiki-teki font.[15]
  3. 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 Musical abbreviation (or half thereof) not present in Unicode, simulated here with multiple characters. Private Use Area code points shown are those used by the Nishiki-teki font.

References

edit
  1. 1 2 ARIB (2008), p. 105, part 2, section 7.3 Cite error: The named reference "ReferenceA" was defined multiple times with different content (see the help page).
  2. 1 2 ARIB (2008)
  3. Suignard, Michel (2008-03-11). "ISO/IEC JTC1/SC2/WG2 N 3397: Japanese TV Symbols" (PDF).
  4. "Unicode 5.2 Emoji List". Emojipedia.
  5. 1 2 3 4 5 6 ARIB (2014), pp. 33–50, part 2, Table 5-2
  6. ARIB (2008), pp. 48–52
  7. 1 2 ARIB (2008), p. 39, part 2, Table 7-3
  8. Japanese National Committee on ISO/TC97/SC2 (1984-07-01). Japanese Graphic Character Set for Information Interchange (PDF). ITSCJ/IPSJ. ISO-IR-87.{{citation}}: CS1 maint: numeric names: authors list (link)
  9. 1 2 RFC 1468 (IETF)
  10. "kIRG_JSource". Unicode Han Database (Unihan) (Unicode Standard Annex). Unicode Consortium. 2024-07-31. UAX #38.
  11. 1 2 ARIB (2017), pp. 72–86, part 2, Tables 7-11 and 7-12
  12. 1 2 ARIB (2008), p. 72
  13. 1 2 3 4 5 ARIB (2008), pp. 54–72, part 2, Table 7-10
  14. 1 2 3 4 ARIB (2008), pp. 46–47, part 2, Table 7-4
  15. "Nishiki-teki Version 3.82b (2021-07-23) - 6,416 characters in the Private Use Areas" (PDF).
  16. ARIB (2008), p. 48, part 2, Table 7-5
  17. ARIB (2008), p. 50, part 2, Table 7-7
  18. ARIB (2008), p. 49, part 2, Table 7-6
  19. ARIB (2008), p. 52, part 2, Table 7-9
  20. 1 2 3 ARIB (2008), p. 51, part 2, Table 7-8
  21. CCITT (1983-10-01). Second Supplementary Set of Mosaic Characters (PDF). ITSCJ/IPSJ. ISO-IR-71.
  22. CCITT (1987-07-31). Mosaic-1 Set of Data Syntax 1 of CCITT Rec. T.101 (PDF). ITSCJ/IPSJ. ISO-IR-137.

ARIB standards

edit

Further reading

edit
edit