UTF-8 Encoding Visualizer
Binary bit mapping from Unicode to UTF-8
Character: €
Unicode Code Point: U+20AC
Decimal: 8364
Binary (16 bits): 0010 0000 1010 1100
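The three notations shown for this character can be reproduced with Python built-ins. This is a minimal sketch; the variable names are illustrative and not part of the visualizer.

```python
# Show the euro sign's code point in the hex, decimal, and binary
# notations used above. Variable names are illustrative assumptions.
char = "\u20ac"                       # the euro sign
code_point = ord(char)                # integer value of the code point

hex_form = f"U+{code_point:04X}"      # conventional U+XXXX notation
binary_form = f"{code_point:016b}"    # zero-padded to 16 bits

print(hex_form)      # U+20AC
print(code_point)    # 8364
print(binary_form)   # 0010000010101100
```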
UTF-8 Encoding (3 bytes)
Byte 1: 0xE2 (226 decimal)
Control Bits: 1110 (leading byte, 3 bytes total)
Data Bits: 0010
Complete Byte: 11100010
Byte 2: 0x82 (130 decimal)
Control Bits: 10 (continuation)
Data Bits: 000010
Complete Byte: 10000010
Byte 3: 0xAC (172 decimal)
Control Bits: 10 (continuation)
Data Bits: 101100
Complete Byte: 10101100
Combined Data Bits = Unicode Binary: 0010 000010 101100 (16 bits of UTF-8 data)
Control bits (UTF-8 format)
Data bits (grouped by byte)
Color Legend:
Leading byte (blue family)
Continuation byte 2 (amber family)
Continuation byte 3 (rose family)
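The recombination of data bits can be checked in code by masking off each byte's control bits and concatenating what remains. The following is a hedged sketch for the 3-byte case only; the variable names are illustrative, and it performs no validation of the input bytes.

```python
# Recover the code point from the three bytes 0xE2 0x82 0xAC by
# stripping the control bits. Names are illustrative assumptions.
encoded = bytes([0xE2, 0x82, 0xAC])

lead_data = encoded[0] & 0b00001111    # drop the 1110 prefix, keep 4 data bits
cont1_data = encoded[1] & 0b00111111   # drop the 10 prefix, keep 6 data bits
cont2_data = encoded[2] & 0b00111111   # drop the 10 prefix, keep 6 data bits

# Concatenate 4 + 6 + 6 = 16 data bits into one integer.
code_point = (lead_data << 12) | (cont1_data << 6) | cont2_data

grouped_bits = f"{lead_data:04b} {cont1_data:06b} {cont2_data:06b}"
print(grouped_bits)              # 0010 000010 101100
print(f"U+{code_point:04X}")     # U+20AC
```

Comparing `chr(code_point)` with `encoded.decode("utf-8")` is a quick way to confirm the manual result against Python's own decoder.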
UTF-8 Encoding Patterns:
1 byte:  0xxxxxxx
2 bytes: 110xxxxx 10xxxxxx
3 bytes: 1110xxxx 10xxxxxx 10xxxxxx
4 bytes: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
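The four patterns can be turned into a small encoder. The sketch below is one possible implementation, not the visualizer's own code: the function name `utf8_encode` is hypothetical, and the range limits (0x7F, 0x7FF, 0xFFFF, 0x10FFFF) are not listed above but follow from the 7, 11, 16, and 21 data bits each pattern can hold, capped at the top of the Unicode range.

```python
def utf8_encode(code_point: int) -> bytes:
    """Encode a single code point using the four UTF-8 bit patterns.

    Hypothetical helper for illustration only.
    """
    if code_point < 0 or code_point > 0x10FFFF:
        raise ValueError("code point outside the Unicode range")
    if 0xD800 <= code_point <= 0xDFFF:
        raise ValueError("surrogate code points cannot be encoded in UTF-8")

    if code_point <= 0x7F:
        # 1 byte: 0xxxxxxx
        return bytes([code_point])
    if code_point <= 0x7FF:
        # 2 bytes: 110xxxxx 10xxxxxx
        return bytes([
            0b11000000 | (code_point >> 6),
            0b10000000 | (code_point & 0x3F),
        ])
    if code_point <= 0xFFFF:
        # 3 bytes: 1110xxxx 10xxxxxx 10xxxxxx
        return bytes([
            0b11100000 | (code_point >> 12),
            0b10000000 | ((code_point >> 6) & 0x3F),
            0b10000000 | (code_point & 0x3F),
        ])
    # 4 bytes: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
    return bytes([
        0b11110000 | (code_point >> 18),
        0b10000000 | ((code_point >> 12) & 0x3F),
        0b10000000 | ((code_point >> 6) & 0x3F),
        0b10000000 | (code_point & 0x3F),
    ])


# The euro sign falls in the 3-byte range.
print(utf8_encode(0x20AC).hex())   # e282ac
```

In practice Python's built-in `str.encode("utf-8")` does the same job; the manual version is only useful for seeing where each group of data bits ends up.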