UTF-8 Encoding Visualizer

Binary bit mapping from Unicode to UTF-8

Unicode Code Point
U+20AC
Decimal: 8364
Binary (16 bits)
0
0
1
0
0
0
0
0
1
0
1
0
1
1
0
0
UTF-8 Encoding (3 bytes)
Byte 1
0xE2
226 decimal
Control Bits
1
1
1
0
Leading byte (3 bytes total)
Data Bits
0
0
1
0
Complete Byte
11100010
Byte 2
0x82
130 decimal
Control Bits
1
0
10 = Continuation
Data Bits
0
0
0
0
1
0
Complete Byte
10000010
Byte 3
0xAC
172 decimal
Control Bits
1
0
10 = Continuation
Data Bits
1
0
1
1
0
0
Complete Byte
10101100
Combined Data Bits = Unicode Binary
0
0
1
0
0
0
0
0
1
0
1
0
1
1
0
0
16 bits (UTF-8 data bits)
Control bits (UTF-8 format)
Data bits (grouped by byte)
Color Legend:
Leading byte (blue family)
Continuation byte 2 (amber family)
Continuation byte 3 (rose family)
UTF-8 Encoding Patterns:
1 byte: 0xxxxxxx
2 bytes: 110xxxxx 10xxxxxx
3 bytes: 1110xxxx 10xxxxxx 10xxxxxx
4 bytes: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx