Why does Cyrillic take 2 bytes per character in utf8?

V

vrazbros2018-12-15 17:05:50

Character encoding

vrazbros, 2018-12-15 17:05:50

Why does Cyrillic take 2 bytes per character in utf8 ?

Reply

Answer the question

In order to leave comments, you need to log in

2 answer(s)

M

MaksPaverov, 2018-12-15
@MaksPaverov

UTF-8 (from the English Unicode Transformation Format, 8-bit - “Unicode transformation format, 8-bit”) is one of the generally accepted and standardized text encodings that allows you to store Unicode characters using a variable number of bytes (from 1 to 6) .
FROM 1 to 6 BYTES (each of which is 8 BITS)
Depends on the character, Russians take 2 bytes

A

Alexander, 2018-12-15
@alexr64

https://ru.wikipedia.org/wiki/UTF-8
https://ru.wikipedia.org/wiki/%D0%AE%D0%BD%D0%B8%D...
https://ru. wikipedia.org/wiki/%D0%9A%D0%B8%D1%80%D...
Because in Unicode, 5 blocks of a two-byte range were allocated for Cyrillic.