I
I
Ivan Melnikov2019-08-15 14:45:49
Unicode
Ivan Melnikov, 2019-08-15 14:45:49

Is there information available anywhere on the frequency of use of individual unicode characters in different text styles and in different languages?

It is necessary to evaluate the degree of "deadness" of individual characters of the cp1251 encoding.

Answer the question

In order to leave comments, you need to log in

1 answer(s)
D
dmshar, 2019-08-15
@dmshar

1. What does Unicode have to do with cp1251 encoding? (Hint cp1251 - 8-bit encoding, Unicode - at least 16-bit)
2. What does cp1251 have to do with "different languages" (Hint cp1251 - Cyrillic, but in fact - Russian-language encoding)
3. What does the "dead" character mean? Well, for example, the symbol "~" is almost dead. And it does not depend on the style of the text, nor on the language.
4. Modern Unicode (as of May 2019) contains 137,994 characters. How do you imagine a table with the frequency of their use?

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question