V
V
vlarkanov2019-04-17 14:43:21
C++ / C#
vlarkanov, 2019-04-17 14:43:21

How to correctly find the length of a Cyrillic string in Unicode in C++?

How to correctly calculate the length of a string with Russian letters, punctuation marks and spaces?
So far, I found that the length of the string with rbhbkkbwtq turns out to be twice the correct one.

int strLength = strBody.length();
std::cout << strBody << ":" << strLength ;

...

Ololo:5
Ололо:10

Answer the question

In order to leave comments, you need to log in

2 answer(s)
S
SerJook, 2019-04-17
@vlarkanov

If under Linux, then somehow you can calculate like this:

#include <cstdlib>
//....
setlocale(LC_ALL, "ru_RU.utf8");
int strLength = mbstowcs(NULL, strBody.c_str(), 0);

Unfortunately, this won't work on Windows.

R
Ronald McDonald, 2019-04-17
@Zoominger

So far, I found that the length of the string with Russian letters is twice as long as the correct one.

Pusho UTF-8 encoding.
Although the length of the string should be the same, it's more about the size in memory.

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question