Answer the question
In order to leave comments, you need to log in
What regular expression to use to search for a block of Russian text?
There is an HTML page, you need to pull out a block of Russian text from it. There is only one block of text in Russian. Advise the regular expression or how it is better to make it?
Answer the question
In order to leave comments, you need to log in
Depending on what is meant by the block. If there are no tags inside it, then you can somehow cut it out like this:
>([^<]*[A-Zaa-z][^<]*)<
As a result, everything between >< and contains at least one Russian letter. This is offhand, so be sure to make sure that the regularizer works with exactly the same encoding.
use
jsoup .
for REGEX, set a regular expression that includes Russian characters, punctuation marks.
Didn't find what you were looking for?
Ask your questionAsk a Question
731 491 924 answers to any question