Where does text.ru get data for checking for plagiarism?
Hello. Does anyone have a guess where text.ru gets the data it uses to check for plagiarism? They seem to have some faster source of data than the SERPs.
For example, I added unique text content to one of the sites, and in just a minute the content had already been detected and analyzed by the text.ru algorithm. Meanwhile, it still takes more than a week for this content to appear in the search results of Yandex and Google.
Most likely it is the classic "cumulative" big-data approach. A background process asynchronously crawls and parses pages from the web, which keeps the data up to date and continually replenished. Metadata for fast analysis is then built from the parsed pages and stored in the service's database. When you actually enter a text and submit it for checking, it is compared against that metadata using fuzzy search or other optimized text-matching algorithms, and the result is returned. Of course, I could be wrong, but if I had to implement such a solution, it would work along the lines described above.
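To make the idea concrete, here is a minimal sketch of what such a pipeline could look like. It is only a guess at the general technique, not how text.ru actually works: crawled pages are reduced to hashed word shingles (the "metadata"), stored in an inverted index, and a submitted text is scored against them with Jaccard similarity. The names `FingerprintIndex`, `shingles`, and `SHINGLE_SIZE` are made up for this example, and the in-memory dictionary stands in for a real database.

```python
import hashlib
import re
from collections import defaultdict

SHINGLE_SIZE = 3  # words per shingle; an arbitrary choice for this sketch


def shingles(text, size=SHINGLE_SIZE):
    """Return the set of hashed word shingles for a text."""
    words = re.findall(r"\w+", text.lower())
    if len(words) < size:
        grams = [" ".join(words)] if words else []
    else:
        grams = [" ".join(words[i:i + size]) for i in range(len(words) - size + 1)]
    # Store short hashes instead of the raw text to keep the "metadata" compact.
    return {hashlib.md5(g.encode("utf-8")).hexdigest()[:16] for g in grams}


class FingerprintIndex:
    """Hypothetical in-memory stand-in for the service database."""

    def __init__(self):
        self.postings = defaultdict(set)  # shingle hash -> document ids
        self.sizes = {}  # document id -> number of distinct shingles

    def add(self, doc_id, text):
        """Called by the background crawler for every parsed page."""
        fingerprint = shingles(text)
        self.sizes[doc_id] = len(fingerprint)
        for h in fingerprint:
            self.postings[h].add(doc_id)

    def check(self, text):
        """Return (doc_id, Jaccard similarity) pairs, best match first."""
        fingerprint = shingles(text)
        shared = defaultdict(int)
        for h in fingerprint:
            for doc_id in self.postings.get(h, ()):
                shared[doc_id] += 1
        scores = []
        for doc_id, common in shared.items():
            union = len(fingerprint) + self.sizes[doc_id] - common
            scores.append((doc_id, common / union))
        return sorted(scores, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    index = FingerprintIndex()
    index.add("page-1", "the quick brown fox jumps over the lazy dog")
    index.add("page-2", "completely unrelated sentence about databases and indexes")
    print(index.check("the quick brown fox sleeps"))
```

Because the index is filled in the background, a check only has to look up hashes, so it does not depend on when a search engine gets around to indexing the page. A real service would presumably use something more scalable than exact shingle sets (MinHash or SimHash, for example), but that is also an assumption.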
ha ha, everything unknown to us seems wonderful
there is no secret: search engines
"the appearance of this content in the search results of Yandex and Google still has to wait more than one week" ... and in DuckDuckGo you don't have to wait