I
I
Ivan Petrov2021-05-13 18:13:19
Google
Ivan Petrov, 2021-05-13 18:13:19

Why, after restoring the site (after hacking), Google robots continue to break into the left addresses of the left site maps?

The WordPress site was hacked, some left pages and sitemaps of the /sitemap_index_5.xml type were created.
The site was restored and these sitemaps are not listed in robots.txt, but judging by the server logs, IP addresses from Google are still accessing non-existent maps site. Records like this:

66.249.69.1 - - [13/May/2021:17:49:51 +0300] "GET /blucher12.xml HTTP/1.0" 404 15007
66.249.69.6 - - [13/May/2021:18:06:30 + 0300] "GET /sheepshank35.xml HTTP/1.0" 404 15009
66.249.69.1 - - [13/May/2021:18:12:05 +0300] "GET /uglifruit13.xml HTTP/1.0" 404 15009

In the 2ip.ru/whois/ service, I looked at this IP address and the name of the provider is visible there: Google LLC, host: crawl-66-249-69-1.googlebot.com
As far as I understand, Google bots climb into these non-existent sitemaps, but why and how to fix it?

Answer the question

In order to leave comments, you need to log in

2 answer(s)
D
DevMan, 2021-05-13
@IvanPetrow

because they are in the memory of Google.
if they cleaned it up, and these urls give errors, then over time, Google will throw them out of its list.

D
Dimonchik, 2021-05-13
@dimonchik2013

you, most importantly, get rid of the idea that "Google starts searching on Internet sites" only after you enter a query - girls in cooking courses tell this - Google searches for what it has saved,
then an initial understanding of the work of crawlers will appear, and you look there - and SEO
in other words - what have you conjured up there - Google is not a decree

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question