Why, after restoring the site (after hacking), Google robots continue to break into the left addresses of the left site maps?

I

Ivan Petrov2021-05-13 18:13:19

Google

Ivan Petrov, 2021-05-13 18:13:19

The WordPress site was hacked, some left pages and sitemaps of the /sitemap_index_5.xml type were created.
The site was restored and these sitemaps are not listed in robots.txt, but judging by the server logs, IP addresses from Google are still accessing non-existent maps site. Records like this:

66.249.69.1 - - [13/May/2021:17:49:51 +0300] "GET /blucher12.xml HTTP/1.0" 404 15007
66.249.69.6 - - [13/May/2021:18:06:30 + 0300] "GET /sheepshank35.xml HTTP/1.0" 404 15009
66.249.69.1 - - [13/May/2021:18:12:05 +0300] "GET /uglifruit13.xml HTTP/1.0" 404 15009

In the 2ip.ru/whois/ service, I looked at this IP address and the name of the provider is visible there: Google LLC, host: crawl-66-249-69-1.googlebot.com
As far as I understand, Google bots climb into these non-existent sitemaps, but why and how to fix it?

Reply

Answer the question

In order to leave comments, you need to log in

2 answer(s)

D

DevMan, 2021-05-13
@IvanPetrow

because they are in the memory of Google.
if they cleaned it up, and these urls give errors, then over time, Google will throw them out of its list.

D

Dimonchik, 2021-05-13
@dimonchik2013

you, most importantly, get rid of the idea that "Google starts searching on Internet sites" only after you enter a query - girls in cooking courses tell this - Google searches for what it has saved,
then an initial understanding of the work of crawlers will appear, and you look there - and SEO
in other words - what have you conjured up there - Google is not a decree