G
G
Ganjubas_Original2017-05-16 11:56:05
Search Engine Optimization
Ganjubas_Original, 2017-05-16 11:56:05

Site on WordPress. How do links like site_address.ru/?p=333 get into the index?

Greetings, comrades. It's my first time with the press. The problem is that search engine indexes include pages like site_address.ru/?p=333. I read and looked. Everywhere it is advised to set up permalinks, but they are already set up in the admin panel. This is how the site_address is /%category%/%postname%/
And in fact, when following links with GET, we automatically redirect to the page with the correct url. But why do search engines see these pages and add them to the index as duplicates? And the main question is how to fix it?

Answer the question

In order to leave comments, you need to log in

4 answer(s)
D
djalin, 2017-05-16
@djalin

User-agent: *
Crawl-delay: 1  
Disallow: /webstat/
Disallow: /cgi-bin
Disallow: /wp-admin
Disallow: /wp-includes
Disallow: /wp-content/plugins
Disallow: /wp-content/cache
Disallow: /wp-content/themes
Disallow: /trackback
Disallow: */trackback
Disallow: /wp-trackback
Disallow: /wp-feed
Disallow: /comments
Disallow: /wp-comments
Disallow: /xmlrpc.php
Disallow: */trackback
Disallow: */feed
Disallow: /feed/
Disallow: */comments
Disallow: /category
Disallow: /category/*/*
Disallow: /tag
Disallow: /*?*
Disallow: /*?
Disallow: /to/
Disallow: /&*
Disallow: /page/
Disallow: /goto/
Disallow: /goto/*

S
Site Developer, 2017-05-16
@secsite

But why do search engines see these pages and add them to the index as duplicates? And the main question is how to fix it?
The main question is highlighted. The rest will disappear as soon as the main reason is clarified. And where they come from - you can see in the same webmaster or server logs (server statistics).
Although if there is a 301 redirect (you need to make sure that it is 301, not 302, and especially not 200), then there should be no duplicates. Maybe they were, but the PS has not yet been indexed.

M
mletov, 2017-05-16
@mletov

Incomprehensible pages of GET requests in Yandex Webmaster statistics, how to deal with this?
Play around with robots.txt
clean-param or disallow help
Why this happens is not important. Most likely, search engines determine that the site is on WP, and start hitting typical addresses, if they do not return 404, they get into the index.
It happened to me with Drupal, Yandex persistently indexed pages like /node/123 (analogue of WordPress /?p=123), although the page aliases were registered, but there were no links of this kind in the template. I wrote a rule in robots.txt according to the pattern like "disallow node/*", everything was reindexed fine

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question