How to properly close technical directories and duplicate site pages from Google indexing?

O

Oleg Keeach2019-02-14 15:39:51

Google

Oleg Keeach, 2019-02-14 15:39:51

I am promoting several sites on CMS Opencart, and a question arose about closing technical pages and catalogs from indexing:
Link to robots.txt on one of these sites - miumia.net.ua/robots.txt
and link to the sitemap - miumia.net.ua/ sitemap.xml
Here is an example of a page when switching from a category on the site - miumia.net.ua/katalog-tkanej/byaz-dlya-poshiva?pro...
Here is an example of a duplicate page (most likely viewed products from the module) - miumia.net. ua/index.php?route=product/product&prod...
How can such pages be more efficiently and correctly excluded from indexing:
1) Register CNC on categories and products, and close the rest of the technical pages in the robot?
2) Register on each page unnecessary for the search engine <meta name=“robots” content=“noindex,nofollow”>+ exclude them in the robots (and does it make sense to close them both in the robots and on each individual page) ?
3) Or are there plugins that prescribe CNC for Opencart (if there are few goods, then you can manually prescribe them, but when there are hundreds and thousands of them - it's hard :C)?
And I would like to know - if at the beginning of the optimization the Google robot has already indexed all these technical directories and duplicate pages - and I will start optimizing these sites in the ways indicated above and send the changes to Google Webmaster - how long it will take (approximately) for them to fall out of the index And will they even fall out like this?
Thanks in advance, experience in optimizing and promoting small and adequate information on this subject is not yet known from anyone :)

Reply

Answer the question

In order to leave comments, you need to log in

2 answer(s)

A

Alexander, 2019-02-14
@keeach

robots is only a recommendation to the robot, which means it can ignore it. According to Google's help, you can prevent getting into the index:
- server response with codes 40*
- special header in the server response
- meta name="robots" content="noindex,nofollow"
if pages are included in the index, or there are links from outside, then robots will be ignored , although it will be written in the output that the result is hidden, etc., but apparently this content will participate in the ranking

P

Puma Thailand, 2019-02-14
@opium

Surely there you can set up a canonical url and do not suffer with closing