Answer the question
In order to leave comments, you need to log in
How to check a site for broken links?
Good afternoon.
There is a site . It needs to be checked for broken links. Checked with the help of services and Xenu. Everyone sees only the main one. There are no sources, I rummaged through the page code and concluded that it was written in Java (I could be wrong), I think this is the problem. How can I automate the search for broken links on this site?
PS When I get access to the site, I'll try to analyze it through Google Webmasters.
Answer the question
In order to leave comments, you need to log in
1) Go to the page you need
2) Using regular expressions or something else, select all href
3) Follow all links and if the response code is not 200, then write this link to non-working
This is Sodom!
If you look at the http response from nu.edu.kz , you will immediately see this:
/**
* This is the loopback script to process the url before the real page loads. It introduces
* a separate round trip. During this first roundtrip, we currently do two things:
* - check the url hash portion, this is for the PPR Navigation.
* - do the new window detection
* the above two are both controled by parameters in web.xml
You can make a robot that will follow links and receive a response from the server (200, 404, 500, etc.), if the answer is 200, then the link is not broken.
But that option probably won't work for you.
the simplest option is
1) to set up a local proxy
2) to know an adequate site-rocking program in which you can set a proxy (if teleport pro is set)
a proxy is required in order to track the statistics of the animal and a site-rocking chair in order to replace you with access to all available URLs
If quickly manually - www.brokenlinkcheck.com
Similar services are enough.
Maybe you are doing something wrong, here is a more detailed article https://k-gayduk.ru/blog/tech/bitye-ssylki.html , it always helped me.
Didn't find what you were looking for?
Ask your questionAsk a Question
731 491 924 answers to any question