S
S
Sergey Brovko2016-02-12 01:43:34
PHP
Sergey Brovko, 2016-02-12 01:43:34

How to parse an HTML page, an endless list of pages?

Guys, help .. In general, there is a task to parse the site, audio recordings are stored on the site. I had practically nothing to do with parsing, I decided to try it, but I ran into such a problem: on the site, records are loaded in an endless list (scroll to the bottom - get a new portion), each record leads to a page from which you need to pick up a certain text (title) and the actual audio file , which I will later download with name = title.
On the main page, where an endless list is a link to each entry of the form , and on the entry itself there is already a bunch of text and an audio recording Download
Actually, the question is how to parse if it is loaded with an endless list? Please provide examples of how to solve this problem.

Answer the question

In order to leave comments, you need to log in

1 answer(s)
E
entermix, 2016-02-12
@cyber01

Actually the question is how to carry out parsing if it is loaded with an infinite list?

1. Enable Firebug
2. Study the principle by which new posts are loaded, in fact, it works like a regular pagination (site.ru/start=0, site.ru/start=10,..., ), but only the necessary information is returned , not the whole page, usually it is html code in json format
3. Go through the pages and get the necessary information

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question