I
I
iXelper2018-08-10 19:55:43
C++ / C#
iXelper, 2018-08-10 19:55:43

How to parse all html code - C#?

You need to parse the html code from the page using the webBrowser element.
There is a page with a list, BUT IT IS LOADING GRADUALLY.
Question: how to parse all html content and get to the last element of the list?

Answer the question

In order to leave comments, you need to log in

1 answer(s)
I
Ilya, 2018-08-11
@Gorily

Method 1. Qualitatively, but difficult.
Content loading usually occurs by GET requests with parameters, where the parameters are how many articles to skip from the beginning and how many to receive. In response, ready-made HTML code usually arrives, which is simply embedded into the page and can be easily parsed.
Vector to study:
Take Selenium + Chrome (for clarity), open a page in them and emulate by standard means either pressing a button (for example, "Show more") or pressing the space bar (scrolls the page down). From time to time check using XPath whether the bottom has been reached.
Vector to study:
Both paths are very accessible to study.

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question