HTML
R000M, 2019-09-04 13:25:05

Are there services that can extract text from a page by URL?

You give the service the URL of the page, and it returns the text without menus, comments, ads, "Most interesting in 24 hours", "Reading now", etc.
For example, if the page is from a news site, then only the text of the news itself is returned.


3 answer(s)
Vitaly Sukhomlinov, 2019-09-04
@Licut

Most likely there is no such service. A parser has to be configured for the specific DOM tree of the site being parsed. So the options are either to find a general-purpose parsing service and arrange for it to be customized for you, or to order a simple PHP parser from a programmer, built with curl and something like phpquery, for example.
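To illustrate what such a custom parser might look like, here is a minimal sketch in Python (swapped in for the PHP/curl/phpquery combination mentioned above), using only the standard library. It is a rough heuristic, not a general solution: the `SKIP_TAGS` set, the `ParagraphExtractor` class, and the `extract_text` function are all hypothetical names, and the assumption that article text lives in `<p>` elements outside `nav`, `aside`, `header`, and `footer` will not hold for every site.

```python
from html.parser import HTMLParser

# Assumption: these tags hold page chrome (menus, sidebars, scripts), not article text.
SKIP_TAGS = {"script", "style", "nav", "aside", "header", "footer", "form"}


class ParagraphExtractor(HTMLParser):
    """Collects the text of <p> elements that are not nested inside a skipped tag."""

    def __init__(self):
        super().__init__()
        self.skip_depth = 0        # > 0 while inside any SKIP_TAGS element
        self.in_paragraph = False
        self.current = []          # text pieces of the paragraph being read
        self.paragraphs = []       # finished paragraphs

    def _flush(self):
        # Normalize whitespace and store the paragraph if it is not empty.
        text = " ".join("".join(self.current).split())
        if text:
            self.paragraphs.append(text)
        self.current = []
        self.in_paragraph = False

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_TAGS:
            self.skip_depth += 1
        elif tag == "p" and self.skip_depth == 0:
            if self.in_paragraph:  # previous <p> was never closed
                self._flush()
            self.in_paragraph = True

    def handle_endtag(self, tag):
        if tag in SKIP_TAGS and self.skip_depth > 0:
            self.skip_depth -= 1
        elif tag == "p" and self.in_paragraph:
            self._flush()

    def handle_data(self, data):
        if self.in_paragraph and self.skip_depth == 0:
            self.current.append(data)


def extract_text(html: str) -> str:
    """Return the page's paragraph text, with paragraphs separated by blank lines."""
    parser = ParagraphExtractor()
    parser.feed(html)
    parser.close()
    if parser.in_paragraph:        # flush a trailing unclosed <p>
        parser._flush()
    return "\n\n".join(parser.paragraphs)
```

The HTML itself could be fetched beforehand with something like `urllib.request.urlopen(url).read().decode("utf-8")`, assuming the page is UTF-8 encoded. For a real news site the tag filter would likely need to be replaced with selectors tuned to that site's markup, which is exactly the per-site configuration described above.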

Igor Rodichev, 2019-09-04
@Bubunt

Perhaps https://getpocket.com will do.

Maxim, 2019-09-04
@Tomio

There are such services, but you have to pay to get a useful amount of data.
Here are a couple, for example: https://www.parsehub.com, https://www.octoparse.com
