What's the best way to parse html?

D

Davidaa_WoW2021-08-12 15:31:10

HTML

Davidaa_WoW, 2021-08-12 15:31:10

So the bottom line is, I often need to parse some sites, transfer all the necessary information to my database (naturally with their consent), all parsing is one-time, updates are not needed at all. At the moment I’m doing it through the integromat service, it’s ideal in this regard - you can build your own chain, and add it where you need it, and it’s convenient to parse through regex, the only problem is that it’s too expensive - 1 site = $ 1, even a little more. So, is there any convenient and most importantly powerful software for parsing pages? Preferably all the same by regex, I know it's bad form, but it's more convenient for me - sometimes you pull out the content, sometimes the attributes. You can also try to implement it through the code, but everything I was looking for was too difficult, or did not fit my needs. The final data export can be at least in csv, at least in json,

Reply

Answer the question

In order to leave comments, you need to log in

1 answer(s)

A

Alexander Makarov, 2021-08-12
@serbananas

Check out ZennoPoster, suitable for scraping and more.