Answer the question
In order to leave comments, you need to log in
What's the best way to parse html?
So the bottom line is, I often need to parse some sites, transfer all the necessary information to my database (naturally with their consent), all parsing is one-time, updates are not needed at all. At the moment I’m doing it through the integromat service, it’s ideal in this regard - you can build your own chain, and add it where you need it, and it’s convenient to parse through regex, the only problem is that it’s too expensive - 1 site = $ 1, even a little more. So, is there any convenient and most importantly powerful software for parsing pages? Preferably all the same by regex, I know it's bad form, but it's more convenient for me - sometimes you pull out the content, sometimes the attributes. You can also try to implement it through the code, but everything I was looking for was too difficult, or did not fit my needs. The final data export can be at least in csv, at least in json,
Answer the question
In order to leave comments, you need to log in
Didn't find what you were looking for?
Ask your questionAsk a Question
731 491 924 answers to any question