Parsing
Ivan Damaskin, 2015-01-21 01:45:08

How, and with what tools, can I parse ads?

Hello. I googled for software and found a few programs for parsing, but I just can't get the result I want. Maybe I'm using them incorrectly.
Let me know if anyone has experience with this. There is an ad site; take the cars category as an example: https://999.md/list/transport/cars How can I pull the ads out of there and store the result as a database that can be easily imported into my own site? Thanks to everyone who responds -)

4 answer(s)
Danil Chekalin, 2015-01-21
@dakiesse

Search Google for: {the programming language you use} parse html
Here is the second result for the query "php parse html": habrahabr.ru/post/176635
The Habr article at that link will help with the parsing. After that, write everything into a database (or write to the database as you go, it's up to you).
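
For the "write it into the database" step, here is a minimal sketch in Python using only the standard library. Everything in it is an assumption for illustration: the table layout, the `save_ads` and `export_csv` helper names, and the sample records, which stand in for whatever your parser actually extracts.

```python
import csv
import io
import sqlite3


def save_ads(conn, ads):
    """Insert (title, price, url) tuples; url is the primary key, so re-runs skip duplicates."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS ads ("
        "url TEXT PRIMARY KEY, title TEXT NOT NULL, price TEXT)"
    )
    conn.executemany(
        "INSERT OR IGNORE INTO ads (url, title, price) VALUES (?, ?, ?)",
        [(url, title, price) for title, price, url in ads],
    )
    conn.commit()


def export_csv(conn):
    """Return the whole table as CSV text, ready to import elsewhere."""
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["url", "title", "price"])
    writer.writerows(conn.execute("SELECT url, title, price FROM ads ORDER BY url"))
    return out.getvalue()


# Made-up records standing in for the output of your parser.
parsed_ads = [
    ("Toyota Corolla, 2008", "4500 EUR", "/ads/1"),
    ("Dacia Logan, 2012", "3900 EUR", "/ads/2"),
    ("Toyota Corolla, 2008", "4500 EUR", "/ads/1"),  # duplicate from a second crawl
]

conn = sqlite3.connect(":memory:")  # pass a file path instead to keep the data
save_ads(conn, parsed_ads)
print(export_csv(conn))
```

Keying the table on the ad URL is just one way to avoid storing the same ad twice when you crawl the listing repeatedly; CSV is used for the export because most site engines can import it.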

Anton Fedoryan, 2015-01-21
@AnnTHony

I recently wrote something similar as a test task for an interview.
I parsed the site using C# and HtmlAgilityPack.dll.
As mentioned above, you need to fetch the HTML page with the data you are interested in and pull the information out of it. Where to write it (to a database or to a file) is your choice.
To do this, you need to know a programming language (not necessarily C#), understand how web servers work, and understand HTML well enough to find the tags that contain the information.
From my own experience, I can say that parsing HTML with regular expressions is not an option. :)
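
To illustrate "find the tags that contain the information" with a real HTML parser rather than regular expressions, here is a rough sketch in Python (not the C# + HtmlAgilityPack code from the answer). The `ad-title` class name and the sample markup are invented placeholders, not the actual 999.md page structure; you would look up the real tags in the page source.

```python
from html.parser import HTMLParser


class AdLinkParser(HTMLParser):
    """Collect (title, href) pairs from <a class="ad-title"> elements.

    "ad-title" is a hypothetical class name chosen for this example.
    """

    def __init__(self):
        super().__init__()
        self.ads = []         # finished (title, href) pairs
        self._inside = False  # True while we are inside a matching <a>
        self._href = None
        self._text = []       # text chunks seen inside the current link

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        classes = (attrs.get("class") or "").split()
        if tag == "a" and "ad-title" in classes:
            self._inside = True
            self._href = attrs.get("href")
            self._text = []

    def handle_data(self, data):
        if self._inside:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._inside:
            self.ads.append(("".join(self._text).strip(), self._href))
            self._inside = False


def extract_ads(html):
    parser = AdLinkParser()
    parser.feed(html)
    parser.close()
    return parser.ads


# Invented sample markup; note the extra class and the nested <b> tag.
SAMPLE_HTML = """
<ul>
  <li><a class="ad-title" href="/ads/1">Toyota Corolla, 2008</a></li>
  <li><a class="ad-title featured" href="/ads/2">Dacia <b>Logan</b>, 2012</a></li>
  <li><a href="/about">About</a></li>
</ul>
"""

print(extract_ads(SAMPLE_HTML))
```

The second link shows why a parser is easier to live with than a regular expression: an extra class or a nested tag inside the link does not break the extraction.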

Ivan Damaskin, 2015-01-21
@Joannes

Or could it be that the site is protected against parsing? And how can this be checked? For example, I can't get the HTML from the site 999.md.
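
One way to start checking this, sketched in Python under assumptions: read the site's robots.txt to see what it asks crawlers not to fetch, and look at the HTTP status code you get back when sending an explicit User-Agent header. The robots.txt text, the `my-parser` agent name and the `example.com` URLs below are made up; nothing here says what 999.md actually does, and a page that fills in its content with JavaScript will look "empty" even when nothing is blocked.

```python
import urllib.error
import urllib.request
from urllib.robotparser import RobotFileParser


def allowed_by_robots(robots_txt, user_agent, url):
    """Check a URL against robots.txt rules that have already been downloaded."""
    rp = RobotFileParser()
    rp.parse(robots_txt.splitlines())
    return rp.can_fetch(user_agent, url)


def fetch_status(url, user_agent="my-parser"):
    """Return the HTTP status code for a URL (performs a real network request)."""
    req = urllib.request.Request(url, headers={"User-Agent": user_agent})
    try:
        with urllib.request.urlopen(req, timeout=10) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        return err.code  # e.g. 403 or 429 when the server refuses the request


# Invented robots.txt content for the offline part of the example.
SAMPLE_ROBOTS = """
User-agent: *
Disallow: /private/
"""

print(allowed_by_robots(SAMPLE_ROBOTS, "my-parser", "https://example.com/list/cars"))
print(allowed_by_robots(SAMPLE_ROBOTS, "my-parser", "https://example.com/private/page"))
```

`fetch_status` is defined but not called here because it needs network access; you would call it with the listing URL and compare the result with what a browser receives.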
