K
K
Koreetz Koreetz2015-09-03 13:32:50
Parsing
Koreetz Koreetz, 2015-09-03 13:32:50

How to parse site information?

Good day!
At work, they set the task: there are certain sites you need to "take information from them" how to do this through parsing? (so as not to make stupid copy-pastes!), and the information will need to be inserted either into excel or into the 1C product card?
Is it possible at all ......?

Answer the question

In order to leave comments, you need to log in

4 answer(s)
V
Vladimir Martyanov, 2015-09-03
@vilgeforce

Generally possible. You write a script in Python/Perl/Go/etc, get a page, rip out information from it, write it down where you need it.

K
Kirill, 2015-09-03
@GoooodBoy

I tried on Python + Grab in two evenings to build a parser for a lost movie, which collected all the information and pictures about the series. Everything was saved in exel. Easy enough to learn.

J
jacksparrow, 2015-09-03
@jacksparrow

Depending on what you put into the meaning of the word parsing, you can parse - get pages with a curl, and parse - analyze the house through a simple html Dom parser.

L
LINKeR UA, 2015-09-03
@LINKeRxUA

I use multi-threaded CURL (multicurl) through a proxy - in order to get content, let's say from 100-500 pages in 3-5 seconds. And PHP DomDocument + DomXPath for parsing.
The blessing of power of the server suffices.

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question