linux
grindel, 2012-09-19 01:02:49

Periodic page parsing with the results stored in a MySQL table

Hello!
I have a small home server running Debian Linux. I want it to visit a private VKontakte page (using cookies) every 10 minutes or so, find posts there from a certain user with a certain label, and add their text to a database. How the text is stored should depend on the label or labels in the post. And of course it should add only new posts.

Could you tell me how to implement this?
I'm at a loss; I don't even know where to start or what to search for.


2 answer(s)
Golden, 2012-09-19
@grindel

Perl and its modules: LWP for working with the web, DBI for talking to MySQL. Run it from cron, as already mentioned, or daemonize the script: while 1, if you understand what I mean (c)
If the task needs a finished implementation, rather than being tinkering for your own education, you can hire third-party specialists to develop it. There are plenty of resources for this: freelance sites, oDesk, etc. I won't advertise them, they don't pay me for it.
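
The "while 1" idea can be sketched roughly as follows. This is written in Python rather than Perl purely for illustration. `poll_forever`, `job`, `max_runs`, the 600-second interval, and the script path in the crontab comment are all hypothetical, not something from this thread.

```python
import time

# With cron instead of a daemon, a crontab entry along these lines would run
# a script every 10 minutes (the path is a made-up example):
#   */10 * * * * /usr/bin/python3 /home/user/fetch_posts.py


def poll_forever(job, interval_seconds=600, max_runs=None, sleep=time.sleep):
    """Call job() repeatedly, sleeping interval_seconds between calls.

    max_runs and sleep exist only so the loop can be exercised in a test;
    a real daemon would leave them at their defaults and run until killed.
    """
    runs = 0
    while max_runs is None or runs < max_runs:
        try:
            job()
        except Exception as exc:
            # One failed fetch should not take the whole daemon down.
            print(f"job failed: {exc}")
        runs += 1
        if max_runs is None or runs < max_runs:
            sleep(interval_seconds)
    return runs
```

Cron is usually the simpler choice, since the system restarts nothing and there is no long-lived process to supervise; the loop is mainly useful when the script has to keep state, such as a logged-in session, between runs.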

avalak, 2012-09-19
@avalak

Cron for scheduled execution.
VK API (it should be more convenient for this purpose) or cURL (pretending to be a browser) for fetching the data.
An HTML parser (specific to each language; the best option if you are going to impersonate a browser).
SQL queries / ORM / AR (Active Record) for talking to the database.
In general, something like this.
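
The parsing and storage steps above might look something like this in Python. Everything specific here is an assumption made for the sketch: the markup (`div class="post"` with `data-id` and `data-author` attributes), the `#label` convention, the `posts` table layout, and the use of `sqlite3` as a stand-in for MySQL so the example runs without a server. Fetching the page is left out.

```python
import re
import sqlite3
from html.parser import HTMLParser


class PostParser(HTMLParser):
    """Collect posts from hypothetical markup such as
    <div class="post" data-id="1" data-author="bob">text #label</div>.
    Nested <div> elements inside a post are not handled.
    """

    def __init__(self):
        super().__init__()
        self.posts = []
        self._current = None

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "div" and attrs.get("class") == "post":
            self._current = {"id": attrs.get("data-id"),
                             "author": attrs.get("data-author"),
                             "text": ""}

    def handle_data(self, data):
        if self._current is not None:
            self._current["text"] += data

    def handle_endtag(self, tag):
        if tag == "div" and self._current is not None:
            self._current["text"] = self._current["text"].strip()
            self.posts.append(self._current)
            self._current = None


def store_new_posts(conn, html, author, label):
    """Store posts by `author` carrying `#label`; return how many were new."""
    conn.execute("CREATE TABLE IF NOT EXISTS posts ("
                 "post_id TEXT PRIMARY KEY, author TEXT, label TEXT, body TEXT)")
    parser = PostParser()
    parser.feed(html)
    parser.close()
    before = conn.total_changes
    for post in parser.posts:
        labels = re.findall(r"#(\w+)", post["text"])
        if post["author"] != author or label not in labels:
            continue
        # The primary key makes repeated runs skip posts already stored.
        # MySQL's rough equivalent of this statement is INSERT IGNORE.
        conn.execute("INSERT OR IGNORE INTO posts VALUES (?, ?, ?, ?)",
                     (post["id"], post["author"], label, post["text"]))
    conn.commit()
    return conn.total_changes - before
```

Keying the table on the post's own identifier is what makes "add only new posts" cheap: the script can be re-run every 10 minutes over the same page and the database rejects the duplicates. In a real setup the HTML would come from an HTTP client sending the session cookies, or the VK API would replace the HTML parsing entirely.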
