What should I choose to build a system with a parser, database work, serious analytics, graphical reports, and a web interface?
Hello!
Task:
1. Visit the sites that host the databases I need (anywhere from one or two sites up to one or two dozen) and open the pages that contain a multi-page list. From this dynamically generated list, follow the hyperlinks to the detail pages. On each detail page, find and click the download button, select the required file parameters (file format, etc.) in the web interface's dialog box, and click the button to download the data.
Frequency: once a day. The number of detail pages and file downloads ranges from several thousand to several tens of thousands.
2. Write the downloaded data to the database.
3. Process and analyze the data.
4. Next, visit sites with textual content, mainly news: go through the archives of publications, news, and announcements, analyze the text and images, and save only the necessary information to the database (not the entire sample, only the results of my own analytics). Link the results of this second stage to the results of the first.
Frequency: once a day. The number of landing pages, analytical operations, and downloads ranges from several dozen to several thousand.
5. Produce analytics and build graphs from the collected data.
6. The system will run in this mode (observation and analytics) for 0.5-1 year, and during that time the results can stay local, in any form.
7. Later, once the analytics is sufficient, expose the system through a web interface so that the analytics can be used both publicly and privately over the web.
Please tell me which programming languages, tools, libraries, and frameworks for MS Windows are most suitable for implementing this task. I will have to learn everything from scratch.
Also, it is desirable to keep the system modular so that individual modules (for example, graphics, web, and parsing) can later be implemented by third-party programmers.
Thank you in advance for your advice!
nutch.apache.org | www.opensearchserver.com
+ PostgreSQL + PHP + jQuery + plugins
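To make steps 1 and 2 of the question more concrete, here is a minimal sketch in Python (not one of the tools named above, just an illustration) that extracts detail-page links and the next-page link from one listing page and stores the links without duplicates. Everything specific is an assumption: the `item` class, the `rel="next"` attribute, the sample HTML, the `detail_pages` table, and the use of the standard-library `sqlite3` module as a stand-in for PostgreSQL. It does not cover clicking download buttons or filling in dialog boxes, which may require a browser automation tool.

```python
import sqlite3
from html.parser import HTMLParser
from urllib.parse import urljoin


class ListingParser(HTMLParser):
    """Collects detail-page links and the next-page link from one listing page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.detail_links = []
        self.next_page = None

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        attrs = dict(attrs)
        href = attrs.get("href")
        if not href:
            return
        classes = (attrs.get("class") or "").split()
        rel = (attrs.get("rel") or "").split()
        if "item" in classes:      # hypothetical marker for detail-page links
            self.detail_links.append(urljoin(self.base_url, href))
        elif "next" in rel:        # hypothetical marker for the pagination link
            self.next_page = urljoin(self.base_url, href)


def parse_listing(html, base_url):
    """Return (detail_links, next_page_url_or_None) for one listing page."""
    parser = ListingParser(base_url)
    parser.feed(html)
    parser.close()
    return parser.detail_links, parser.next_page


def save_links(conn, links, fetched_on):
    """Store links; a URL already seen on an earlier run is skipped."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS detail_pages ("
        "url TEXT PRIMARY KEY, first_seen TEXT NOT NULL)"
    )
    # INSERT OR IGNORE is SQLite syntax; PostgreSQL uses ON CONFLICT DO NOTHING.
    conn.executemany(
        "INSERT OR IGNORE INTO detail_pages (url, first_seen) VALUES (?, ?)",
        [(url, fetched_on) for url in links],
    )
    conn.commit()


# Made-up listing page used only for this illustration.
SAMPLE_PAGE = """
<ul>
  <li><a class="item" href="/records/1">Record 1</a></li>
  <li><a class="item" href="/records/2">Record 2</a></li>
</ul>
<a rel="next" href="/list?page=2">Next</a>
"""

links, next_page = parse_listing(SAMPLE_PAGE, "https://example.org/list")
conn = sqlite3.connect(":memory:")
save_links(conn, links, "2024-01-01")
save_links(conn, links, "2024-01-02")  # a second daily run adds no duplicates
stored = conn.execute("SELECT COUNT(*) FROM detail_pages").fetchone()[0]
print(links, next_page, stored)
```

Keeping parsing (`parse_listing`) and storage (`save_links`) in separate functions is one way to get the modularity asked for in the question: either part could be replaced or handed to another programmer without touching the other.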