Answer the question
In order to leave comments, you need to log in
How to organize the collection of data from several systems in the data warehouse for further processing and use in OLAP?
Good day!
Faced with the need to collect data from several systems for further use in order to build analytical reports.
I surfed the Internet, but did not find anything intelligible, so I ask for help.
I am looking for a solution that can meet the following requirements:
-Collecting data from several databases from different vendors oracle db, mssql, etc.
-Setting collection time ranges (once a day, every hour, etc.)
-Selecting full or incremental collection depending on the conditions (for example, once a month full, every hour incremental)
-Setting tables and fields from which data is collected ( each system has its own set)
-Having remote access to the systems from which data is collected (for example: data warehouse in St. Petersburg, and target systems in Moscow and
Novosibirsk
)
.
Thank you.
Answer the question
In order to leave comments, you need to log in
Thank you for inviting me as an expert.
I have to say that there is no solution in the general case.
You will have to create the data aggregator yourself.
Option 1. Very smart application. Mega Crawler.
The Mega Crawler application must support the ability to establish remote connections to several types of databases. The application must know about the data structure of each specific database to which it clings.
It will either be hard-coded logic that only the programmer can change.
Or flexibly customizable logic, the development of which can take the main development resources and which will be powerless when connected to N + 1 database.
The analogue most known to you is yandexbot. It tries to parse pages, find texts, build an index on them.
Option 2. Application with a network of data agents. Spider net.
Spider is engaged in selecting the necessary data from one database and sending this data to the center of the "web" according to a given protocol and in exactly the format that the central controlling node understands.
The analogue most known to you is Yandex.Market. Online stores themselves implement the unloading of goods and transfer to Yandex.Market in a predetermined format.
What are your options?
Didn't find what you were looking for?
Ask your questionAsk a Question
731 491 924 answers to any question