L
L
lukoie2020-09-17 05:29:58
Analytics
lukoie, 2020-09-17 05:29:58

How to work with a large array of "open data"?

Hello.
If a 5-10 gigabyte xml file is given for analytics, what do data analysts usually do with them? Somehow, it must be drawn in somewhere, so that it is possible to make selections, queries. And then I'm stalled now, I have a 5 gigabyte xml on my hands, and I'll never know how to filter data on it. Let's say I open it for reading by a certain browser that does not create a cache and does not load the entire house-tree. But then it is impossible to filter only those nodes that interest me (and make another selection inside this selection). How it's done? I don’t believe that everyone comes up with their own bike, and as a result, these xml are loaded into a relational database and manipulated there. Or is that what happens?

Answer the question

In order to leave comments, you need to log in

1 answer(s)
A
Alexander Skusnov, 2020-09-17
@AlexSku

MatLab has such a Tall Array data type .
https://www.mathworks.com/solutions/big-data-matla...

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question