Answer the question
In order to leave comments, you need to log in
How to work with a large array of "open data"?
Hello.
If a 5-10 gigabyte xml file is given for analytics, what do data analysts usually do with them? Somehow, it must be drawn in somewhere, so that it is possible to make selections, queries. And then I'm stalled now, I have a 5 gigabyte xml on my hands, and I'll never know how to filter data on it. Let's say I open it for reading by a certain browser that does not create a cache and does not load the entire house-tree. But then it is impossible to filter only those nodes that interest me (and make another selection inside this selection). How it's done? I don’t believe that everyone comes up with their own bike, and as a result, these xml are loaded into a relational database and manipulated there. Or is that what happens?
Answer the question
In order to leave comments, you need to log in
MatLab has such a Tall Array data type .
https://www.mathworks.com/solutions/big-data-matla...
Didn't find what you were looking for?
Ask your questionAsk a Question
731 491 924 answers to any question