D
D
denislysenko2021-12-13 21:18:56
Apache Spark
denislysenko, 2021-12-13 21:18:56

1 rdd is one piece of 128 bytes or a stream?

1 rdd is one piece of 128 bytes or a stream?
So what is rdd other than a resilient distributed dataset?

Answer the question

In order to leave comments, you need to log in

1 answer(s)
C
Cheypnow, 2021-12-14
@denislysenko

RDD is a distributed collection. That is, a set of data distributed over a cluster on which distributed operations can be performed.
For example, there is a file lying in a distributed file system, each node stores its own part of the file.
Reading will create one RDD representing the contents of this file. At the same time, each node on which it was stored will process its part of the data.

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question