Search
June 30, 2017 | By Seth Muthukaruppan
Resilient Distributed Datasets (RDD) is a fundamental data structure of Spark. It is an immutable distributed collection of objects. RDDs have actions, which return values, and transformations, which return pointers to new RDDs.
Seth Muthukaruppan | Consultant
Seth is a Professional Services consultant at Instaclustr.