What is Apache Spark RDD

RDD stands for Resilient Distributed Dataset. Its a distributed dataset which has the capability to recover from failures.