Scalable Distributed Programming Using MapReduce and Resilient Distributed Datasets (Spark)

Apache Spark is an open source project originally developed at the University of California. This distributed, universal cluster computing framework provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.

Related

Online Resources