Apache Big Data Stack
This is a collection of open source software utilities that allow you to integrate a flexible number of computers into a network to solve problems with large amounts of data. There are solutions available for distributed data storage as well as for distributed computing e.g. with MapReduce strategy.
Related
- [notebook] Mapreduce in R - Prime Numbers
- [notebook] Mapreduce in Python - Word Count