Hadoop
Apache Hadoop is a collection of open-source software utilities that facilitates using a network of many computers to solve problems involving massive amounts of data and computation.
At its core, Hadoop has two major layers:
- Processing/Computation layer (MapReduce), and
- Storage layer (Hadoop Distributed File System).
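To make the processing layer more concrete, the MapReduce model can be sketched as a word count in plain Python. This is a single-process illustration only and does not use any Hadoop API; the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are hypothetical and chosen for this example.

```python
from collections import defaultdict


def map_phase(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle step: group all emitted values by key.

    In a real cluster the framework does this between map and reduce;
    here it is simulated with an in-memory dictionary.
    """
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce step: combine all values for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence over a list of lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    print(word_count(["big data big cluster", "data node"]))
```

On an actual Hadoop cluster the input lines would typically be read from files stored in HDFS, and the map and reduce steps would run as separate tasks spread across many machines rather than inside one process.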
Outline
Resources
- todo