Skip to content

Hadoop

hadoop

Apache Hadoop is a collection of open-source software utilities that facilitates using a network of many computers to solve problems involving massive amounts of data and computation.

At its core, Hadoop has two major layers namely

  • Processing/Computation layer (MapReduce), and
  • Storage layer (Hadoop Distributed File System).

Outline

Resources

  • todo