Hadoop and MapReduce Notes
Hadoop and MapReduce Notes
1. History of Hadoop
2. Apache Hadoop
4. Components of Hadoop
8. Hadoop Streaming
- Allows developers to write MapReduce jobs in any language (e.g., Python, Perl).
9. Hadoop Pipes
- Includes Hive, Pig, HBase, Sqoop, Flume, Oozie, Zookeeper, Mahout, etc.
- Job submission -> Job initialization -> Task assignment -> Map phase -> Shuffle & sort -> Reduce