Hands-On Hadoop Tutorial
Chris Sosa
Wolfgang Richter
May 23, 2008
General Information
Hadoop uses HDFS, a distributed filesystem based on GFS, as its shared filesystem
Master node
– Keeps track of namespace and metadata about items
– Keeps track of MapReduce jobs in the system
Slave nodes
– Centurion064 also acts as a slave node
– Manage blocks of data sent from master node
– In terms of GFS, these are the chunkservers
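The split between master and slave nodes can be sketched with a toy model. This is only an illustration, not Hadoop's API or storage format: the class names, the tiny BLOCK_SIZE, the round-robin placement, and the node names "slave-a" and "slave-b" are all assumptions made for the example, and replication is left out. The point is that the master holds only the namespace and block locations, while the slaves hold the bytes.

```python
# Toy model of the master/slave split. NOT Hadoop's real API.
from dataclasses import dataclass, field

BLOCK_SIZE = 4  # bytes; deliberately tiny (assumption for illustration)


@dataclass
class SlaveNode:
    """Stores raw blocks, like a GFS chunkserver."""
    name: str
    blocks: dict = field(default_factory=dict)  # block id -> bytes


@dataclass
class MasterNode:
    """Tracks the namespace and block locations, never the data itself."""
    slaves: list
    namespace: dict = field(default_factory=dict)  # path -> [(block id, slave name)]
    next_id: int = 0

    def put(self, path, data):
        """Split data into blocks and place them round-robin on the slaves."""
        placements = []
        for offset in range(0, len(data), BLOCK_SIZE):
            slave = self.slaves[self.next_id % len(self.slaves)]
            slave.blocks[self.next_id] = data[offset:offset + BLOCK_SIZE]
            placements.append((self.next_id, slave.name))
            self.next_id += 1
        self.namespace[path] = placements

    def get(self, path):
        """Reassemble a file by following the master's block locations."""
        by_name = {s.name: s for s in self.slaves}
        return b"".join(
            by_name[name].blocks[block_id]
            for block_id, name in self.namespace[path]
        )


# Hypothetical two-slave cluster
slaves = [SlaveNode("slave-a"), SlaveNode("slave-b")]
master = MasterNode(slaves)
master.put("/demo.txt", b"hello hadoop")
restored = master.get("/demo.txt")
```

After the call to put, master.namespace maps "/demo.txt" to three block placements, and the bytes themselves live only in the slaves' blocks dictionaries.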
Easy
– hadoop namenode -format