Hadoop Cluster Setup

This document provides instructions for configuring a Hadoop cluster across multiple machines, including modifying hosts files, setting up passwordless SSH, editing configuration files such as core-site.xml and hdfs-site.xml, formatting the NameNode, and starting and stopping the Hadoop processes. Key steps include modifying the hosts file on all machines, setting up SSH between the master and slaves, editing the masters and slaves files on all machines, formatting the NameNode on the master, and starting/stopping processes by running scripts on the master only.


Configuring a Hadoop Cluster on Multiple Machines

Agenda
- Modify your hosts file
- SSH from master to all slaves
- SSH from all slaves to master
- Edit masters file
- Edit slaves file
- Modify hadoop-env.sh file
- Modify core-site.xml file
- Modify hdfs-site.xml file
- Modify mapred-site.xml file
- Format the NameNode
- Start Hadoop cluster
- Stop Hadoop cluster

Modify your hosts file


The hosts file contains the mapping of IP addresses to hostnames. Edit your hosts file by typing the command below in your terminal:

sudo vi /etc/hosts

Add entries for master & slaves
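
For example, a hypothetical hosts file for one master and two slaves (the IP addresses and hostnames here are placeholders; adjust them to your own network) might contain:

192.168.1.100   master
192.168.1.101   slave1
192.168.1.102   slave2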

Repeat the same step on all master and slave machines.

The master needs to communicate with each slave machine


There should be passwordless SSH from the master machine to each slave machine. Run the following three commands to set up passwordless SSH from master to slave:

username@master:~> ssh-keygen -t rsa
username@master:~> ssh username@slave1 mkdir -p .ssh
username@master:~> cat .ssh/id_rsa.pub | ssh username@slave1 'cat >> .ssh/authorized_keys'

Repeat the same steps for each slave machine.
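
As a quick sanity check (assuming the username and the slave1 hostname used in the commands above), the following should print the slave's hostname without prompting for a password:

username@master:~> ssh username@slave1 hostname
slave1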

Each slave needs to communicate with the master machine


There should be passwordless SSH from each slave machine to the master machine. Run the following three commands to set up passwordless SSH from slave to master:

username@slave1:~> ssh-keygen -t rsa
username@slave1:~> ssh username@master mkdir -p .ssh
username@slave1:~> cat .ssh/id_rsa.pub | ssh username@master 'cat >> .ssh/authorized_keys'

Repeat the same steps on each slave machine.
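
Likewise, from each slave, the following should log in to the master without a password prompt:

username@slave1:~> ssh username@master hostname
master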

Edit masters file


- Open the masters file (HADOOP_HOME/conf/masters)
- Add the master machine entry to the file
- Save the masters file
- Make these changes on each machine in the cluster (master and slaves)

An example file is shown below.
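
A minimal example, assuming the hostname master from the hosts file above (in Hadoop 1.x, this file determines the host on which the SecondaryNameNode is started):

master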

Edit slaves file


- Open the slaves file (HADOOP_HOME/conf/slaves)
- Add an entry for every slave machine, one slave per line
- Save the slaves file
- Make these changes on each machine in the cluster (master and slaves)

An example file is shown below.
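
A minimal example for a two-slave cluster, using the hostnames from the hosts file above:

slave1
slave2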

Modify hadoop-env.sh file


The hadoop-env.sh file contains system-level variables. Make the following entries in HADOOP_HOME/conf/hadoop-env.sh:

export JAVA_HOME=/usr
export HADOOP_HOME=/home/neeraj/local_cluster_home/hadoop-1.0.3

Make these changes on each machine in the cluster (master and slaves).
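
As a quick check that the environment is picked up (a sketch, assuming the HADOOP_HOME path above), running hadoop version should print the Hadoop version without errors:

username@master:~> $HADOOP_HOME/bin/hadoop version
Hadoop 1.0.3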

Modify core-site.xml file


We need to make the following entries in core-site.xml:

<configuration>
  <property>
    <name>fs.default.name</name>
    <value>hdfs://master:9000</value>
  </property>
  <property>
    <name>hadoop.tmp.dir</name>
    <value>/home/neeraj/local_cluster_home/hadoop-1.0.3/hdfs_temp</value>
  </property>
</configuration>

Make these changes on each machine in the cluster (master and slaves).
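Here fs.default.name tells every Hadoop client and daemon that the NameNode listens at master:9000, and hadoop.tmp.dir sets the local base directory Hadoop uses for temporary storage; make sure that directory exists and is writable by the Hadoop user on every machine.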

Modify hdfs-site.xml file


We need to make the following entries in hdfs-site.xml:

<configuration>
  <property>
    <name>dfs.replication</name>
    <value>1</value>
    <description>The number of times each block of a file is replicated across the cluster. Default is 3.</description>
  </property>
  <property>
    <name>dfs.data.dir</name>
    <value>/home/neeraj/local_cluster_home/hadoop-1.0.3/hdfs_data</value>
  </property>
</configuration>

Make these changes on each machine in the cluster (master and slaves).
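Note that a dfs.replication of 1 keeps only a single copy of each block, so there is no redundancy if a DataNode fails; on a cluster with three or more DataNodes, the default of 3 is the usual choice.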

Modify mapred-site.xml file


We need to make the following entry in mapred-site.xml:

<configuration>
  <property>
    <name>mapred.job.tracker</name>
    <value>master:9001</value>
    <description>The host and port that the MapReduce JobTracker runs at.</description>
  </property>
</configuration>

Make these changes on each machine in the cluster (master and slaves).
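The mapred.job.tracker value is the address the TaskTrackers on the slaves use to reach the JobTracker, so the hostname master must resolve on every machine (which the hosts file entries above take care of).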

Format your Namenode


Run the following command on your master machine (from the HADOOP_HOME/bin directory):

./hadoop namenode -format
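
Formatting initializes a fresh HDFS filesystem and erases any existing NameNode metadata, so run it only once, when the cluster is first set up. For example (assuming the HADOOP_HOME path used above):

username@master:~> cd /home/neeraj/local_cluster_home/hadoop-1.0.3/bin
username@master:~> ./hadoop namenode -format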

Start your Hadoop cluster

Run the following command on the master machine (from HADOOP_HOME/bin):

./start-all.sh

There is no need to start anything on the slave machines.
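
Behind the scenes, start-all.sh starts the NameNode and JobTracker locally on the master and then uses the slaves file to SSH into each slave and start its DataNode and TaskTracker; this is why the passwordless SSH setup above is required.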

Check Hadoop daemons

Run the jps command on the master machine.
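
On the master you should see something like the following (the process IDs are illustrative, and DataNode/TaskTracker also appear here if the master is listed in the slaves file):

username@master:~> jps
4723 NameNode
4891 SecondaryNameNode
5012 JobTracker
5230 Jps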

Run the jps command on the slave machines.
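
On each slave the output should look roughly like this (process IDs are illustrative):

username@slave1:~> jps
3301 DataNode
3412 TaskTracker
3587 Jps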

Stop your Hadoop cluster

Run the following command on the master machine (from HADOOP_HOME/bin):

./stop-all.sh

There is no need to stop anything on the slave machines.

Thanks

Contact Point: www.bispsolutions.com
