0% found this document useful (0 votes)

28 views18 pages

Hadoop Installation

Uploaded by

placementcell1234567890

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

28 views18 pages

Hadoop Installation

Uploaded by

placementcell1234567890

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 18

EXP NO: 1A INSTALLATION OF JAVA

DATE:

AIM:

To setup java development environment for working with Hadoop software.

STEPS:

1. sudo apt-get update

2. In this step, we will install latest version of JDK(1.8) on the machine.

The Oracle JDK is the official JDK; however, it is nolonger provided by Oracle as a default
installation for Ubuntu. You can still install it using apt-get.

To install any version, first execute the followingcommands:

a. sudo apt-get install python-software-properties

b. sudo add-apt-repository ppa:webupd8team/java

c. sudo apt-get update

Then, depending on the version you want to install, execute one of the following commands:

Oracle JDK 7: sudo apt-get install oracle-java7-installer

Oracle JDK 8: sudo apt-get install oracle-java8-installer

Follow these steps to setup Java on Windows and validate the install.
Download Java for Windows 10

Download the latest Java Development Kit installation file for Windows 10 to have the latest
features and bug fixes.

1. Using your preferred web browser, navigate to the Oracle Java Downloads page.
2. On the Downloads page, click the x64 Installer download link under
the Windows category. At the time of writing this article, Java version 17 is the latest
long-term support Java version.

Wait for the download to complete.

Install Java on Windows 10

After downloading the installation file, proceed with installing Java on your Windows system.

Follow the steps below:

Step 1: Run the Downloaded File

Double-click the downloaded file to start the installation.

Step 2: Configure the Installation Wizard

After running the installation file, the installation wizard welcome screen appears.

1. Click Next to proceed to the next step.

2. Choose the destination folder for the Java installation files or stick to the default path.
Click Next to proceed.

3. Wait for the wizard to finish the installation process until the Successfully Installed message
appears. Click Close to exit the wizard.

Set Environmental Variables in Java

Set Java environment variables to enable program compiling from any directory. To do so,
follow the steps below:

Step 1: Add Java to System Variables

1. Open the Start menu and search for environment variables.

2. Select the Edit the system environment variables result.

3. In the System Properties window, under the Advanced tab, click Environment Variables…

4. Under the System variables category, select the Path variable and click Edit:
5. Click the New button and enter the path to the Java bin directory:

Download Java for Windows 10

Download the latest Java Development Kit installation file for Windows 10 to have the latest
features and bug fixes.

Wait for the download to complete.

Install Java on Windows 10

After downloading the installation file, proceed with installing Java on your Windows system.
Follow the steps below:

Step 1: Run the Downloaded File

Double-click the downloaded file to start the installation.

Step 2: Configure the Installation Wizard

After running the installation file, the installation wizard welcome screen appears.

1. Click Next to proceed to the next step.

2. Choose the destination folder for the Java installation files or stick to the default path.
Click Next to proceed.

3. Wait for the wizard to finish the installation process until the Successfully Installed message
appears. Click Close to exit the wizard.
Set Environmental Variables in Java

Set Java environment variables to enable program compiling from any directory. To do so,
follow the steps below:

Step 1: Add Java to System Variables

1. Open the Start menu and search for environment variables.

2. Select the Edit the system environment variables result.

3. In the System Properties window, under the Advanced tab, click Environment Variables…
4. Under the System variables category, select the Path variable and click Edit:

5. Click the New button and enter the path to the Java bin directory:
Step 2: Add JAVA_HOME Variable

Some applications require the JAVA_HOME variable. Follow the steps below to create the
variable:

1. In the Environment Variables window, under the System variables category, click
the New… button to create a new variable.
2. Name the variable as JAVA_HOME.

3. In the variable value field, paste the path to your Java jdk directory and click OK.

4. Confirm the changes by clicking OK in the Environment Variables and System

properties windows.

Test the Java Installation

Run the java -version command in the command prompt to make sure Java installed correctly:

If installed correctly, the command outputs the Java version

RESULT:

Thus the installation of Java has been executed successfully.

EX:NO: 1B INSTALLATION OF HADOOP

DATE:

AIM:

Downloading and installing Hadoop; Understanding different Hadoop modes.

Startup scripts,Configuration files.

PROCEDURE:

Hadoop software can be installed in three modes ofoperation:

• Stand Alone Mode: Hadoop is a distributed software and is designed to run on a

commodity of machines. However, we can install it on a single node in stand-alone
mode. In this mode, Hadoop software runs as a single monolithic java process. This
mode is extremelyuseful for debugging purpose. You can first testrun your Map-
Reduce application in this mode on small data, before actually executing it on cluster
with big data.

• Pseudo Distributed Mode: In this mode also,Hadoop software is installed on a

Single Node.Various daemons of Hadoop will run on the same machine as separate
java processes. Hence all the daemons namely NameNode, DataNode,
SecondaryNameNode, JobTracker,TaskTracker run on single machine.

• Fully Distributed Mode: In Fully Distributed Mode, the daemons NameNode,

JobTracker, SecondaryNameNode (Optional and can be run on a separate node) run
on the Master Node.The daemons DataNode and TaskTracker runon the Slave Node.

Hadoop Installation: Ubuntu Operating System in stand-alonemode

STEPS:

1. Now, let us setup a new user account for Hadoop

installation. This step is optional, but recommendedbecause it gives you flexibility to have a
separate account for Hadoop installation by separating this installation from other software
installation

• sudo adduser hadoop_dev ( Upon executing this command, you will

prompted to enter the newpassword for this user. Please enter the password
and enter other details. Don’t forget to save the details at the end)

• su - hadoop_dev( Switches the user fromcurrent user to the new

user created i.e Hadoop_dev)

2. Download the latest Hadoop distribution.

• Visit this URL and choose one of the mirror sites.You can copy the download
link and also use “wget” to download it from command prompt:

We get http:// apache.mirrors.lucidnetworks.net/hadoop/

3. Untar the file :

common/hadoop-2.7.0/hadoop-2.7.0.tar.gz

tar xvzf hadoop-2.7.0.tar.gz

4. Rename the folder to hadoop2

mv hadoop-2.7.0 hadoop2

5. Edit configuration file /home/hadoop_dev/ hadoop2/etc/hadoop/hadoop-

env.sh and setJAVA_HOME in that file.

vim /home/hadoop_dev/hadoop2/etc/hadoop/

• hadoop-env.sh
• uncomment JAVA_HOME and update it followingline:

export JAVA_HOME=/usr/lib/jvm/java-8- oracle

( Please check for your relevant java installation and set this value accordingly. Latest
versions of Hadoop require > JDK1.7)

6. Let us verify if the installation is successful or not

( change to home directory cd /home/ hadoop_dev/hadoop2/):

• bin/hadoop( running this command shouldprompt you with various

options)
7. This finishes the Hadoop setup in stand-alonemode.

8. Let us run a sample hadoop programs that isprovided to you in the download
package:

$ mkdir input (create the input directory)

$ cp etc/hadoop/*.xml input ( copy over all the xml files to input folder)

$ bin/hadoop jar share/hadoop/mapreduce/hadoop-mapreduce-examples-

2.7.0.jar grepinput output 'dfs[a-z.]+'

(grep/find all the files matching the pattern ‘dfs[a-z.]+’ and copy those files
to output directory)

$ cat output/* (look for the output in the outputdirectory that Hadoop creates
for you).

Hadoop Installation: PsuedoDistributed Mode( Locally )

Steps for Installation

1. Edit the file /home/Hadoop_dev/hadoop2/etc/hadoop/core-site.xml as below:

<configuration>
<property>
<name>fs.defaultFS</name>
<value>hdfs://localhost:9000</value>
</property>
</configuration>

Note: This change sets the namenode ip and port.

2. Edit the file /home/Hadoop_dev/hadoop2/etc/hadoop/hdfs-site.xml as below:

<configuration>
<property>
<name>dfs.replication</name>
<value>1</value>
</property>
</configuration>

Note: This change sets the default replicationcount for blocks used by HDFS.

3. We need to setup password less login so that themaster will be able to do a password-
less ssh to start the daemons on all the slaves.

Check if ssh server is running on your host or not:

a. ssh localhost( enter your password and if youare able to login then ssh server is
running)

b. In step a. if you are unable to login, then installssh as follows:

sudo apt-get install ssh

C.Setup password less login as below:

i. ssh-keygen -t dsa -P '' -f ~/.ssh/id_dsa

ii. cat ~/.ssh/id_dsa.pub >> ~/.ssh/

We can run Hadoop jobs locally or on YARN in this mode. In this Post, we
will focus on authorized_keys

4. running thejobs locally.

5. Format the file system. When we format namenode it formats the meta-data related to
data-nodes. By doing that, all the information on the datanodes are lost and they can be
reused for newdata:

a. bin/hdfs namenode –format

6. Start the daemons

a. sbin/start-dfs.sh (Starts NameNode andDataNode)

You can check If NameNode has started successfully or not by using the following
web interface: http://0.0.0.0:50070 .

If you are unable tosee this, try to check the logs in the /home/ hadoop_dev/hadoop2/logs
folder.

7. You can check whether the daemons are runningor not by issuing Jps command.

8. This finishes the installation of Hadoop in pseudodistributed mode.

9. Let us run the same example we can in theprevious blog post:

i) Create a new directory on the hdfs

bin/hdfs dfs -mkdir –p /user/hadoop_dev

Copy the input files for the program to hdfs:
bin/hdfs dfs -put etc/hadoop input

Run the program:

bin/hadoop jar share/hadoop/mapreduce/ hadoop-mapreduce-examples-

2.6.0.jar grep

input output 'dfs[a-z.]+'

ii) View the output on hdfs:

bin/hdfs dfs -cat output/*

10. Stop the daemons when you are done executing the jobs, with the below command:

sbin/stop-dfs.sh

Hadoop Installation – PsuedoDistributed Mode( YARN )

Steps for Installation

1. Edit the file /home/hadoop_dev/hadoop2/etc/hadoop/mapred-site.xml as below:

<configuration>
<property>
<name>mapreduce.framework.name</name>
<value>yarn</value>
</property>

2. Edit the fie /home/hadoop_dev/hadoop2/etc/hadoop/yarn-site.xml as below:

<configuration>
<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle</value>
</property>
</configuration>

Note: This particular configuration tells

MapReduce how to do its shuffle. In this case ituses the mapreduce_shuffle.

3. Format the NameNode:

bin/hdfs namenode –format

4. Start the daemons using the command:

sbin/start-yarn.sh
This starts the daemons ResourceManager andNodeManager.

Once this command is run, you can check if ResourceManager is running or not by visiting
the following URL on browser : http://0.0.0.0:8088 . If you are unable to see this, check for
the logs in thedirectory: /home/hadoop_dev/hadoop2/logs

5. To check whether the services are running, issuea jps command. The following shows all
the services necessary to run YARN on a single server:

$ jps
15933 Jps
15567 ResourceManager
15785 NodeManager
6. Let us run the same example as we ran before:

i) Create a new directory on the hdfs

bin/hdfs dfs -mkdir –p /user/hadoop_dev

Copy the input files for the program to hdfs:

bin/hdfs dfs -put etc/hadoop input

ii) Run the program:

bin/yarn jar share/hadoop/mapreduce/ hadoop-mapreduce-examples-2.6.0.jar grep

input output 'dfs[a-z.]+'

iii) View the output on hdfs:

bin/hdfs dfs -cat output/*

7. Stop the daemons when you are done executingthe jobs, with the below command:

sbin/stop-yarn.sh

This completes the installation part of Hadoop.

RESULT:

Thus the installation of Hadoop has been executed successfully.

4 Oct Part 1 UNQ Cloud
No ratings yet
4 Oct Part 1 UNQ Cloud
14,584 pages
Java Foundation With Data Structures Topic: Installation Guide For JDK and Eclipse
No ratings yet
Java Foundation With Data Structures Topic: Installation Guide For JDK and Eclipse
7 pages
Cohesity Deployment Guide Microsoft SQL Configurations
No ratings yet
Cohesity Deployment Guide Microsoft SQL Configurations
35 pages
TrailHead ENV INIT
No ratings yet
TrailHead ENV INIT
83 pages
App Steps
No ratings yet
App Steps
27 pages
Sakina Web
No ratings yet
Sakina Web
49 pages
BDA Lab Manual1
No ratings yet
BDA Lab Manual1
54 pages
4.2-Day-Recape of Java Fundamentals
No ratings yet
4.2-Day-Recape of Java Fundamentals
37 pages
Installing Java 17
No ratings yet
Installing Java 17
10 pages
Java Environment Installation Guide
No ratings yet
Java Environment Installation Guide
13 pages
14459autodesk Civil 3D 2025 For Windows
No ratings yet
14459autodesk Civil 3D 2025 For Windows
3 pages
First App Development
No ratings yet
First App Development
23 pages
CoreJava Day 1
No ratings yet
CoreJava Day 1
22 pages
Chap 2
No ratings yet
Chap 2
5 pages
Java Eclipse Installation Final-1
No ratings yet
Java Eclipse Installation Final-1
10 pages
BDC Output 1
No ratings yet
BDC Output 1
9 pages
Installing Hadoop On Ubuntu
No ratings yet
Installing Hadoop On Ubuntu
29 pages
JDK Installation Guide
100% (1)
JDK Installation Guide
8 pages
Printing
No ratings yet
Printing
5 pages
LAB1 - JDK Installation - Sam Prince Franklin K - 20MIS1115
No ratings yet
LAB1 - JDK Installation - Sam Prince Franklin K - 20MIS1115
11 pages
Installation Guide - JAVA-27
No ratings yet
Installation Guide - JAVA-27
34 pages
Guide To Install Oracle Java-JDK (Java Development Kit)
No ratings yet
Guide To Install Oracle Java-JDK (Java Development Kit)
10 pages
Instalacija Tower Build 1350 I ArmCAD-A Build 1763 Na Win 7 x64 Ultimate
100% (2)
Instalacija Tower Build 1350 I ArmCAD-A Build 1763 Na Win 7 x64 Ultimate
2 pages
Setting Up Environment For JAVA Programming
No ratings yet
Setting Up Environment For JAVA Programming
5 pages
Experiment 1
No ratings yet
Experiment 1
3 pages
iTWO Costx Standalone Database Backup and Recovery
No ratings yet
iTWO Costx Standalone Database Backup and Recovery
28 pages
Downloading and Installing JDK and Netbeans
No ratings yet
Downloading and Installing JDK and Netbeans
12 pages
Java Installation Guide PDF
No ratings yet
Java Installation Guide PDF
16 pages
Java Runtime Environment Java Hotspot Client
No ratings yet
Java Runtime Environment Java Hotspot Client
1 page
Java Software Installation by Satish Sir
No ratings yet
Java Software Installation by Satish Sir
11 pages
Installation Procedure of JAVA & Eclipse
No ratings yet
Installation Procedure of JAVA & Eclipse
6 pages
Instagram Comments
No ratings yet
Instagram Comments
359 pages
Java Installation and Setup For Window
No ratings yet
Java Installation and Setup For Window
5 pages
Pratical 1 Java
No ratings yet
Pratical 1 Java
5 pages
HDFS Installation Steps
No ratings yet
HDFS Installation Steps
17 pages
Installation Guide - JAVA-3438 (1) - 3438
No ratings yet
Installation Guide - JAVA-3438 (1) - 3438
19 pages
Ict WB Answers - c11
No ratings yet
Ict WB Answers - c11
4 pages
Java - Environment Setup
No ratings yet
Java - Environment Setup
2 pages
15 - COMPC - Sakshi Sharma - P1
No ratings yet
15 - COMPC - Sakshi Sharma - P1
6 pages
Java Installation
No ratings yet
Java Installation
3 pages
Blue Business Plan PPT Slides
No ratings yet
Blue Business Plan PPT Slides
50 pages
Assignment:1: Install Java On Windows 10
No ratings yet
Assignment:1: Install Java On Windows 10
4 pages
Day 2
No ratings yet
Day 2
14 pages
Ry It Option Online
No ratings yet
Ry It Option Online
2 pages
Java Installation Guide
No ratings yet
Java Installation Guide
16 pages
Hadoop Installation Guide
No ratings yet
Hadoop Installation Guide
10 pages
JDK Installation Steps
No ratings yet
JDK Installation Steps
10 pages
Securing Connections To Web User Interface in ACE
100% (1)
Securing Connections To Web User Interface in ACE
12 pages
Java Programming Setup
No ratings yet
Java Programming Setup
5 pages
Java and The Windows Command Prompt
No ratings yet
Java and The Windows Command Prompt
1 page
Steps To Download - Install - Java - Eclipse
No ratings yet
Steps To Download - Install - Java - Eclipse
9 pages
Java - Download and Install JDK 10 On Windows PDF
No ratings yet
Java - Download and Install JDK 10 On Windows PDF
16 pages
Oracle EPMA 11.1.2.4 Slient Installation
100% (1)
Oracle EPMA 11.1.2.4 Slient Installation
42 pages
How To Download and Install JDK 1
No ratings yet
How To Download and Install JDK 1
19 pages
Graded Lab Assignment-Modules 1-4
No ratings yet
Graded Lab Assignment-Modules 1-4
4 pages
CCL - Exp 5 - 122a1108
No ratings yet
CCL - Exp 5 - 122a1108
9 pages
Path Setting Java
No ratings yet
Path Setting Java
3 pages
Downloading and Installing Java On Windows: Prevent Errors Like
No ratings yet
Downloading and Installing Java On Windows: Prevent Errors Like
5 pages
How To Install JDK On Windows: Step 0: Un-Install Older Version(s) of JDK/JRE
No ratings yet
How To Install JDK On Windows: Step 0: Un-Install Older Version(s) of JDK/JRE
5 pages
Setting Up Eclipse With Java 1
No ratings yet
Setting Up Eclipse With Java 1
6 pages
Hadoop Installation
No ratings yet
Hadoop Installation
12 pages
Java - Environment Setup
No ratings yet
Java - Environment Setup
2 pages
CIS CentOS Linux 6 Benchmark v1.1.01 PDF
No ratings yet
CIS CentOS Linux 6 Benchmark v1.1.01 PDF
172 pages
Development Kit (Or JDK For Short, and SE Means Standard Edition) - Basically, A JDK Contains
No ratings yet
Development Kit (Or JDK For Short, and SE Means Standard Edition) - Basically, A JDK Contains
9 pages
Setting The JAVA - HOME Variable in Windows
No ratings yet
Setting The JAVA - HOME Variable in Windows
2 pages
Prerequisites: The Ubuntu 16.04 Initial Server Setup Guide
No ratings yet
Prerequisites: The Ubuntu 16.04 Initial Server Setup Guide
3 pages
Javasw
No ratings yet
Javasw
2 pages
Installing Windows 2000: Preparing For Installation
No ratings yet
Installing Windows 2000: Preparing For Installation
21 pages
Install Java JDK 8
No ratings yet
Install Java JDK 8
10 pages
Java Java Standard Edition (J2SE) : Article Index
No ratings yet
Java Java Standard Edition (J2SE) : Article Index
6 pages
JDK Installation On Window and Ubuntu
No ratings yet
JDK Installation On Window and Ubuntu
3 pages
NIM Master Step by Step
No ratings yet
NIM Master Step by Step
4 pages
How To Install JDK 8 (On Windows, Mac OS, Ubuntu)
No ratings yet
How To Install JDK 8 (On Windows, Mac OS, Ubuntu)
11 pages
1.5.3 TestOut LabSim Linux Facts
No ratings yet
1.5.3 TestOut LabSim Linux Facts
2 pages
Hadoop Imp Commands
No ratings yet
Hadoop Imp Commands
21 pages
TD's Flexible Volume Profile (MT4) : Installation & Authorization
No ratings yet
TD's Flexible Volume Profile (MT4) : Installation & Authorization
7 pages
Cut, Copy, Paste, and Other Common Shortcuts
No ratings yet
Cut, Copy, Paste, and Other Common Shortcuts
17 pages
EMTP-RV Demonstration Installation Guide
No ratings yet
EMTP-RV Demonstration Installation Guide
3 pages
CAT Grade 10 Term 1 Week 4 TG
No ratings yet
CAT Grade 10 Term 1 Week 4 TG
2 pages
DEC50103 PW4 (KP Approved)
No ratings yet
DEC50103 PW4 (KP Approved)
11 pages
Steps To Load Essbase Security - CISCO
100% (1)
Steps To Load Essbase Security - CISCO
4 pages
How To Reset Applications Manager Login Password For Admin Account
No ratings yet
How To Reset Applications Manager Login Password For Admin Account
2 pages
Sun Solaris Cheat Sheet
No ratings yet
Sun Solaris Cheat Sheet
2 pages
16 09 55
No ratings yet
16 09 55
12 pages
Mondorescue Howto PDF
No ratings yet
Mondorescue Howto PDF
53 pages
Java JDK Installation and Configuration
No ratings yet
Java JDK Installation and Configuration
4 pages
Map A Folder To A Drive Letter For Quick and Easy Access - Raymond - CC
No ratings yet
Map A Folder To A Drive Letter For Quick and Easy Access - Raymond - CC
12 pages
Nagios HWg-STE en
No ratings yet
Nagios HWg-STE en
3 pages
DD NDP451 KB2858725 x86 x64 ENU Decompression Log
No ratings yet
DD NDP451 KB2858725 x86 x64 ENU Decompression Log
1 page