Presentation: Hadoop Technology

The document provides an overview of Hadoop technology including its architecture, components, and MapReduce framework. It describes how Hadoop uses a distributed file system to store large datasets across clusters and nodes. It also explains the typical workflow of how data is loaded and stored in HDFS, including how the NameNode manages block placement. Finally, it gives a high-level description of the MapReduce programming model and provides an example of how it can be used to count word frequencies.

Uploaded by

Rahul Singh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

33 views15 pages

Presentation: Hadoop Technology

Uploaded by

Rahul Singh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 15

Presentation

HADOOP TECHNOLOGY
Submitted by

Rahul Singh
Roll NO 1503314918,MCA
Under the Guidance of
Rama Chaudhary,
(Assistant Professor)
Raj Kumar Goel Institute of Technology
Contents

Hadoop Introduction
Famous Hadoop Users
Hadoop Architecture
Hadoop Cluster
Hadoop Cluster Components
HDFS Architecture
MapReduce Overview
MapReduce Word Count
HADHOOP

Distributed file system

Traditional hierarchical file organization

Single namespace for the entire cluster

Write-once-read-many access model

How Sample.txt gets loaded into

the Hadoop Cluster?

Client machine does this step and

loads the Sample.txt into cluster. It
breaks the sample.
txt into smaller chunks which are
known as "Blocks" in Hadoop
context. Client put these blocks on
different machines (data nodes)
throughout the cluster.
Next, how does the Client knows
that to which data nodes load
the blocks?
Now NameNode comes into
picture. The NameNode used its
Rack Awareness intelligence to
decide on which DataNode to
provide. For each of the data block
(in this case Block-A, Block-B and
Block-C), Client contacts
NameNode and in response
NameNode sends an ordered list of
3 DataNodes.

For example in response to Block-

A request, Node Name may send
DataNode-2, DataNode-3 and
DataNode-4.
Who does the block replication?
MapReduce Overview

A method for distributing computation across multiple nodes

Each node processes the data that is stored at that node
The Mapper

Reads data as key/value pairs

Outputs zero or more key/value
pairs
The Reducer
Called once for each unique key
Gets a list of all values associated with a key
as input
The reducer outputs zero or more final
key/value pairs
MapReduce: Word Count

Presented by Rahul Singh Roll No:-1503314918, MCA Rajkumare Goel Institute of Technology
No ratings yet
Presented by Rahul Singh Roll No:-1503314918, MCA Rajkumare Goel Institute of Technology
17 pages
Module 2 Hadoop
No ratings yet
Module 2 Hadoop
23 pages
Unit 2
No ratings yet
Unit 2
56 pages
Introduction To Hadoop - Chapter-2
No ratings yet
Introduction To Hadoop - Chapter-2
59 pages
Unit-2 CH 1 Updated
No ratings yet
Unit-2 CH 1 Updated
22 pages
Unit 2 Hadoop
No ratings yet
Unit 2 Hadoop
60 pages
Exploring Bigdata With Hadoop: Dr.A.Bazila Banu Associate Professor Department of Cse
No ratings yet
Exploring Bigdata With Hadoop: Dr.A.Bazila Banu Associate Professor Department of Cse
23 pages
Introduction To Hadoop
No ratings yet
Introduction To Hadoop
56 pages
HDFS
No ratings yet
HDFS
46 pages
Hadoop Architecture
No ratings yet
Hadoop Architecture
8 pages
Business Intelligence & Big Data Analytics-CSE3124Y
No ratings yet
Business Intelligence & Big Data Analytics-CSE3124Y
26 pages
Hadoop Overview: Open Source Framework Processing Large Amounts of Heterogeneous Data Sets Distributed Fashion
No ratings yet
Hadoop Overview: Open Source Framework Processing Large Amounts of Heterogeneous Data Sets Distributed Fashion
62 pages
BDA-Unit 4
No ratings yet
BDA-Unit 4
20 pages
Lecture Notes Hadoop
100% (1)
Lecture Notes Hadoop
11 pages
Unit V Cloud Technologies and Advancements
No ratings yet
Unit V Cloud Technologies and Advancements
33 pages
Hadoop Class 1 PDF
No ratings yet
Hadoop Class 1 PDF
27 pages
Bda - Unit 3
No ratings yet
Bda - Unit 3
41 pages
Hadoop Physical Organization
No ratings yet
Hadoop Physical Organization
7 pages
Unit 5
No ratings yet
Unit 5
101 pages
Wa0002.
No ratings yet
Wa0002.
32 pages
Module 2
No ratings yet
Module 2
17 pages
Hadoop 1
No ratings yet
Hadoop 1
26 pages
Unit 3
No ratings yet
Unit 3
18 pages
Hadoop
No ratings yet
Hadoop
5 pages
BDP 2023 03
No ratings yet
BDP 2023 03
59 pages
Hadoop Ankit
No ratings yet
Hadoop Ankit
20 pages
Hadoop Presentaton
No ratings yet
Hadoop Presentaton
47 pages
Big Data Unit-2 PPT Part1
No ratings yet
Big Data Unit-2 PPT Part1
76 pages
Introduction To Hadoop
No ratings yet
Introduction To Hadoop
52 pages
Wa0002.
No ratings yet
Wa0002.
66 pages
2-Hadoop History Terminologies DFS-03-01-2025
No ratings yet
2-Hadoop History Terminologies DFS-03-01-2025
52 pages
Bda Unit-Iv
No ratings yet
Bda Unit-Iv
37 pages
Biodiesel Research
No ratings yet
Biodiesel Research
29 pages
Unit-3 BDA
No ratings yet
Unit-3 BDA
30 pages
Unit - 2
No ratings yet
Unit - 2
42 pages
Introduction To Hadoop
No ratings yet
Introduction To Hadoop
18 pages
4 UNIT-4 Introduction To Hadoop
No ratings yet
4 UNIT-4 Introduction To Hadoop
154 pages
NYOUG Hadoop Presentaton
No ratings yet
NYOUG Hadoop Presentaton
47 pages
Unit 5 Print
No ratings yet
Unit 5 Print
32 pages
1 Bda Chapter1 Answer
No ratings yet
1 Bda Chapter1 Answer
7 pages
Module II
No ratings yet
Module II
46 pages
Unit 3 Bba
No ratings yet
Unit 3 Bba
11 pages
Hadoopintro
No ratings yet
Hadoopintro
31 pages
Lovely Professional University (Lpu) : Mittal School of Business (Msob)
No ratings yet
Lovely Professional University (Lpu) : Mittal School of Business (Msob)
10 pages
Unit 2
No ratings yet
Unit 2
18 pages
Lecture-1 - 3 Hadoop - HDFS - Mapreduce (Self Study)
No ratings yet
Lecture-1 - 3 Hadoop - HDFS - Mapreduce (Self Study)
25 pages
Hadoop 1
No ratings yet
Hadoop 1
75 pages
Lecture 2
No ratings yet
Lecture 2
28 pages
Unit 3
No ratings yet
Unit 3
44 pages
BD Sec B
No ratings yet
BD Sec B
19 pages
DW - Bigdata9
No ratings yet
DW - Bigdata9
113 pages
U-3 Big Data
No ratings yet
U-3 Big Data
23 pages
CC Unit 5 Notes
No ratings yet
CC Unit 5 Notes
30 pages
02 Unit-II Hadoop Architecture and HDFS
No ratings yet
02 Unit-II Hadoop Architecture and HDFS
18 pages
Lecture 5 - Hadoop and Mapreduce
No ratings yet
Lecture 5 - Hadoop and Mapreduce
30 pages
Lecture 5 - Hadoop and Mapreduce
No ratings yet
Lecture 5 - Hadoop and Mapreduce
30 pages
Prepared By: Manoj Kumar Joshi & Vikas Sawhney
No ratings yet
Prepared By: Manoj Kumar Joshi & Vikas Sawhney
47 pages
BDA Unit - 4
No ratings yet
BDA Unit - 4
16 pages
Big Data Analytics
From Everand
Big Data Analytics
Nitin Kumar Yadav
No ratings yet
Mastering Apache Cassandra - Second Edition
From Everand
Mastering Apache Cassandra - Second Edition
Nishant Neeraj
No ratings yet