Module_3_Session_2 Features and Components of Hadoop

The document outlines the features of Hadoop, highlighting its fault-tolerant, scalable, and modular design that efficiently handles Big Data storage and processing. It describes the robustness of the Hadoop Distributed File System (HDFS) and its ability to continue operations despite server failures, along with its open-source nature and reliance on Java and Linux. Additionally, it details the core components of the Apache Hadoop framework, including Hadoop Common, HDFS, YARN, and MapReduce.


CS6CRT19 Big Data Analytics – Module 3

Features of Hadoop
●  Fault-tolerant, scalable, flexible and modular design: Hadoop uses a simple and modular programming model and scales to a large number of servers; the system grows by adding new nodes to handle larger data. Hadoop proves very helpful in storing, managing, processing and analyzing Big Data. Modular functions make the system flexible: one can add or replace components with ease, and modularity allows swapping a component for a different software tool.
●  Robust design of HDFS: Execution of Big Data applications continues even when an individual server or node in the cluster fails, because Hadoop provides backup and data-recovery mechanisms. HDFS thus offers high reliability.
●  Store and process Big Data: Hadoop stores and processes Big Data with 3V (volume, velocity, variety) characteristics.
●  Distributed cluster computing model with data locality: Hadoop processes Big Data at high speed because application tasks and subtasks are submitted to the DataNodes that already hold the data. More computing power is obtained by increasing the number of computing nodes. Processing is split across multiple DataNodes, which yields fast processing and aggregated results.
●  Hardware fault tolerance: A hardware fault does not affect data or application processing. If a node goes down, the remaining nodes take over its work, because all data blocks are replicated automatically; the default is three copies of each block.
●  Open-source framework: Open-source access and cloud services enable large data stores. Hadoop runs on a cluster of multiple inexpensive servers or in the cloud.
●  Java and Linux based: Hadoop exposes Java interfaces. Its base platform is Linux, and it provides its own set of shell commands (a small example of both follows this list).
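To make the replication and Java-interface points concrete, the following is a minimal sketch, assuming a reachable cluster and the standard Hadoop FileSystem API; the path /data/sample.txt is only an illustrative placeholder. It reads a file's current replication factor and resets it to the usual default of three copies.

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileStatus;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class ReplicationCheck {
    public static void main(String[] args) throws Exception {
        // Connect to the cluster described by the local Hadoop configuration.
        Configuration conf = new Configuration();
        FileSystem fs = FileSystem.get(conf);

        // Placeholder path used only for illustration.
        Path file = new Path("/data/sample.txt");
        FileStatus status = fs.getFileStatus(file);
        System.out.println("Current replication factor: " + status.getReplication());

        // Ask HDFS to keep three copies of each block (the usual default).
        fs.setReplication(file, (short) 3);
    }
}

The same information is available from the Hadoop shell, for example with hdfs dfs -ls /data/sample.txt (the replication factor appears in the second column for files) or hdfs dfs -setrep 3 /data/sample.txt.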


The base Apache Hadoop framework is composed of the following modules:


●​ Hadoop Common – contains libraries and utilities needed by other Hadoop
modules
●​ Hadoop Distributed File System (HDFS) – a distributed file-system that
stores data on commodity machines, providing very high aggregate
bandwidth across the cluster
●​ Hadoop YARN – (introduced in 2012) a platform responsible for managing
computing resources in clusters and using them for scheduling users'
applications
●  Hadoop MapReduce – an implementation of the MapReduce programming model for large-scale data processing (a minimal word-count sketch follows this list)
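As an illustration of the MapReduce programming model, below is a minimal sketch of the classic word-count example: the mapper emits a (word, 1) pair for every word in its input, and the reducer sums the counts for each distinct word. It omits the job driver and is not a complete application.

import java.io.IOException;
import java.util.StringTokenizer;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;

public class WordCount {

    // Map phase: split each input line into words and emit (word, 1).
    public static class TokenizerMapper
            extends Mapper<LongWritable, Text, Text, IntWritable> {
        private static final IntWritable ONE = new IntWritable(1);
        private final Text word = new Text();

        @Override
        protected void map(LongWritable key, Text value, Context context)
                throws IOException, InterruptedException {
            StringTokenizer tokens = new StringTokenizer(value.toString());
            while (tokens.hasMoreTokens()) {
                word.set(tokens.nextToken());
                context.write(word, ONE);
            }
        }
    }

    // Reduce phase: add up the counts emitted for each distinct word.
    public static class IntSumReducer
            extends Reducer<Text, IntWritable, Text, IntWritable> {
        @Override
        protected void reduce(Text key, Iterable<IntWritable> values, Context context)
                throws IOException, InterruptedException {
            int sum = 0;
            for (IntWritable count : values) {
                sum += count.get();
            }
            context.write(key, new IntWritable(sum));
        }
    }
}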
