Big Data Analytics Unit-2
Working with Big Data: Google File System, Hadoop Distributed File System (HDFS) – Building blocks of
Hadoop (Namenode, Datanode, Secondary Namenode, JobTracker, TaskTracker), Introducing and Configuring
Hadoop cluster (Local, Pseudo-distributed mode, Fully Distributed mode), Configuring XML files.
Google File System (GFS)
A GFS cluster consists of a single master and multiple chunk servers, and is accessed by multiple clients. Each of these is typically a commodity Linux machine running a user-level server process.
Files are divided into fixed-size chunks. Each chunk is identified by a fixed and globally
unique 64-bit chunk handle assigned by the master at the time of chunk creation.
Chunk servers store chunks on local disks as Linux files.
For reliability, each chunk is replicated on multiple chunk servers. By default there are three replicas; this value can be changed by the user.
The master maintains all file system metadata. This includes the namespace, access
control information, the mapping from files to chunks, and the current locations of
chunks.
It also controls system-wide activities such as chunk lease management, garbage
collection of orphaned chunks, and chunk migration between chunk servers.
The master periodically communicates with each chunk server in Heart Beat messages
to give it instructions and collect its state.
GFS client code linked into each application implements the file system API and
communicates with the master and chunk servers to read or write data on behalf of the
application.
Clients interact with the master for metadata operations, but all data-bearing
communication goes directly to the chunk servers.
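As a concrete example of this division of labor, consider a read: the client translates the byte offset requested by the application into a chunk index within the file, asks the master for that chunk's handle and replica locations, caches the reply, and then fetches the data directly from the nearest chunk server. The master is involved only in the brief metadata exchange, not in the data transfer itself.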
Chunk Size
A large chunk size offers several important advantages.
First, it reduces clients' need to interact with the master, because reads and writes on the same chunk require only one initial request to the master for chunk location information.
Second, it can reduce network overhead by keeping a persistent TCP connection to the
chunkserver over an extended period of time.
Third, it reduces the size of the metadata stored on the master.
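To make the third advantage concrete: with GFS's default 64 MB chunk size and less than 64 bytes of master metadata per chunk, a 1 GB file costs the master only 16 chunk entries (about 1 KB of metadata), whereas a 64 KB chunk size would require 16,384 entries for the same file.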
Disadvantages
1. Large block size can lead to internal fragmentation.
2. Even with lazy space allocation, a small file consists of only a few chunks, perhaps just one. The chunk servers storing those chunks may become hot spots if many clients access the same file. In practice, hot spots have not been a major issue because applications mostly read large multi-chunk files sequentially. To mitigate them, such files can be stored with a higher replication factor, and clients can be allowed to read from other clients that already hold the data.
Big Data
Big data is a term for data sets that are so large or complex that traditional data
processing application software is inadequate or insufficient to deal with them.
Big data can be described by the following characteristics:
Volume
The quantity of generated and stored data. The size of the data determines the value and potential insight, and whether it can actually be considered big data or not.
Variety
The type and nature of the data. This helps people who analyse it to effectively use the
resulting insight. This includes three types:
o Structured:
Structured data refers to information with a high degree of organization, such that it can be seamlessly included in a relational database and is readily searchable by simple, straightforward search algorithms or other search operations;
o Unstructured
Unstructured data is essentially the opposite. Unstructured data refers to
information that either does not have a pre-defined data model or is not
organized in a pre-defined manner. Unstructured information is
typically text-heavy, but may contain data such as dates, numbers, and facts
as well.
o Semi structured
Semi-structured data is information that doesn’t reside in a relational database but that does have some organizational properties that make it easier to analyze. With some processing, we can store it in a relational database.
Examples of semi-structured data include CSV files and XML and JSON documents; NoSQL databases are also considered semi-structured.
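As a tiny illustration, the following JSON record (with made-up fields) is semi-structured: it carries its own field names and nesting, but is not bound to a fixed relational schema:
{
"name": "Asha",
"age": 31,
"skills": ["Hadoop", "HDFS"],
"address": { "city": "Hyderabad" }
}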
Velocity
In this context, the speed at which the data is generated and processed to meet the
demands and challenges that lie in the path of growth and development.
Variability
Inconsistency of the data set.
Veracity
The quality of captured data can vary greatly, affecting accurate analysis.
Applications of Big Data:
Big data analysis played a large role in Barack Obama's successful 2012 re-election
campaign.
Big data analysis was also used in the BJP's campaign to win the 2014 Indian General Election.
Big data and the IoT work in conjunction. Data extracted from IoT devices provides a
mapping of device inter-connectivity.
eBay.com uses two data warehouses at 7.5 petabytes and 40PB as well as a 40PB Hadoop
cluster for search, consumer recommendations, and merchandising.
Amazon.com handles millions of back-end operations every day
Facebook handles 50 billion photos from its user base (1,930,025,943 active Facebook users at the time of writing; for more, see http://www.internetlivestats.com/).
Big data has its application in fields like Health Care, Retail banking, Marketing etc.
Building Blocks of Hadoop
A fully configured cluster "running Hadoop" means running a set of daemons, or resident programs, on the different servers in your network.
These daemons have specific roles; some exist only on one server, some exist across multiple servers.
The daemons include
■ NameNode
■ DataNode
■ Secondary NameNode
■ JobTracker
■ TaskTracker
NameNode
Hadoop employs a master/slave architecture for both distributed storage and distributed computation.
The distributed storage system is called the Hadoop Distributed File System, or HDFS.
The NameNode is the master of HDFS that directs the slave DataNode daemons to perform the low-level I/O
tasks.
The NameNode is the bookkeeper of HDFS; it keeps track of how your files are broken down into file blocks,
which nodes store those blocks, and the overall health of the distributed file system.
The server hosting the NameNode typically doesn’t store any user data or perform any computations for a
MapReduce program.
The negative aspect of the NameNode is that if it fails, the entire Hadoop cluster becomes unavailable.
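Because the NameNode holds the file-to-block mapping, we can ask it how any file is laid out. One convenient way is the fsck utility; the path below is just an illustrative example:
[hadoop-user@master]$ bin/hadoop fsck /user/chuck/data1 -files -blocks -locations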
DataNode
Each slave machine in an HDFS cluster hosts a DataNode daemon to perform the work of reading and writing HDFS blocks to actual files on the local file system.
When we want to read or write an HDFS file, the file is broken into blocks, and the NameNode tells our client which DataNode each block resides on. Our client then communicates directly with the DataNode daemons to process the local files corresponding to those blocks.
A DataNode may also communicate with other DataNodes to replicate its data blocks for redundancy.
Figure 2.1 illustrates the roles of the NameNode and DataNodes.
In this figure, we show two data files, one at /user/chuck/data1 and another at /user/james/data2. The data1 file takes up three blocks, which we denote 1, 2, and 3, and the data2 file consists of blocks 4 and 5. The content of the files is distributed among the DataNodes.
In the figure, each block has three replicas, to ensure that if any one DataNode crashes or becomes inaccessible over the network, we will still be able to read the files.
DataNodes are constantly reporting to the NameNode.
The DataNodes continually communicate with the NameNode to provide information regarding local changes as well as
receive instructions to create, move, or delete blocks from the local disk.
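To see the DataNodes from the NameNode's point of view (which nodes are alive and how much storage each contributes), the dfsadmin report is a handy check on a running cluster:
[hadoop-user@master]$ bin/hadoop dfsadmin -report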
Secondary NameNode
The Secondary NameNode (SNN) is an assistant daemon for monitoring the state of the cluster's HDFS.
Each cluster has one SNN.
The SNN communicates with the NameNode to take snapshots of the HDFS metadata at intervals defined by the cluster configuration.
The NameNode is a single point of failure for a Hadoop cluster, and the SNN snapshots help minimize the downtime and loss of data.
A NameNode failure requires human intervention to reconfigure the cluster to use the SNN as the primary NameNode.
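The snapshot interval is a configurable property. In 0.20-era Hadoop it is fs.checkpoint.period in core-site.xml, given in seconds, with a default of one hour:
<property>
<name>fs.checkpoint.period</name>
<value>3600</value>
<description>The number of seconds between two periodic checkpoints.</description>
</property>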
JobTracker
The JobTracker daemon is the link between your application and Hadoop.
Once we submit our code to the cluster, the JobTracker determines the execution plan by determining which
files to process, assigns nodes to different tasks, and monitors all tasks as they’re running.
If a task fails, the JobTracker will automatically relaunch it, possibly on a different node, up to a predefined limit of retries.
There is only one JobTracker daemon per Hadoop cluster. It typically runs on a server acting as the master node of the cluster.
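Once a cluster is up, we can watch the JobTracker at work from the command line; for example, the following lists the jobs it is currently running (the job client talks to the JobTracker configured in mapred-site.xml):
[hadoop-user@master]$ bin/hadoop job -list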
TaskTracker
Just like the storage daemons, the computing daemons also follow a master/slave architecture: the JobTracker
is the master overseeing the overall execution of a MapReduce job and the TaskTrackers manage the execution
of individual tasks on each slave node.
Figure 2.2 illustrates this interaction
One responsibility of the TaskTracker is to constantly communicate with the JobTracker. If the JobTracker fails
to receive a heartbeat from a TaskTracker within a specified amount of time, it will assume the TaskTracker
has crashed and will resubmit the corresponding tasks to other nodes in the cluster.
Figure 2.3 depicts the topology of a typical Hadoop cluster.
This topology features a master node running the NameNode and JobTracker daemons and a standalone node
with the SNN in case the master node fails.
The slave machines each host a DataNode and a TaskTracker, for running tasks on the same node where their data is stored.
Introducing and Configuring Hadoop cluster:
All of the configuration files live under the conf/ directory of the Hadoop installation, so the first step is to change to the Hadoop home directory:
[hadoop-user@master]$ cd $HADOOP_HOME
In conf/hadoop-env.sh, define the JAVA_HOME environment variable to point to the Java installation directory:
export JAVA_HOME=/usr/share/jdk
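A quick sanity check that Hadoop can find the JVM is to ask it for its version; the exact output will vary with your installation:
[hadoop-user@master]$ bin/hadoop version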
Before version 0.20, these XML files were hadoop-default.xml and hadoop-site.xml.
The hadoop-default.xml file contains the default Hadoop settings, which are used unless they are explicitly overridden in hadoop-site.xml.
In version 0.20 the hadoop-site.xml file has been separated into three XML files: core-site.xml, hdfs-site.xml, and mapred-site.xml.
A Hadoop cluster can be configured in one of the following 3 modes by modifying the above XML files.
o Local (standalone) mode
o Pseudo-distributed mode
o Fully Distributed mode
Local (standalone) mode
The standalone mode is the default mode for Hadoop; all three XML configuration files are empty:
<configuration>
</configuration>
With empty configuration files, Hadoop runs completely on the local machine. Its primary use is for developing and debugging the application logic of a MapReduce program without the additional complexity of interacting with the daemons.
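As a quick smoke test of standalone mode, we can run one of the example jobs that ships with Hadoop directly against local files (the examples jar name varies with the Hadoop version, hence the wildcard):
[hadoop-user@master]$ mkdir input
[hadoop-user@master]$ cp conf/*.xml input
[hadoop-user@master]$ bin/hadoop jar hadoop-*-examples.jar grep input output 'dfs[a-z.]+'
[hadoop-user@master]$ cat output/*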
Pseudo-distributed mode
The pseudo-distributed mode is running Hadoop in a “cluster of one” with all daemons running on a single machine.
This mode allows us to examine memory usage, HDFS input/output issues, and other daemon interactions.
Listing 2.1 provides simple XML files to configure a single server in this mode.
hdfs-site.xml
<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<configuration>
<property>
<name>dfs.replication</name>
<value>1</value>
<description>The actual number of replications can be specified when the file is
created.</description>
</property>
</configuration>
In core-site.xml and mapred-site.xml we specify the hostname and port of the NameNode and the JobTracker,
respectively.
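For completeness, minimal versions of those two files for pseudo-distributed mode could look as follows; localhost with ports 9000 and 9001 is the conventional choice, not a requirement:
core-site.xml
<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<configuration>
<property>
<name>fs.default.name</name>
<value>hdfs://localhost:9000</value>
<description>The name of the default file system.</description>
</property>
</configuration>
mapred-site.xml
<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<configuration>
<property>
<name>mapred.job.tracker</name>
<value>localhost:9001</value>
<description>The host and port of the MapReduce JobTracker.</description>
</property>
</configuration>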
In hdfs-site.xml we specify the default replication factor for HDFS.
We must also specify the location of the Secondary NameNode in the masters file and the slave nodes in the slaves file:
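In pseudo-distributed mode everything runs on a single machine, so both files simply contain localhost:
[hadoop-user@master]$ cat masters
localhost
[hadoop-user@master]$ cat slaves
localhost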
While all the daemons are running on the same machine, they still communicate with each other using the same SSH
protocol as if they were distributed over a cluster.
We need to check that the machine allows us to ssh back to itself:
[hadoop-user@master]$ ssh localhost
If the above command results in an error, set up passphraseless SSH using the following commands:
[hadoop-user@master]$ ssh-keygen -t dsa -P '' -f ~/.ssh/id_dsa
[hadoop-user@master]$ cat ~/.ssh/id_dsa.pub >> ~/.ssh/authorized_keys
Next, we need to format the NameNode using the following command:
[hadoop-user@master]$ bin/hadoop namenode -format
Launch the daemons with the start-all.sh script. The Java jps command will list all running daemons, so we can verify that the setup was successful:
[hadoop-user@master]$ bin/start-all.sh
[hadoop-user@master]$ jps
26893 Jps
26832 TaskTracker
26620 SecondaryNameNode
26333 NameNode
26484 DataNode
26703 JobTracker
We can shut down all the daemons using the command
[hadoop-user@master]$ bin/stop-all.sh
Fully Distributed mode
In a fully distributed setup, the daemons are spread over dedicated machines. We will use the following server names:
■ master—The master node of the cluster and host of the NameNode and JobTracker daemons
■ backup—The server that hosts the Secondary NameNode daemon
■ hadoop1, hadoop2, hadoop3, ...—The slave boxes of the cluster, each running both the DataNode and TaskTracker daemons
Listing 2.2 is a modified version of the pseudo-distributed configuration files (listing 2.1) that can be used as a skeleton
for our cluster’s setup.
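In outline, listing 2.2 changes the same three properties to point at the dedicated machines, using the hostname master from the server list above, and raises replication back to the default of 3:
core-site.xml: fs.default.name = hdfs://master:9000
mapred-site.xml: mapred.job.tracker = master:9001
hdfs-site.xml: dfs.replication = 3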
We also need to update the masters and slaves files to reflect the locations of the other daemons:
[hadoop-user@master]$ cat masters
backup
[hadoop-user@master]$ cat slaves
hadoop1
hadoop2
hadoop3
...
We then need to format HDFS to prepare it for storage and start the daemons; running jps on each node verifies that the right daemons are up:
[hadoop-user@master]$ bin/hadoop namenode -format
[hadoop-user@master]$ bin/start-all.sh
[hadoop-user@master]$ jps
30879 JobTracker
30717 NameNode
30965 Jps
[hadoop-user@backup]$ jps
2099 Jps
1679 SecondaryNameNode
[hadoop-user@hadoop1]$ jps
7101 TaskTracker
7617 Jps
6988 DataNode