0% found this document useful (0 votes)

16 views16 pages

SSJ Bda File

Uploaded by

chaitanyagndh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views16 pages

SSJ Bda File

Uploaded by

chaitanyagndh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 16

Guru Tegh Bahadur Institute of Technology

Mayapuri, 110064

2024

Name SHIV SHEKHAR JHA

Enrolment No. 00276807721
Branch Information Technology
Year/Sem 4th Year/8th Sem
Subject Big Data Analytics
Subject Code ETIT - 406
Submitted to

1 | SHIV SHEKHAR JHA

00276807721
INDEX

S.No. EXPERIMENT Date Signature

1. How to Use Hadoop Cluster

2. Create a directory in HDFS at given path(s)

3. Upload and download a file in HDFS.

See contents of a file Same as UNIX CAT

4.
Command

5. Copy a file from Source to Destination

6. Remove a File or Directory in HDFS

7. Copy a file from Source to Destination

8. Move file from Source to Destination

9. Display the Aggregate Length of a File

Implement a Program of Word Count Map

10. Reduce program to understand Map Reduce
Paradigm

2 | SHIV SHEKHAR JHA

00276807721
EXPERIMENT 1
How to Use Hadoop Cluster

There are three modes in which you can get the experience of Hadoop:

- Standalone Mode

In this mode you need an IDE like eclipse and the Hadoop library files (which you can
download from the Apache website). You can create your MapReduce program and run it in
your local machine. You will be able to check the logic of the code and you can check any
syntax errors and this needs some sample data to perform these actions but you will not get
the full experience of Hadoop.

- Pseudo-Distributed mode

In this mode you get all the daemons of Hadoop running on a single machine and you can get
a VM from Cloudera or Hortonworks which is just plug and play type of thing. It will have all
the necessary tools installed and configured. In this mode you can scale up your data to check
how your code performs and optimize accordingly to get the job done in the required time.

- Fully-Distributed mode

In this mode you get all the daemons running on different machines. This is mostly used in
the production stage of your project. When you have already verified your code you will get
a chance to implement it in this mode.

Since you request an online service where you can practice your Hadoop code. Install eclipse
on pc and download the libraries and start coding.

3 | SHIV SHEKHAR JHA

00276807721
EXPERIMENT 2
Create a directory in HDFS at given path(s)
Aim: The aim of this practical is to demonstrate how to create directories in the Hadoop
Distributed File System (HDFS) at specified paths using the hadoop fs -mkdir command.
Procedure:
1. Accessing Hadoop Cluster:
• Connect to the Hadoop cluster using SSH or any other remote access method.
2. Creating Directories:
• Use the hadoop fs -mkdir command followed by the paths of the directories you
want to create.
• Syntax: hadoop fs -mkdir <path1> <path2> ...
• Replace <path1>, <path2>, etc., with the paths where you want to create
directories.
• Example: hadoop fs -mkdir /user/saurzcode/dir1 /user/saurzcode/dir2
• This command will create two directories named dir1 and dir2 under the
/user/saurzcode directory in HDFS.
3. Verification:
• After running the command, verify that the directories have been created
successfully by listing the contents of the parent directory.
• Use the hadoop fs -ls command to list the contents of the parent directory and
check for the newly created directories.
• Example: hadoop fs -ls /user/saurzcode

Conclusion:
In this practical, we learned how to create directories in the Hadoop Distributed File System
(HDFS) at specified paths using the hadoop fs -mkdir command. Creating directories in HDFS
is a fundamental operation when organizing and managing data in a Hadoop cluster.
Mastering this skill allows users to efficiently structure their data storage in HDFS for various
big data processing tasks.

4 | SHIV SHEKHAR JHA

00276807721
EXPERIMENT 3
Upload and download a file in HDFS
Aim: The aim of this practical is to demonstrate how to upload and download files between
the local file system and the Hadoop Distributed File System (HDFS) using the hadoop fs -put
and hadoop fs -get commands, respectively.
Procedure:
1. Uploading a File to HDFS:
• Use the hadoop fs -put command to upload a file or files from the local file system to
HDFS.
• Syntax: hadoop fs -put <localsrc> <HDFS_dest_Path>
• Replace <localsrc> with the path to the file(s) in the local file system.
• Replace <HDFS_dest_Path> with the destination path in HDFS where you want to
upload the file(s).
• Example: hadoop fs -put /home/saurzcode/Samplefile.txt /user/saurzcode/dir3/
• This command uploads the file Samplefile.txt from the local file system to the
/user/saurzcode/dir3/ directory in HDFS.
2. Downloading a File from HDFS:
• Use the hadoop fs -get command to download a file from HDFS to the local file system.
• Syntax: hadoop fs -get <HDFS_src_Path> <local_dest_Path>
• Replace <HDFS_src_Path> with the path to the file in HDFS.
• Replace <local_dest_Path> with the destination path in the local file system where
you want to download the file.
• Example: hadoop fs -get /user/saurzcode/dir3/Samplefile.txt /home/
• This command downloads the file Samplefile.txt from the /user/saurzcode/dir3/
directory in HDFS to the /home/ directory in the local file system.
Conclusion:
In this practical, we learned how to upload and download files between the local file system
and the Hadoop Distributed File System (HDFS) using the hadoop fs -put and hadoop fs -get
commands, respectively. Uploading files to HDFS allows you to store data for processing by
Hadoop, while downloading files from HDFS enables you to access and use the processed
data in the local file system. These operations are essential for data management and
workflow in Hadoop clusters.

5 | SHIV SHEKHAR JHA

00276807721
EXPERIMENT 4
See contents of a file Same as UNIX CAT Command

Aim: The aim of this practical is to demonstrate how to view the contents of a file in the
Hadoop Distributed File System (HDFS) using the hadoop fs -cat command, similar to the cat
command in Unix/Linux.
Procedure:
1. Viewing Contents of a File:
• Use the hadoop fs -cat command to view the contents of a file in HDFS.
• Syntax: hadoop fs -cat <HDFS_file_path>
• Replace <HDFS_file_path> with the path to the file in HDFS whose contents you
want to view.
• Example: hadoop fs -cat /user/saurzcode/dir1/abc.txt5
• This command will display the contents of the file named abc.txt5 located in the
/user/saurzcode/dir1/ directory in HDFS.

Conclusion:
In this practical, we learned how to view the contents of a file in the Hadoop Distributed File
System (HDFS) using the hadoop fs -cat command. This command is similar to the cat
command in Unix/Linux and allows you to quickly view the contents of a file stored in HDFS.
Being able to view file contents is essential for data inspection and debugging purposes in
Hadoop environments.

6 | SHIV SHEKHAR JHA

00276807721
EXPERIMENT 5
Copy a file from Source to Destination

Aim: The aim of this practical is to demonstrate how to copy files from a source location to a
destination location in the Hadoop Distributed File System (HDFS) using the hadoop fs -cp
command.
Procedure:
1. Copying a File from Source to Destination:
• Use the hadoop fs -cp command to copy a file from a source location to a
destination location in HDFS.
• Syntax: hadoop fs -cp <source_path> <destination_path>
• Replace <source_path> with the path to the source file in HDFS.
• Replace <destination_path> with the path to the destination location in HDFS.
• Example: hadoop fs -cp /user/saurzcode/dir1/abc.txt /user/saurzcode/dir2
• This command will copy the file abc.txt from the /user/saurzcode/dir1/
directory to the /user/saurzcode/dir2/ directory in HDFS.

Conclusion:
In this practical, we learned how to copy files from a source location to a destination location
in the Hadoop Distributed File System (HDFS) using the hadoop fs -cp command. This
command is useful for moving files within HDFS and organizing data in the Hadoop cluster.
Mastering file copying operations in HDFS is essential for efficient data management and
workflow in Hadoop environments.

7 | SHIV SHEKHAR JHA

00276807721
EXPERIMENT 6
Remove a File or Directory in HDFS

Aim: The aim of this practical is to demonstrate how to remove files or directories from the
Hadoop Distributed File System (HDFS) using the hadoop fs -rm command for files and
hadoop fs -rmr command for directories.
Procedure:
1. Removing a File:
• Use the hadoop fs -rm command to remove a file from HDFS.
• Syntax: hadoop fs -rm <file_path>
• Replace <file_path> with the path to the file you want to remove from HDFS.
• Example: hadoop fs -rm /user/saurzcode/dir1/abc.txt
• This command will remove the file abc.txt from the /user/saurzcode/dir1/
directory in HDFS.
2. Removing a Directory (Recursive):
• Use the hadoop fs -rmr command to remove a directory and its contents recursively
from HDFS.
• Syntax: hadoop fs -rmr <directory_path>
• Replace <directory_path> with the path to the directory you want to remove from
HDFS.
• Example: hadoop fs -rmr /user/saurzcode/dir1/
• This command will remove the directory /user/saurzcode/dir1/ and all its contents
recursively from HDFS.

Conclusion:
In this practical, we learned how to remove files or directories from the Hadoop Distributed
File System (HDFS) using the hadoop fs -rm command for files and hadoop fs -rmr command
for directories. These commands are essential for managing data in HDFS and maintaining
the organization of files and directories within the Hadoop cluster. Understanding how to
remove files and directories safely is crucial to avoid unintended data loss in Hadoop
environments.

8 | SHIV SHEKHAR JHA

00276807721
EXPERIMENT 7
Copy a file from Source to Destination

9 | SHIV SHEKHAR JHA

00276807721
EXPERIMENT 8
Move file from Source to Destination
Note: - Moving files across filesystem is not permitted.
Aim: The aim of this practical is to demonstrate how to move files from a source location to
a destination location in the Hadoop Distributed File System (HDFS) using the hadoop fs -mv
command.
Procedure:
1. Moving a File from Source to Destination:
• Use the hadoop fs -mv command to move a file from a source location to a
destination location in HDFS.
• Syntax: hadoop fs -mv <source_path> <destination_path>
• Replace <source_path> with the path to the source file in HDFS.
• Replace <destination_path> with the path to the destination location in HDFS.
• Example: hadoop fs -mv /user/saurzcode/dir1/abc.txt /user/saurzcode/dir2
• This command will move the file abc.txt from the /user/saurzcode/dir1/
directory to the /user/saurzcode/dir2/ directory in HDFS.

Conclusion:
In this practical, we learned how to move files from a source location to a destination location
in the Hadoop Distributed File System (HDFS) using the hadoop fs -mv command. Moving
files in HDFS allows you to reorganize data within the Hadoop cluster efficiently. Mastering
file moving operations in HDFS is essential for maintaining data organization and managing
workflows in Hadoop environments.

10 | S H I V SHEKHAR JHA
00276807721
EXPERIMENT 9
Display the Aggregate Length of a File

Aim: The aim of this practical is to demonstrate how to display the aggregate length of a file
in the Hadoop Distributed File System (HDFS) using the hadoop fs -du command.
Procedure:
1. Displaying Aggregate Length of a File:
• Use the hadoop fs -du command to display the aggregate length of a file in HDFS.
• Syntax: hadoop fs -du <file_path>
• Replace <file_path> with the path to the file in HDFS for which you want to
display the aggregate length.
• Example: hadoop fs -du /user/saurzcode/dir1/abc.txt
• This command will display the aggregate length of the file abc.txt located in the
/user/saurzcode/dir1/ directory in HDFS.

Conclusion:
In this practical, we learned how to display the aggregate length of a file in the Hadoop
Distributed File System (HDFS) using the hadoop fs -du command. This command provides
information about the total length occupied by the specified file in HDFS. Understanding how
to retrieve file size information is essential for managing and monitoring data storage in
Hadoop environments

11 | S H I V SHEKHAR JHA
00276807721
EXPERIMENT 10
Implement a Program of Word Count Map Reduce program to understand
Map Reduce Paradigm

Program-

import java.io.IOException;
import java.util.StringTokenizer;
import org.apache.Hadoop.io.IntWritable;
import org.apache.Hadoop.io.LongWritable;
import org.apache.Hadoop.io.Text;
import org.apache.Hadoop.mapreduce.Mapper;
import org.apache.Hadoop.mapreduce.Reducer;
import org.apache.Hadoop.conf.Configuration;
import org.apache.Hadoop.mapreduce.Job;
import org.apache.Hadoop.mapreduce.lib.input.TextInputFormat;
import org.apache.Hadoop.mapreduce.lib.output.TextOutputFormat;
import org.apache.Hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.Hadoop.mapreduce.lib.output.FileOutputFormat;
import org.apache.Hadoop.fs.Path;

public class WordCount

{
public static class Map extends Mapper<LongWritable,Text,text,Inkwritable>
{
public void map(LongWritable key, Text value,Context context) throws
IOException,InterruptedException
{
String line = value.toString();
StringTokenizer tokenizer = new StringTokenizer(line);
while (tokenizer.hasMoreTokens())
{
value.set(tokenizer.nextToken());
context.write(value, new IntWritable(1));
}
}
}
public static class Reduce extends Reducer
{
public void reduce(Text key, Iterable values,Context context) throws
IOException,InterruptedException
{
12 | S H I V SHEKHAR JHA
00276807721
int sum=0;for(IntWritable x: values)
{
sum+=x.get();
}
context.write(key, new IntWritable(sum));
}
}
public static void main(String[] args) throws Exception
{
Configuration conf= new Configuration();
Job job = new Job(conf,"My Word Count Program");
job.setJarByClass(WordCount.class);
job.setMapperClass(Map.class);
job.setReducerClass(Reduce.class);
job.setOutputKeyClass(Text.class);
job.setOutputValueClass(IntWritable.class);
job.setInputFormatClass(TextInputFormat.class);
job.setOutputFormatClass(TextOutputFormat.class);
Path outputPath = new Path(args[1]);

//Configuring the input/output path from the filesystem into the job
FileInputFormat.addInputPath(job, new Path(args[0]));
FileOutputFormat.setOutputPath(job, new Path(args[1]));

//deleting the output path automatically from hdfs so that we don't have to delete it
explicitly
outputPath.getFileSystem(conf).delete(outputPath); //exiting the job only if the flag value
becomes false
System.exit(job.waitForCompletion(true) ? 0 : 1);
}
}

The entire MapReduce program can be fundamentally divided into three parts:
• Mapper Phase Code
• Reducer Phase Code
• Driver Code
We will understand the code for each of these three parts sequentially.

Mapper code:
public static class Map extends Mapper { public void map(LongWritable key, Text value,
Context context) throws IOException,InterruptedException
{
String line = value.toString();
StringTokenizer tokenizer = new StringTokenizer(line);
13 | S H I V SHEKHAR JHA
00276807721
while (tokenizer.hasMoreTokens())
{
value.set(tokenizer.nextToken());
context.write(value, new IntWritable(1));
}
• We have created a class Map that extends the class Mapper which is already defined in the
MapReduce Framework
• We define the data types of input and output key/value pair after the class declaration
using angle brackets.
• Both the input and output of the Mapper is a key/value pair.
• Input:
◦ The key is nothing but the offset of each line in the text file:LongWritable
◦ The value is each individual line (as shown in the figure at the right): Text• Output:
◦ The key is the tokenized words: Text ◦ We have the hardcoded value in our case
which is
1: IntWritable
◦ Example – Dear 1, Bear 1, etc.
• We have written a java code where we have tokenized each word and assigned them a
hardcoded value equal to 1.

Reducer Code:
public static class Reduce extends Reducer
{
public void reduce(Text key, Iterable values,Context context) throws
IOException,InterruptedException
{
int sum=0; for(IntWritable x: values)
{
sum+=x.get();
}
context.write(key, new IntWritable(sum));
}
}
• We have created a class Reduce which extends class Reducer like that of Mapper.
• We define the data types of input and output key/value pair after the class declaration
using angle brackets as done for Mapper.
• Both the input and the output of the Reducer is a keyvalue pair.
• Input:
◦ The key nothing but those unique words which have been generated after the sorting
14 | S H I V SHEKHAR JHA
00276807721
and shuffling phase: Text
◦ The value is a list of integers corresponding to each key: IntWritable
◦ Example – Bear, [1, 1], etc.

Output:
◦ The key is all the unique words present in the input text file: Text
◦ The value is the number of occurrences of each of the unique words: IntWritable
◦ Example – Bear, 2; Car, 3, etc.
• We have aggregated the values present in each of the list corresponding to each key and
produced the final answer.
• In general, a single reducer is created for each of the unique words, but, you can specify
the number of reducer in map red-site.xml

Driver Code:
Configuration conf= new Configuration ();
Job job = new Job(conf,"My Word Count Program");
job.setJarByClass(WordCount.class);
job.setMapperClass(Map.class);
job.setReducerClass(Reduce.class);
job.setOutputKeyClass(Text.class);
job.setOutputValueClass(IntWritable.class);
job.setInputFormatClass(TextInputFormat.class);
job.setOutputFormatClass(TextOutputFormat.class);
Path outputPath = new Path(args[1]);

//Configuring the input/output path from the filesystem into the job
FileInputFormat.addInputPath(job, new Path(args[0]));
FileOutputFormat.setOutputPath(job, new Path(args[1]));
1. In the driver class, we set the configuration of our MapReduce job to run in Hadoop.
2. We specify the name of the job , the data type of input/ output of the mapper and
reducer.
3. We also specify the names of the mapper and reducer classes.
4. The path of the input and output folder is also specified.

15 | S H I V SHEKHAR JHA
00276807721
5. The method setInputFormatClass () is used for specifying that how a Mapper will read
the input data or what will be the unit of work. Here, we have chosen TextInputFormat
so that single line is read by the mapper at a time from the input text file.
6. The main () method is the entry point for the driver. In this method, we instantiate a
new Configuration object for the job.

Run the MapReduce Code:

The command for running a MapReduce code is:
Hadoop jar Hadoop-mapreduce-example.jar WordCount / sample/input /sample/output

16 | S H I V SHEKHAR JHA
00276807721

CCS334 Bda Lab Manual
No ratings yet
CCS334 Bda Lab Manual
48 pages
Ccs334 Bda Lab Manual PRINT
No ratings yet
Ccs334 Bda Lab Manual PRINT
53 pages
Ccs 334 Bigdata Manual
No ratings yet
Ccs 334 Bigdata Manual
45 pages
Hadoop Command Line Interface
No ratings yet
Hadoop Command Line Interface
10 pages
Bigdatamanualfinal 231019063224 d211cb48
No ratings yet
Bigdatamanualfinal 231019063224 d211cb48
45 pages
Ai&Ml (Bdamanual)
No ratings yet
Ai&Ml (Bdamanual)
24 pages
BDA Lab Manual
No ratings yet
BDA Lab Manual
26 pages
Cloud PDF
No ratings yet
Cloud PDF
47 pages
Bda Practical File
No ratings yet
Bda Practical File
28 pages
BigData Lab Manual
No ratings yet
BigData Lab Manual
44 pages
Big Data Analytics lab-JD
No ratings yet
Big Data Analytics lab-JD
49 pages
Labs Hadoop1
No ratings yet
Labs Hadoop1
9 pages
Big Data
No ratings yet
Big Data
23 pages
Ccs334 Bda Lab Ex
No ratings yet
Ccs334 Bda Lab Ex
45 pages
Bigdata Lab File
No ratings yet
Bigdata Lab File
20 pages
BDA-Lab Record
No ratings yet
BDA-Lab Record
43 pages
Experiment No 1
No ratings yet
Experiment No 1
13 pages
Mini Project Iot
No ratings yet
Mini Project Iot
43 pages
BDC Output 2
No ratings yet
BDC Output 2
4 pages
Lab Manual
No ratings yet
Lab Manual
34 pages
Card Replacement Procedures Vol 2 of 7 PDF
100% (1)
Card Replacement Procedures Vol 2 of 7 PDF
788 pages
Unit 2-HDFS SGS
No ratings yet
Unit 2-HDFS SGS
29 pages
Hadoop 1
No ratings yet
Hadoop 1
15 pages
PDC All Labs
100% (1)
PDC All Labs
129 pages
579 BDA Week-03
No ratings yet
579 BDA Week-03
2 pages
BIG Data File
No ratings yet
BIG Data File
28 pages
1498770-40 Service Manual 4-Series
No ratings yet
1498770-40 Service Manual 4-Series
58 pages
CCS334-BDA LAB MANUAL Final
No ratings yet
CCS334-BDA LAB MANUAL Final
46 pages
Big Data Record 2024-25
No ratings yet
Big Data Record 2024-25
46 pages
Apache Hadoop
No ratings yet
Apache Hadoop
3 pages
Bigdatamanual
No ratings yet
Bigdatamanual
45 pages
Bda Manual
No ratings yet
Bda Manual
33 pages
Bdafile
No ratings yet
Bdafile
9 pages
H3C UniServer R4900 G3用户指南
No ratings yet
H3C UniServer R4900 G3用户指南
315 pages
Practical 8
No ratings yet
Practical 8
5 pages
Big Datalab
No ratings yet
Big Datalab
4 pages
Big Data
No ratings yet
Big Data
28 pages
Big Data & Analytics Lab Manual
No ratings yet
Big Data & Analytics Lab Manual
51 pages
Qshell - Iseries
No ratings yet
Qshell - Iseries
226 pages
Big Data Analytics Lab Experiments
No ratings yet
Big Data Analytics Lab Experiments
16 pages
Week 1 in Terminal
No ratings yet
Week 1 in Terminal
10 pages
Course: Big Data Analytics Lab Scheme: 2017
No ratings yet
Course: Big Data Analytics Lab Scheme: 2017
25 pages
Big Data - ASSIGNMENT 3
No ratings yet
Big Data - ASSIGNMENT 3
2 pages
Hadoop Commands Only
No ratings yet
Hadoop Commands Only
19 pages
Moxa AWK-5232 User Manual
No ratings yet
Moxa AWK-5232 User Manual
81 pages
@bigdatalabfile 09
No ratings yet
@bigdatalabfile 09
35 pages
Data Storage Data Processing: Hadoop Distributed File System (HDFS) Mapreduce
No ratings yet
Data Storage Data Processing: Hadoop Distributed File System (HDFS) Mapreduce
35 pages
Dsa Practical File
No ratings yet
Dsa Practical File
16 pages
BCS502 Module 2 Word
No ratings yet
BCS502 Module 2 Word
43 pages
Solplanet-ASW WiFi-configuration EN 202011-2
No ratings yet
Solplanet-ASW WiFi-configuration EN 202011-2
6 pages
HDFS
No ratings yet
HDFS
6 pages
CC Hadoop Lab
No ratings yet
CC Hadoop Lab
6 pages
Deepshikha Agrawal Pushp B.Sc. (IT), MBA (IT) Certification-Hadoop, Spark, Scala, Python, Tableau, ML (Assistant Professor JLBS)
No ratings yet
Deepshikha Agrawal Pushp B.Sc. (IT), MBA (IT) Certification-Hadoop, Spark, Scala, Python, Tableau, ML (Assistant Professor JLBS)
74 pages
HDFS (Hadoop Distributed File System) : HDFS Architecture Components of The Architecture
No ratings yet
HDFS (Hadoop Distributed File System) : HDFS Architecture Components of The Architecture
10 pages
Unit - 5 Iot
No ratings yet
Unit - 5 Iot
33 pages
HDFS Commands
No ratings yet
HDFS Commands
7 pages
HOL - Exploring HDFS
No ratings yet
HOL - Exploring HDFS
6 pages
DA Lab Program-1
No ratings yet
DA Lab Program-1
3 pages
Voice Record Pro
0% (1)
Voice Record Pro
2 pages
Extracting Real Value From Your Data With Apache Hadoop: Sarah Sproehnle
No ratings yet
Extracting Real Value From Your Data With Apache Hadoop: Sarah Sproehnle
51 pages
Programming Lang Processing
No ratings yet
Programming Lang Processing
70 pages
Extreme Computing Lab Exercises Session One: 1 Getting Started
No ratings yet
Extreme Computing Lab Exercises Session One: 1 Getting Started
6 pages
Hadoop Commands
100% (1)
Hadoop Commands
6 pages
Homework Labs Lecture01
No ratings yet
Homework Labs Lecture01
9 pages
Hands On
No ratings yet
Hands On
26 pages
How To Set Up A Hadoop Cluster in Docker
No ratings yet
How To Set Up A Hadoop Cluster in Docker
13 pages
Application Development With Android Operating System
No ratings yet
Application Development With Android Operating System
47 pages
Layers of OSI Model - GeeksforGeeks
No ratings yet
Layers of OSI Model - GeeksforGeeks
9 pages
Create A Directory in HDFS at Given Path(s) .: Upload
No ratings yet
Create A Directory in HDFS at Given Path(s) .: Upload
11 pages
MAS MC 100 120 Leafleat
No ratings yet
MAS MC 100 120 Leafleat
8 pages
2 - Mobile Backhauling (FTTM)
No ratings yet
2 - Mobile Backhauling (FTTM)
29 pages
HDFS File System Shell Guide
No ratings yet
HDFS File System Shell Guide
10 pages
Disk Partitioning - Wikipedia
No ratings yet
Disk Partitioning - Wikipedia
24 pages
Comparison Table Between Programming Language and Scripting Language
No ratings yet
Comparison Table Between Programming Language and Scripting Language
8 pages
COA Home Assignment-1
No ratings yet
COA Home Assignment-1
5 pages
VPC Configurator Software: User Manual
No ratings yet
VPC Configurator Software: User Manual
17 pages
Power Water Cybersecurity Suite Configuration Management en 326846
No ratings yet
Power Water Cybersecurity Suite Configuration Management en 326846
2 pages
zt400 Series Rfid Specification Sheet en Us
No ratings yet
zt400 Series Rfid Specification Sheet en Us
4 pages
Features: MINIPLEX Transponders With ES-PS Power Supplies
No ratings yet
Features: MINIPLEX Transponders With ES-PS Power Supplies
12 pages
A3977 Datasheet
No ratings yet
A3977 Datasheet
17 pages
Packet Scheduling in Multipath TCP Fundamentals Lessons and Opportunities
No ratings yet
Packet Scheduling in Multipath TCP Fundamentals Lessons and Opportunities
13 pages
Software Requirements Specification: Prepared by
No ratings yet
Software Requirements Specification: Prepared by
7 pages
Pa2423l Brief
No ratings yet
Pa2423l Brief
1 page
Unit 6 - Week 5: Metal-Semiconductor Junctions: Assignment 5
No ratings yet
Unit 6 - Week 5: Metal-Semiconductor Junctions: Assignment 5
4 pages
SSC CGL Computer Knowledge Mock-1
No ratings yet
SSC CGL Computer Knowledge Mock-1
6 pages
Custom Iw 106: Product Specification Sheet
No ratings yet
Custom Iw 106: Product Specification Sheet
1 page
Hadoop实际解决方案手册: Chinese Edition
From Everand
Hadoop实际解决方案手册: Chinese Edition
Posts & Telecom Press
No ratings yet
Mastering Data Engineering: Advanced Techniques with Apache Hadoop and Hive
From Everand
Mastering Data Engineering: Advanced Techniques with Apache Hadoop and Hive
Peter Jones
No ratings yet
Big Data Analytics
From Everand
Big Data Analytics
Nitin Kumar Yadav
No ratings yet

SSJ Bda File

Uploaded by

SSJ Bda File

Uploaded by

Guru Tegh Bahadur Institute of Technology

Name SHIV SHEKHAR JHA

1 | SHIV SHEKHAR JHA

S.No. EXPERIMENT Date Signature

1. How to Use Hadoop Cluster

2. Create a directory in HDFS at given path(s)

3. Upload and download a file in HDFS.

See contents of a file Same as UNIX CAT

5. Copy a file from Source to Destination

6. Remove a File or Directory in HDFS

7. Copy a file from Source to Destination

8. Move file from Source to Destination

9. Display the Aggregate Length of a File

Implement a Program of Word Count Map

2 | SHIV SHEKHAR JHA

3 | SHIV SHEKHAR JHA

4 | SHIV SHEKHAR JHA

5 | SHIV SHEKHAR JHA

6 | SHIV SHEKHAR JHA

7 | SHIV SHEKHAR JHA

8 | SHIV SHEKHAR JHA

9 | SHIV SHEKHAR JHA

public class WordCount

Run the MapReduce Code:

You might also like