Week 1 Assignment Solution

This document provides solutions to six questions on HDFS and Linux commands: creating directories and files, copying files between the local and HDFS filesystems, sorting and filtering file listings, concatenating files, and merging HDFS files into a local file. The solutions demonstrate HDFS file operations such as mkdir, touch, ls, put, mv, and rm, along with Linux commands such as cat, tail, grep, and wc for creating and inspecting files. The final question uses the hadoop fs -getmerge command to merge multiple HDFS files into a single local file and display the merged contents.

Uploaded by

pali.rajtrader
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
42 views

Week 1 Assignment Solution

1. The document provides solutions to 6 questions about using HDFS and Linux commands related to creating directories, files, copying files between local and HDFS filesystems, sorting and filtering file listings, concatenating files, and merging HDFS files into a local file. 2. The solutions demonstrate commands for HDFS file operations like mkdir, touch, ls, cat, tail, grep, put, get, mv, and rm as well as Linux commands for creating and editing files. 3. The final question involves using the hadoop fs getmerge command to merge multiple files from HDFS into a single local file and display the merged contents.

Uploaded by

pali.rajtrader
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 6

1

Assignment Solution
Week 1: Getting Started with Big Data and Understanding HDFS Concepts along with Linux Commands


Week 1 Assignment Solutions

Note:

Total Marks: 50
Each subpart carries 2 marks.

Question 1) (18 Marks)

1. Create two directories ‘dir1’ and ‘dir2’ using a single HDFS command inside the home directory of HDFS in the Cloudera VM. dir2 should be a subdirectory of dir1.
2. Verify that the two folders have been created in the above path.
3. Inside dir2, create an empty file, file1.txt.
4. Create a file file2.txt in the local filesystem with some text inside it.
5. Copy file2.txt from local to HDFS inside dir2.
6. List the subdirectories and files inside dir1 recursively.
7. List the files inside dir2, sorted by size, with sizes displayed in KBs/MBs rather than bytes.
8. Rename the file file2.txt to file3.txt.
9. Remove the directory dir1 using a single command.

Solution:

1) hadoop fs -mkdir -p /user/cloudera/dir1/dir2

2) hadoop fs -ls /user/cloudera
hadoop fs -ls /user/cloudera/dir1

3) hadoop fs -touchz /user/cloudera/dir1/dir2/file1.txt

4) gedit ./Desktop/file2.txt (type some text and save)

5) hadoop fs -put /home/cloudera/Desktop/file2.txt /user/cloudera/dir1/dir2/


6) hadoop fs -ls -R /user/cloudera/dir1

7) hadoop fs -ls -S -h /user/cloudera/dir1/dir2/

8) hadoop fs -mv /user/cloudera/dir1/dir2/file2.txt /user/cloudera/dir1/dir2/file3.txt

9) hadoop fs -rm -R /user/cloudera/dir1
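Note: if no GUI editor is available (for example, when connected over SSH), step 4 can be done from the shell instead of gedit. This is an equivalent sketch, not part of the graded answer:

echo "this is some text for file2" > /home/cloudera/Desktop/file2.txt

Also, hadoop fs -copyFromLocal works interchangeably with hadoop fs -put for step 5, since the source here is a local file.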

Question 2) (2 Marks)

Suppose there is a file of size 514 MB stored in HDFS (Hadoop 2.x) using the default block size configuration and default replication factor. Then, how many blocks will be created in total, and what will be the size of each block?

Solution:

There will be 5 blocks created: 4 blocks of 128 MB each (the default block size in Hadoop 2.x) and 1 block of 2 MB, since 514 − 4 × 128 = 2. The default replication factor is 3, so considering replication, a total of 5 × 3 = 15 block replicas are stored.
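To see the block breakdown of a real file, the HDFS fsck utility reports files and their blocks. A sketch, where the path is a placeholder for whichever HDFS file you want to inspect:

hdfs fsck /user/cloudera/somefile.dat -files -blocks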

Question 3) (8 Marks)

1. Create a directory named ‘test’ inside the home directory of the local filesystem.
2. Create a few empty files inside the test directory, namely a.pdf, b.html, c.xml.
3. List the files in reverse alphabetical order of file name.
4. Display only the file which ends with the .html extension.

Solution:

i) mkdir test
ii) touch test/a.pdf test/b.html test/c.xml
iii) ls -lr test
iv) ls -l test | grep html OR ls -l test | grep "\.html$"
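A shell glob achieves the same filtering without grep; this is an alternative sketch rather than the expected answer:

ls -l test/*.html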


Question 4) (8 Marks)

1. Create two new text files, file1 and file2, with the following content using the cat command in your Linux home directory.
file1: This is from file1
file2: This is from file2
2. Display the contents of file1 and file2 using the cat command.
3. Concatenate the contents of the two files, put them into a new file file3, and display the results.
4. Count the number of lines and number of words in file3.

Solution:

i) cat > file1 (type the content, then press Ctrl+D to end input)

cat > file2

ii) cat file1
cat file2

Or

cat file1 file2

iii) cat file1 file2 > file3
cat file3

Or

cat file1 file2 >> file3 (note: >> appends, so this is equivalent only if file3 does not already exist)

iv) wc -l file3
wc -w file3

Or

wc -lw file3
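For a non-interactive version of step i) (no Ctrl+D needed, useful in scripts), a here-document supplies the content inline. A sketch:

cat > file1 <<'EOF'
This is from file1
EOF

cat > file2 <<'EOF'
This is from file2
EOF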


Question 5) (4 Marks)

Create a text file myfile.txt with 5 lines in the home directory of the local filesystem.

1. Display the last 3 lines of that file.
2. Display all lines of that text file except the first line.

Solution:

i) cat > myfile.txt (enter 5 lines, then press Ctrl+D)

tail -n3 myfile.txt

Or

tail -3 myfile.txt

ii) tail -n+2 myfile.txt
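sed and awk offer equivalent one-liners for skipping the first line; these are alternative sketches, not the expected answers:

sed -n '2,$p' myfile.txt
awk 'NR > 1' myfile.txt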

Question 6) (10 Marks)

The getmerge command in Hadoop merges files existing in the HDFS filesystem into a single file in the local filesystem.
1. Use the help for the getmerge command to see the arguments it takes.
2. Create file1.txt in local with contents "Hello, this is from file1". Create file2.txt in local with contents "Hello, this is from file2".
3. Copy file1.txt and file2.txt into a location inside the home directory in HDFS.
4. Use the getmerge command to merge the contents of the two files present in HDFS and put the merged content into a single local destination file named filenew.txt.
5. Display the merged contents of the file filenew.txt.

Solution:

1) hadoop fs -help getmerge

2) cd Desktop


gedit file1.txt
gedit file2.txt

3) hadoop fs -put file1.txt /user/cloudera/
hadoop fs -put file2.txt /user/cloudera/

4) hadoop fs -getmerge /user/cloudera/file1.txt /user/cloudera/file2.txt /home/cloudera/Desktop/filenew.txt

5) cat ./Desktop/filenew.txt
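getmerge also accepts an -nl option, which appends a newline after each merged file so the contents do not run together when a source file lacks a trailing newline. A sketch using the same paths as above:

hadoop fs -getmerge -nl /user/cloudera/file1.txt /user/cloudera/file2.txt /home/cloudera/Desktop/filenew.txt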

***********************************************************
