0% found this document useful (0 votes)
3 views14 pages

CA01

Uploaded by

syednavedanwar99
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
3 views14 pages

CA01

Uploaded by

syednavedanwar99
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 14

Name – Naved Anwar

Reg.no - 12006158

COURSE CODE-312

ASSIGNMENT – 01

1. Write the Hadoop commands for the following:


a. How do I check the Hadoop version?
SOLUTION- HADOOP VERSION

b. use Hadoop to join two large datasets stored in HDFS


efficiently. Provide the sequence of Hadoop commands
and map-reduce job configuration to perform the join.
SOLUTION-- = Create two folders to store the datasets
as dataset1 & dataset2 = hadoop fs - mkdir /dataset1
hadoop fs -mkdir /dataset2 hadoop fs -ls / (to list out
the folders present in the directory

Copy the datasets into HDFS = hadoop fs -put "D:/New folder (2)/Downloads\archive
(9)/recipe_ingredient_assignments.csv" /dataset1 hadoop fs -put "D:/New folder (2)/Downloads\archive
(9)/recipes.csv" /dataset2 hadoop fs -ls /dataset1 hadoop fs -ls /dataset2
c. Append 2 text files in third text file and save the result
file on desktop

2. You need to run a MapReduce job to count the frequency


of words in a large text document. Walk through the
process, from writing the Map and Reduce functions to
running the job using Hadoop commands.
3. Apply all the cases of test command and stat command
on any data present in csv types of file and then check the
disk usage.
4. Write hive queries for the following:
a. Create a database

b. Under the same database create 3 tables regarding


bank
> ,
c. Load the external data in the tables
d. Show the data inserted in all the tables
5. Write hive queries for following:
a. Show the usage of all the arithmetic operators on
any data
b. Show the usage of all the relational operators on any
data
c. Show the usage of all the comparison operators on
any data
6.create a student database along with data and perform hive
partioning and bucketing on it .

You might also like