
1. MapReduce Program for Matrix Multiplication

Aim:

To implement matrix multiplication using MapReduce in Hadoop.

Procedure:

1. Prepare two input matrices in HDFS with the format <MatrixName,Row,Column,Value> (a sample fragment is shown after this list).

2. Write a Mapper to emit intermediate key-value pairs for matrix entries.

3. Write a Reducer to calculate partial products and sum them up for final results.

4. Execute the MapReduce job using Hadoop.
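
For illustration, the step 1 input for hypothetical 2x2 matrices A and B would contain lines such as:

A,0,0,1.0
A,0,1,2.0
B,0,0,3.0
B,1,1,4.0

The program below hard-codes the dimension as 10x10; adjust the constant for other sizes.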

Program:

import java.io.IOException;

import org.apache.hadoop.io.*;
import org.apache.hadoop.mapreduce.*;

public class MatrixMultiplication {

    // Dimension of the (square) matrices; the example assumes 10x10.
    private static final int N = 10;

    public static class MatrixMapper extends Mapper<LongWritable, Text, Text, Text> {

        public void map(LongWritable key, Text value, Context context)
                throws IOException, InterruptedException {
            // Each input line has the form: MatrixName,Row,Column,Value
            String[] elements = value.toString().split(",");
            String matrix = elements[0];
            String row = elements[1];
            String column = elements[2];
            String val = elements[3];

            if (matrix.equals("A")) {
                // A(i,j) contributes to every cell (i,k) of the product.
                for (int k = 0; k < N; k++) {
                    context.write(new Text(row + "," + k), new Text("A," + column + "," + val));
                }
            } else {
                // B(j,k) contributes to every cell (i,k) of the product.
                for (int i = 0; i < N; i++) {
                    context.write(new Text(i + "," + column), new Text("B," + row + "," + val));
                }
            }
        }
    }

    public static class MatrixReducer extends Reducer<Text, Text, Text, Text> {

        public void reduce(Text key, Iterable<Text> values, Context context)
                throws IOException, InterruptedException {
            double[] A = new double[N];
            double[] B = new double[N];

            // Collect the row of A and the column of B that meet at this
            // cell, indexed by the shared inner dimension j.
            for (Text value : values) {
                String[] elements = value.toString().split(",");
                if (elements[0].equals("A")) {
                    A[Integer.parseInt(elements[1])] = Double.parseDouble(elements[2]);
                } else {
                    B[Integer.parseInt(elements[1])] = Double.parseDouble(elements[2]);
                }
            }

            // The cell value is the dot product of the collected entries.
            double sum = 0;
            for (int j = 0; j < N; j++) {
                sum += A[j] * B[j];
            }
            context.write(key, new Text(String.valueOf(sum)));
        }
    }
}

Output:

The output is the product matrix, one line per cell: a (row,column) key followed by the cell value, stored in the HDFS output directory.

Result:

Matrix multiplication was successfully implemented using MapReduce.


2. Hive Commands

Aim:

To execute Hive commands for importing, distributing, sorting, clustering, and exporting data.

Procedure:

1. Start the Hive environment.

2. Execute commands step by step for importing, distributing, sorting, clustering, and exporting.

Commands and Output:

1. IMPORT:

IMPORT TABLE table_name FROM 'hdfs_path';

Output: Data is imported into the Hive table.

2. DISTRIBUTE BY:

SELECT * FROM table_name DISTRIBUTE BY column_name;


Output: Rows with the same value in the column are routed to the same reducer (partition).

3. EXPORT:

EXPORT TABLE table_name TO 'hdfs_path';

Output: Data is exported to the specified HDFS path.

4. SORT BY:

SELECT * FROM table_name SORT BY column_name;

Output: Data is sorted by the column within each reducer; unlike ORDER BY, SORT BY does not guarantee a global order.

5. CLUSTER BY:

SELECT * FROM table_name CLUSTER BY column_name;

Output: Rows are distributed and sorted by the column; CLUSTER BY column_name is shorthand for DISTRIBUTE BY column_name combined with SORT BY column_name.
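
As a hypothetical combined example (the employees table and its columns are assumed), the following routes rows to reducers by department and sorts each reducer's rows by salary:

SELECT * FROM employees DISTRIBUTE BY dept SORT BY salary DESC;

With matching columns and ascending order, this pair collapses to CLUSTER BY dept.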

Result:

Hive commands were executed successfully.


3. Hadoop File Management Commands

Aim:

To manage files in HDFS using Hadoop commands.

Procedure:

1. Use the Hadoop file system commands to create directories, add files, and list their contents.

Commands:

# Create a directory in HDFS

hadoop fs -mkdir /user/mydir

# Upload a file to HDFS

hadoop fs -put localfile.txt /user/mydir


# List the contents of a directory

hadoop fs -ls /user/mydir

# Delete a file

hadoop fs -rm /user/mydir/localfile.txt

Output:

1. Directory created.

2. File uploaded.

3. Contents listed.

4. File deleted.
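
Beyond these four operations, two related commands are often useful (paths illustrative):

# Display the contents of a file
hadoop fs -cat /user/mydir/localfile.txt

# Copy a file from HDFS back to the local file system
hadoop fs -get /user/mydir/localfile.txt .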

Result:

File management tasks were successfully performed in HDFS.


4. MapReduce Program for Word Count

Aim:

To count the number of words in a text file using MapReduce.

Procedure:

1. Write a Mapper to emit (word, 1) pairs for each word.

2. Write a Reducer to sum up counts for each word.

3. Execute the program on input text files stored in HDFS.

Program:

import java.io.IOException;

import org.apache.hadoop.io.*;
import org.apache.hadoop.mapreduce.*;

public class WordCount {

    public static class TokenizerMapper extends Mapper<Object, Text, Text, IntWritable> {

        private final static IntWritable one = new IntWritable(1);
        private Text word = new Text();

        public void map(Object key, Text value, Context context)
                throws IOException, InterruptedException {
            // Split each line on whitespace and emit (word, 1) per word.
            String[] words = value.toString().split("\\s+");
            for (String w : words) {
                if (!w.isEmpty()) {   // split can yield an empty leading token
                    word.set(w);
                    context.write(word, one);
                }
            }
        }
    }

    public static class IntSumReducer extends Reducer<Text, IntWritable, Text, IntWritable> {

        public void reduce(Text key, Iterable<IntWritable> values, Context context)
                throws IOException, InterruptedException {
            // Add up all the 1s emitted for this word.
            int sum = 0;
            for (IntWritable val : values) {
                sum += val.get();
            }
            context.write(key, new IntWritable(sum));
        }
    }
}
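
To satisfy step 3 of the procedure, the job also needs a driver. A minimal sketch, assuming the enclosing class above is named WordCount and that input and output paths arrive as command-line arguments:

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

// Add inside the WordCount class.
public static void main(String[] args) throws Exception {
    Configuration conf = new Configuration();
    Job job = Job.getInstance(conf, "word count");
    job.setJarByClass(WordCount.class);
    job.setMapperClass(TokenizerMapper.class);
    job.setCombinerClass(IntSumReducer.class);   // safe here: addition is associative
    job.setReducerClass(IntSumReducer.class);
    job.setOutputKeyClass(Text.class);
    job.setOutputValueClass(IntWritable.class);
    FileInputFormat.addInputPath(job, new Path(args[0]));
    FileOutputFormat.setOutputPath(job, new Path(args[1]));
    System.exit(job.waitForCompletion(true) ? 0 : 1);
}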

Output:

The output is a list of words with their respective counts stored in HDFS.

Result:

Word count was successfully implemented using MapReduce.

5. Download and Install HBase with Start-Up Scripts

Aim:

To install and configure HBase.

Procedure:

1. Download HBase from the official Apache HBase website.


2. Extract HBase and configure hbase-site.xml for ZooKeeper and HDFS (a sample configuration is shown after this list).

3. Start HBase services.
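
For step 2, a minimal hbase-site.xml sketch for a pseudo-distributed setup; the hostname, port, and paths are assumptions and must match the local Hadoop installation:

<configuration>
  <!-- Store HBase data in HDFS; the URI must match fs.defaultFS -->
  <property>
    <name>hbase.rootdir</name>
    <value>hdfs://localhost:9000/hbase</value>
  </property>
  <!-- Run the master and region server as separate processes -->
  <property>
    <name>hbase.cluster.distributed</name>
    <value>true</value>
  </property>
  <!-- ZooKeeper ensemble used by HBase -->
  <property>
    <name>hbase.zookeeper.quorum</name>
    <value>localhost</value>
  </property>
</configuration>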

Commands:

# Download and extract HBase

wget https://archive.apache.org/dist/hbase/X.X.X/hbase-X.X.X-bin.tar.gz

tar -xzvf hbase-X.X.X-bin.tar.gz

cd hbase-X.X.X

# Start HBase

bin/start-hbase.sh

Output:

HBase is installed and running.

Result:

HBase was successfully downloaded, installed, and started.


6. HBase Commands

Aim:

To create, manipulate, and delete tables in HBase.

Procedure:

1. Start the HBase shell.

2. Execute commands for table creation, data insertion, and retrieval.

Commands:

1. Create Table:

create 'customer', 'info'

2. Insert Data:

put 'customer', '1', 'info:name', 'John'

3. Get Data:

get 'customer', '1'

4. Delete Data:

delete 'customer', '1', 'info:name'
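
5. Delete Table (the aim also covers deleting tables; in the HBase shell a table must be disabled before it can be dropped):

disable 'customer'
drop 'customer'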

Output:

The commands create the table, insert, retrieve, and delete data, and finally drop the table.

Result:

HBase operations were successfully performed.

7. MapReduce Program for Counting Characters

Aim:

To count the number of characters in a text file using MapReduce.


Procedure:

1. Write a Mapper to emit (character, 1) pairs for each character.

2. Write a Reducer to sum up counts for each character.

Program:

Similar to word count, but split the input into characters instead of words.
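
A minimal mapper sketch, assuming the same class structure and driver as the word count program (CharMapper is an illustrative name; the reducer is the unchanged IntSumReducer):

public static class CharMapper extends Mapper<Object, Text, Text, IntWritable> {

    private final static IntWritable one = new IntWritable(1);
    private Text character = new Text();

    public void map(Object key, Text value, Context context)
            throws IOException, InterruptedException {
        // Emit (character, 1) for every character in the line.
        for (char c : value.toString().toCharArray()) {
            character.set(String.valueOf(c));
            context.write(character, one);
        }
    }
}

Output:

The output is a list of characters with their respective counts stored in HDFS.

Result:

Character counting was successfully implemented using MapReduce.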

8. Hive DML Commands

Aim:

To execute Hive DML commands such as INSERT, SELECT, and DELETE.

Procedure:

1. Use Hive to create and manipulate tables.


Commands:

1. INSERT:

INSERT INTO TABLE student VALUES (1, 'John', 'CS');

2. SELECT:

SELECT * FROM student;

3. DELETE:

DELETE FROM student WHERE id=1;

Note: In Hive, DELETE (and UPDATE) run only on transactional (ACID) tables.
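
A sketch of a table definition that supports all three statements, assuming Hive ACID transactions are enabled (the schema is inferred from the INSERT above):

CREATE TABLE student (id INT, name STRING, dept STRING)
STORED AS ORC
TBLPROPERTIES ('transactional'='true');

Result:

Hive DML commands were executed successfully.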
