
DATA ANALYTICS LABORATORY (21CSL66)

3. IMPLEMENT AN MR PROGRAM THAT PROCESSES A WEATHER DATASET.

Steps to be followed:

• Step-1: Download the dataset from this Link, which provides data for various cities across different years. Choose a year of your choice and select any one of the data text files for analysis.

Information about the data format is given in the README.txt file available on the NCEI website.

• Step-2: Create a project in Eclipse with the following steps:

§ First open Eclipse → select File → New → Java Project → name it MyProject → select "Use an execution environment" → choose JavaSE-1.8 → Next → Finish.

§ In this project, create a Java class named MyMaxMin → then click Finish.

§ Copy the source code below into this MyMaxMin Java class.

// importing libraries
import java.io.IOException;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.input.TextInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;
import org.apache.hadoop.mapreduce.lib.output.TextOutputFormat;

public class MyMaxMin {

    // Mapper

    /* MaxTemperatureMapper is a static nested class that
     * extends the Mapper abstract class with four Hadoop
     * generic types: LongWritable, Text, Text, Text.
     */
    public static class MaxTemperatureMapper extends
            Mapper<LongWritable, Text, Text, Text> {

        // records carrying this value in our dataset
        // hold missing/inconsistent data
        public static final int MISSING = 9999;

        /**
         * @method map
         * Takes one record (line) of the input as text.
         * The date is read from characters 6 to 14, the
         * maximum temperature from characters 39 to 45, and
         * the minimum temperature from characters 47 to 53.
         * Records with temp_Max > 30 or temp_Min < 15 are
         * passed to the reducer.
         */
        @Override
        public void map(LongWritable key, Text value, Context context)
                throws IOException, InterruptedException {

            // convert the single row (record) to a String
            String line = value.toString();

            // check for an empty line
            if (line.length() != 0) {

                // characters 6 to 14 hold the date
                String date = line.substring(6, 14);

                // characters 39 to 45 hold the maximum temperature
                float temp_Max = Float.parseFloat(line.substring(39, 45).trim());

                // characters 47 to 53 hold the minimum temperature
                float temp_Min = Float.parseFloat(line.substring(47, 53).trim());

                // if the maximum temperature is greater
                // than 30, it is a hot day
                if (temp_Max > 30.0) {
                    context.write(new Text("The Day is Hot Day :" + date),
                            new Text(String.valueOf(temp_Max)));
                }

                // if the minimum temperature is less
                // than 15, it is a cold day
                if (temp_Min < 15) {
                    context.write(new Text("The Day is Cold Day :" + date),
                            new Text(String.valueOf(temp_Min)));
                }
            }
        }
    }

    // Reducer

    /* MaxTemperatureReducer is a static nested class that
     * extends the Reducer abstract class with four Hadoop
     * generic types: Text, Text, Text, Text.
     */
    public static class MaxTemperatureReducer extends
            Reducer<Text, Text, Text, Text> {

        /**
         * @method reduce
         * Takes the key and list-of-values pairs produced by
         * the mapper, aggregates them by key, and writes the
         * final output.
         */
        @Override
        public void reduce(Text key, Iterable<Text> values, Context context)
                throws IOException, InterruptedException {

            // each key emitted by the mapper carries a single
            // temperature value; forward it to the output
            for (Text value : values) {
                context.write(key, value);
            }
        }
    }

    /**
     * @method main
     * Sets all the configuration properties and acts as
     * the driver for the MapReduce code.
     */
    public static void main(String[] args) throws Exception {

        // read the default configuration of the
        // cluster from the configuration XML files
        Configuration conf = new Configuration();

        // initialize the job with the
        // default configuration of the cluster
        Job job = Job.getInstance(conf, "weather example");

        // assign the driver class name
        job.setJarByClass(MyMaxMin.class);

        // key type coming out of the mapper
        job.setMapOutputKeyClass(Text.class);

        // value type coming out of the mapper
        job.setMapOutputValueClass(Text.class);

        // define the mapper class name
        job.setMapperClass(MaxTemperatureMapper.class);

        // define the reducer class name
        job.setReducerClass(MaxTemperatureReducer.class);

        // input format class, responsible for parsing
        // the dataset into key/value pairs
        job.setInputFormatClass(TextInputFormat.class);

        // output format class, responsible for writing
        // the result as key/value pairs
        job.setOutputFormatClass(TextOutputFormat.class);

        // store the second argument (the output path)
        // in a Path variable
        Path OutputPath = new Path(args[1]);

        // configure the input path
        // from the filesystem into the job
        FileInputFormat.addInputPath(job, new Path(args[0]));

        // configure the output path from
        // the filesystem into the job
        FileOutputFormat.setOutputPath(job, OutputPath);

        // delete the output path automatically
        // from HDFS so that we don't have
        // to delete it explicitly
        OutputPath.getFileSystem(conf).delete(OutputPath, true);

        // exit with 0 if the job succeeds, 1 otherwise
        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
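The substring offsets above assume a fixed-width record layout, so it is worth sanity-checking them against a real line of your chosen dataset before running the full job. Below is a minimal standalone sketch of that check; the record in it is hypothetical, hand-padded so the fields fall in the column ranges the mapper reads. Replace it with an actual line from your data file and compare against the README.txt layout.

// Standalone sanity check for the parsing logic used in MyMaxMin.map().
// The record below is made up: it is padded so that the date sits in
// columns 6-14 and the temperatures in columns 39-45 and 47-53,
// matching the substring calls in the mapper.
public class ParseCheck {

    public static void main(String[] args) {

        String line = "STN123"                  // columns 0-5: station id (made up)
                + "20240101"                    // columns 6-13: date
                + "XXXXXXXXXXXXXXXXXXXXXXXXX"   // columns 14-38: 25 skipped characters
                + "  32.1"                      // columns 39-44: maximum temperature
                + "  "                          // columns 45-46: separator
                + "  12.4";                     // columns 47-52: minimum temperature

        String date = line.substring(6, 14);
        float tempMax = Float.parseFloat(line.substring(39, 45).trim());
        float tempMin = Float.parseFloat(line.substring(47, 53).trim());

        // prints: 20240101 max=32.1 min=12.4
        System.out.println(date + " max=" + tempMax + " min=" + tempMin);
    }
}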

• Step-3: Add the required external JARs and export the project:

§ We need external JARs for the packages that we have imported. Download the JAR packages Hadoop Common and Hadoop MapReduce Core according to your Hadoop version.

§ Now add these external JARs to MyProject.
Right-click on MyProject → select Build Path → click on Configure Build Path → select Add External JARs… → add the JARs from their download location → click Apply and Close.

§ Now export the project as a JAR file.
Right-click on MyProject → choose Export… → go to Java → JAR file → click Next → choose your export destination → click Next.
Choose the Main Class as MyMaxMin by clicking Browse → then click Finish → OK.
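Optionally, the JDK's jar tool can confirm that MyMaxMin.class was packaged into the archive (the path below is a placeholder for your chosen export destination):

jar tf /…./Project.jar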

• Step-4: Start the Hadoop daemons.

start-dfs.sh

start-yarn.sh
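You can verify that the daemons came up with the JDK's jps tool, which lists the running Java processes; on a typical single-node setup you would expect to see entries such as NameNode, DataNode, ResourceManager, and NodeManager.

jps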

• Step-5: Move the dataset to Hadoop HDFS.

hdfs dfs -put /file_path /destination

In the command below, / denotes the root directory of our HDFS:

hdfs dfs -put /home/…./……./datasetname.txt /

hdfs dfs -ls /

• Step-6: Now run your JAR file with the command below to produce the output in the MyOutput directory.

hadoop jar /jar_file_location /dataset_location_in_HDFS /output-file_name

hadoop jar /…./…./…./Project.jar /datasetname.txt /MyOutput

• Step-7: Now go to localhost:50070/ in your browser (on Hadoop 3.x the NameNode web UI runs at localhost:9870 instead), select Browse the file system under Utilities, and download part-r-00000 from the /MyOutput directory to see the result.
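Alternatively, the result can be read directly from the terminal without the web UI by printing the output file from HDFS:

hdfs dfs -cat /MyOutput/part-r-00000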

• Step-8: View the result in the downloaded file.
