0% found this document useful (0 votes)

32 views8 pages

Run Word Count - Hive Job On EMR - V1 - Reviewed - Sks - Lab Guides

Uploaded by

Aniket Sonale

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

32 views8 pages

Run Word Count - Hive Job On EMR - V1 - Reviewed - Sks - Lab Guides

Uploaded by

Aniket Sonale

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

Big Data

Run Hive Job on EMR– Demo

Table of Contents

Steps to create EMR Cluster – Demo ............................................................................................. 2

Step 1: Select the S3 service. ................................................................ Error! Bookmark not defined.
Step 2: Click Create bucket.................................................................... Error! Bookmark not defined.
Step 3: Write the Bucket name. Click Create. ...................................... Error! Bookmark not defined.
Step 4: Click the bucket name. .............................................................. Error! Bookmark not defined.
Step 5: Click Create folder. .................................................................... Error! Bookmark not defined.
Step 6: Type the folder name. Click Save. ........................................... Error! Bookmark not defined.
Step 7: Select the EMR service. ............................................................ Error! Bookmark not defined.
Step 8: Click Create clusters. ................................................................. Error! Bookmark not defined.
Step 9: Type the Cluster name. Click the folder icon........................... Error! Bookmark not defined.
Step 10: Select the S3 bucket created earlier. Click Select. ............... Error! Bookmark not defined.
Step 11: Choose the latest version. ...................................................... Error! Bookmark not defined.
Step 12: Choose the instance type as “m4.large”. Choose the number of instances as per your
requirement. Enter the EC2 key pair. Click Create cluster. ................. Error! Bookmark not defined.
Step 13: Check the cluster status. ......................................................... Error! Bookmark not defined.
Step 14: Go to EC2 service. Three instances are created automatically. ....... Error! Bookmark not
defined.
Step 15: Click the master node Security group. ................................... Error! Bookmark not defined.
Step 16: Click the Inbound tab. Click Edit. ............................................ Error! Bookmark not defined.
Step 17: Click Add Rule button. ............................................................. Error! Bookmark not defined.
Step 18: Add “SSH” and make it anywhere. Click Save. ..................... Error! Bookmark not defined.
Step 19: SSH your instance. .................................................................. Error! Bookmark not defined.

1
Big Data

Steps to run Hive Job on EMR – Demo

Pre-requisite: -------- Commented [SKS[1]: Please add.

Step 1: Click the cluster you created earlier.

Step 2: Click Steps tab. Click “Add Step” button.

2
Big Data

Step 3: Select the step type “Hive program”. Give it a name. Enter the Script S3 location and
Input S3 location.
Script location: S3://us-east-1.elasticmapreduce.samples/cloudfront/code/Hive_CloudFront.q

Input location: s3://us-east-1.elasticmapreduce.samples

Step 4: Simultaneously, Go to S3 service on a new tab and click the bucket you created earlier.

3
Big Data

Step 5: Click Create folder.

Step 6: Type the folder name. Click Save.

4
Big Data

Step 7: Go to EMR service tab again. Click the folder icon.

Step 8: Select the folder “outputs” from the bucket. Click Select.

5
Big Data

Step 9: Check the cluster status.

Step 10: Select the S3 bucket you created earlier. Select the outputs folder.

6
Big Data

Step 11: Choose the os_requests.

Step 12: Download it. Open it in a notepad.

7
Big Data

Step 13: Check the file.

GCP Setup Guide Document
No ratings yet
GCP Setup Guide Document
29 pages
Cloud Computing
No ratings yet
Cloud Computing
18 pages
Deep Learning Algorithms
100% (1)
Deep Learning Algorithms
412 pages
Mca 307 A4 PDF
No ratings yet
Mca 307 A4 PDF
19 pages
Ramp-Up Guide Microsoft On AWS
No ratings yet
Ramp-Up Guide Microsoft On AWS
2 pages
BBT 3103 - Network Routing Protocals - April 2024
No ratings yet
BBT 3103 - Network Routing Protocals - April 2024
4 pages
Python Lab - Evaluation - Final (Ece)
No ratings yet
Python Lab - Evaluation - Final (Ece)
4 pages
TBDSFL MAT MUNTASIR 马宇诚
No ratings yet
TBDSFL MAT MUNTASIR 马宇诚
12 pages
Chaitanya Cloud Full
No ratings yet
Chaitanya Cloud Full
70 pages
Barcode Scanner Table
No ratings yet
Barcode Scanner Table
2 pages
Aws Simple Storage Service
No ratings yet
Aws Simple Storage Service
23 pages
1605192076066-614 DAS-C01 Study Guide
No ratings yet
1605192076066-614 DAS-C01 Study Guide
18 pages
AWS-SOP - S3 Bucket Creation
No ratings yet
AWS-SOP - S3 Bucket Creation
12 pages
BDA Unit 3
No ratings yet
BDA Unit 3
7 pages
Cognixia Course - AWS Cloud Practitioner - Schneider
No ratings yet
Cognixia Course - AWS Cloud Practitioner - Schneider
5 pages
Unit 4 - Creating and Validating Forms MCQs (22619 WBP)
No ratings yet
Unit 4 - Creating and Validating Forms MCQs (22619 WBP)
8 pages
Designing Algorithms and The Fairness Criteria They Should Satisfy
No ratings yet
Designing Algorithms and The Fairness Criteria They Should Satisfy
1 page
Downloaded Oct24 Lab5 Latestmanual
No ratings yet
Downloaded Oct24 Lab5 Latestmanual
24 pages
bd1718 12 Othertools
No ratings yet
bd1718 12 Othertools
50 pages
BDA Brijesh
No ratings yet
BDA Brijesh
113 pages
Cloud Architect Certification Masters Course
No ratings yet
Cloud Architect Certification Masters Course
14 pages
IM C2010 IM C2010AEX IM C2510 Spec Sheet v2
No ratings yet
IM C2010 IM C2010AEX IM C2510 Spec Sheet v2
3 pages
AWS SAA Diagrams
No ratings yet
AWS SAA Diagrams
200 pages
Cloud
No ratings yet
Cloud
5 pages
m1 Demo2 v1 99h bssn65f
No ratings yet
m1 Demo2 v1 99h bssn65f
19 pages
803 Web Application SQP
No ratings yet
803 Web Application SQP
10 pages
HPE - A50007027enw - HPE Compute Edge Server E930t
No ratings yet
HPE - A50007027enw - HPE Compute Edge Server E930t
18 pages
PCC Request For P&B Travel
No ratings yet
PCC Request For P&B Travel
4 pages
LabManual5 ProcessingLogs Using EMR
No ratings yet
LabManual5 ProcessingLogs Using EMR
29 pages
Big Data Developer
No ratings yet
Big Data Developer
81 pages
Metaswitch Datasheet Network Transformation Overview
No ratings yet
Metaswitch Datasheet Network Transformation Overview
5 pages
Cloud Computing Lab FILE
No ratings yet
Cloud Computing Lab FILE
28 pages
Concurrency Control problems-UNIT-4 Part DBMS
No ratings yet
Concurrency Control problems-UNIT-4 Part DBMS
6 pages
Cloud Practicals
No ratings yet
Cloud Practicals
30 pages
Introduction To Word 2026
No ratings yet
Introduction To Word 2026
10 pages
Lab Manual Big Data Analyticts
No ratings yet
Lab Manual Big Data Analyticts
67 pages
AWS Certified Developer - Associate
No ratings yet
AWS Certified Developer - Associate
7 pages
Cloud Computing
No ratings yet
Cloud Computing
39 pages
Branch and Bound
No ratings yet
Branch and Bound
49 pages
Production Data Processing With Apache Spark
No ratings yet
Production Data Processing With Apache Spark
7 pages
AWS Solutions Architect Cheat Sheet Feb 2025
No ratings yet
AWS Solutions Architect Cheat Sheet Feb 2025
65 pages
Big Data Specialisation
No ratings yet
Big Data Specialisation
8 pages
AWS Big Data Certification Course For DAS C01
No ratings yet
AWS Big Data Certification Course For DAS C01
10 pages
Sajjad Ahmed Portfolio
No ratings yet
Sajjad Ahmed Portfolio
39 pages
AWS Security Architecture
No ratings yet
AWS Security Architecture
153 pages
Lista Wadsa 20-10
No ratings yet
Lista Wadsa 20-10
31 pages
Powerpoint, or : Text & Multimedia in Class
No ratings yet
Powerpoint, or : Text & Multimedia in Class
47 pages
Aws Mini Project 1: Lab1: Iam Hands-On
No ratings yet
Aws Mini Project 1: Lab1: Iam Hands-On
92 pages
Lab 5 Storage
No ratings yet
Lab 5 Storage
4 pages
CS For DS Lab Record 2024 - 2
No ratings yet
CS For DS Lab Record 2024 - 2
50 pages
Syllabus Aws
No ratings yet
Syllabus Aws
15 pages
EMEA Online Summit - Getting Started - UseCases
No ratings yet
EMEA Online Summit - Getting Started - UseCases
1 page
Amazon EC2 Lab2
No ratings yet
Amazon EC2 Lab2
25 pages
Delete EMR Cluster - V1 - Reviewed - Sks - Lab Guides
No ratings yet
Delete EMR Cluster - V1 - Reviewed - Sks - Lab Guides
3 pages
4 Configure-Chef-workstation-on-ubuntu
No ratings yet
4 Configure-Chef-workstation-on-ubuntu
3 pages
Matrices One Shot #BB
100% (1)
Matrices One Shot #BB
158 pages
SFCDP Dumps
No ratings yet
SFCDP Dumps
16 pages
PG Program in Cloud Computing
No ratings yet
PG Program in Cloud Computing
14 pages
How To Configure Big Data Management 10.1 For Amazon EMR 4.6
No ratings yet
How To Configure Big Data Management 10.1 For Amazon EMR 4.6
10 pages
4.docker Volume
No ratings yet
4.docker Volume
3 pages
Oracle: Question & Answers
No ratings yet
Oracle: Question & Answers
9 pages
AWS Solution Architect Practical Assignments
No ratings yet
AWS Solution Architect Practical Assignments
52 pages
AWS MINI Project
No ratings yet
AWS MINI Project
63 pages
Guide - Part1 - Apache Hadoop Installation and Cluster Setup On AWS EC2 (Ubuntu) PDF
No ratings yet
Guide - Part1 - Apache Hadoop Installation and Cluster Setup On AWS EC2 (Ubuntu) PDF
23 pages
Cloud Computing Lab4 Kittu
No ratings yet
Cloud Computing Lab4 Kittu
15 pages
Cyber Scape
100% (1)
Cyber Scape
1 page
Cloud Architect Certification Masters Course PDF
No ratings yet
Cloud Architect Certification Masters Course PDF
14 pages
SQL Notes
No ratings yet
SQL Notes
30 pages
AWS Lab Notes
0% (1)
AWS Lab Notes
68 pages
65EP5G - Datasheet (Low) - LG OLED Pro Monitor - 210406
No ratings yet
65EP5G - Datasheet (Low) - LG OLED Pro Monitor - 210406
3 pages
Cloud Admini
No ratings yet
Cloud Admini
4 pages
Cloud Computing Lab Tanushri
No ratings yet
Cloud Computing Lab Tanushri
19 pages
Big Data Lab Manual and Syllabus
No ratings yet
Big Data Lab Manual and Syllabus
71 pages
Parth Savjani: Professional Summary
No ratings yet
Parth Savjani: Professional Summary
2 pages
Amazon Emr Management Guide
No ratings yet
Amazon Emr Management Guide
314 pages
@timwr For Not Forgetting Me.: @timwr @lucyoas @xaitax @314Ckc47
No ratings yet
@timwr For Not Forgetting Me.: @timwr @lucyoas @xaitax @314Ckc47
7 pages
Cloud and Ubiquitous Computing Practical Manual
100% (1)
Cloud and Ubiquitous Computing Practical Manual
20 pages
Macaw Power BI Cheat Sheet EN
100% (1)
Macaw Power BI Cheat Sheet EN
2 pages
Pro Tools Keyboard Shortcuts PDF
No ratings yet
Pro Tools Keyboard Shortcuts PDF
6 pages
Building 1000 Node Spark Cluster On EMR
No ratings yet
Building 1000 Node Spark Cluster On EMR
53 pages
Hive On Google Cloud
No ratings yet
Hive On Google Cloud
16 pages
Become Microsoft Certified: Azure Business Applications Modern Workplace
No ratings yet
Become Microsoft Certified: Azure Business Applications Modern Workplace
1 page
IIM Cal Big Data Course Slides
No ratings yet
IIM Cal Big Data Course Slides
131 pages
Introduction To VBA in Excel Week 1: Study Items For This Week
No ratings yet
Introduction To VBA in Excel Week 1: Study Items For This Week
7 pages
Javabykiran: Aws Solutions Architect
No ratings yet
Javabykiran: Aws Solutions Architect
11 pages
How To Suspend Your Virtual Machine Faster
No ratings yet
How To Suspend Your Virtual Machine Faster
1 page
Big Data On Aws at Edutronic PDF
No ratings yet
Big Data On Aws at Edutronic PDF
2 pages
Running Wordcount On AWS Elastic Map Reduce
100% (2)
Running Wordcount On AWS Elastic Map Reduce
26 pages
Getting Started With AWS: Analyzing Big Data
No ratings yet
Getting Started With AWS: Analyzing Big Data
29 pages
My 12 Aws 124 Cou 23
No ratings yet
My 12 Aws 124 Cou 23
13 pages
02 - Apache Spark On Amazon EMR
No ratings yet
02 - Apache Spark On Amazon EMR
31 pages
Amazon Elastic MapReduce Best Practices
No ratings yet
Amazon Elastic MapReduce Best Practices
38 pages
AWS Amazon EMR
100% (1)
AWS Amazon EMR
38 pages

Run Word Count - Hive Job On EMR - V1 - Reviewed - Sks - Lab Guides

Uploaded by

Run Word Count - Hive Job On EMR - V1 - Reviewed - Sks - Lab Guides

Uploaded by

Big Data

Run Hive Job on EMR– Demo

Steps to create EMR Cluster – Demo ............................................................................................. 2

Steps to run Hive Job on EMR – Demo

Step 1: Click the cluster you created earlier.

Step 2: Click Steps tab. Click “Add Step” button.

Input location: s3://us-east-1.elasticmapreduce.samples

Step 5: Click Create folder.

Step 6: Type the folder name. Click Save.

Step 7: Go to EMR service tab again. Click the folder icon.

Step 9: Check the cluster status.

Step 11: Choose the os_requests.

Step 12: Download it. Open it in a notepad.

Step 13: Check the file.

You might also like