
Hive Architecture and Working

Apache Hive is a data warehouse built on top of Hadoop that allows querying of large datasets stored
in HDFS (Hadoop Distributed File System) using an SQL-like query language called HiveQL. It
abstracts the complexity of Hadoop's MapReduce framework, making it easier for users to analyze
large amounts of data without having to write complex code.
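For example, a HiveQL query reads much like standard SQL. The statement below (against a hypothetical sales table) would be compiled into MapReduce or Tez jobs behind the scenes:

```sql
-- Hypothetical table; illustrates HiveQL's SQL-like syntax.
-- Hive translates this into distributed jobs automatically.
SELECT region, SUM(amount) AS total_sales
FROM sales
WHERE year = 2023
GROUP BY region;
```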
Components of Hive Architecture
1. User Interface (UI)
   - The user interface enables interaction with Hive. Users submit queries, manage
     databases, and perform other operations through:
       - CLI (Command Line Interface): The most basic and widely used method to
         interact with Hive.
       - Web UI: A browser-based interface.
       - ODBC/JDBC Drivers: For connecting external tools and applications to Hive.
2. HiveQL Process Engine
   - HiveQL is the SQL-like query language used in Hive. The process engine receives
     HiveQL statements and converts them into MapReduce tasks or Tez jobs for
     execution on the Hadoop cluster.
3. Metastore
   - The Metastore is a critical component of Hive that stores metadata such as table
     definitions, database schemas, column types, and partition information. This
     metadata is kept in a relational database such as MySQL or Derby.
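As an illustration, the DDL below (table name hypothetical) declares exactly the kind of information the Metastore records: the schema, column types, partition keys, and storage format.

```sql
-- The schema, column types, and partition key declared here are
-- recorded in the Metastore's relational database, not in HDFS.
CREATE TABLE employee_logs (
  emp_id   INT,
  action   STRING,
  log_time TIMESTAMP
)
PARTITIONED BY (log_date STRING)
STORED AS ORC;
```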
4. Driver
   - The Driver manages the lifecycle of a query, coordinating between components:
       - Parser: Breaks down the query and checks for syntax errors.
       - Compiler: Translates the query into a logical plan.
       - Optimizer: Optimizes the execution plan for better performance.
       - Executor: Executes the plan by coordinating with the Execution Engine.
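The output of the parse, compile, and optimize stages can be inspected with Hive's EXPLAIN statement, which prints the plan without running it (the query itself is hypothetical):

```sql
-- Prints the stages and operators the Compiler and Optimizer
-- produced, without submitting any jobs to the cluster.
EXPLAIN
SELECT region, COUNT(*) FROM sales GROUP BY region;
```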
5. Execution Engine
   - The Execution Engine takes the execution plan from the Driver and translates it
     into Hadoop MapReduce jobs, which are submitted to the cluster for processing.
     Hive also supports other execution engines, such as Apache Tez and Apache
     Spark, for faster query execution.
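Which engine receives the plan can be switched per session via the hive.execution.engine property; for example (assuming Tez is installed on the cluster):

```sql
-- Valid values typically include mr (MapReduce), tez, and spark,
-- depending on what the cluster has installed.
SET hive.execution.engine=tez;
SELECT COUNT(*) FROM sales;
```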
6. Hadoop (HDFS and MapReduce)
   - Hive uses Hadoop's HDFS for storage and MapReduce (or Tez/Spark) for
     distributed processing. Hive queries are converted into MapReduce jobs that run
     on the Hadoop cluster.
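Hive's reliance on HDFS for storage is easy to see with an external table, which layers a schema over files already sitting in an HDFS directory (the path and columns below are hypothetical):

```sql
-- The data files stay where they are in HDFS; Hive only records
-- the schema and the directory location in the Metastore.
CREATE EXTERNAL TABLE web_logs (
  ip  STRING,
  url STRING,
  ts  TIMESTAMP
)
ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t'
LOCATION '/data/logs/web';
```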
Workflow of Hive
1. Query Submission: The user submits a HiveQL query via CLI, Web UI, or an external tool.
2. Parsing: The Driver parses the query and checks for syntax errors.
3. Plan Generation: The query is converted into a logical execution plan by the compiler.
4. Optimization: The logical plan is optimized for efficient execution (e.g., minimizing the
number of MapReduce tasks).
5. Execution: The optimized plan is converted into MapReduce tasks, which are submitted to
the Hadoop cluster.
6. Result Return: After execution, the results are fetched from HDFS and returned to the user.
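The six steps above can be traced through a short session sketch (table and file names hypothetical):

```sql
-- 1-2. Submitted via the CLI; the Driver parses each statement.
CREATE TABLE orders (id INT, amount DOUBLE) STORED AS TEXTFILE;
LOAD DATA INPATH '/data/orders.csv' INTO TABLE orders;
-- 3-5. Compiled, optimized, and run as MapReduce/Tez tasks.
SELECT COUNT(*), AVG(amount) FROM orders;
-- 6.   Results are fetched from HDFS and returned to the client.
```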
+-----------------------+
|    User Interface     |
|  (CLI, Web UI, JDBC)  |
+-----------+-----------+
            |
            v
+-----------------------+
| HiveQL Process Engine |
+-----------+-----------+
            |
            v
+-----------------------+
|        Driver         |
|  (Parser, Compiler,   |
|  Optimizer, Executor) |
+-----------+-----------+
            |
            v
+-----------------------+
|   Execution Engine    |
+-----------+-----------+
            |
            v
+-----------------------+
|    Hadoop (HDFS,      |
| MapReduce/Tez/Spark)  |
+-----------+-----------+
            |
            v
+-----------------------+
|     Data Storage      |
|  (HDFS, HBase, etc.)  |
+-----------------------+
