HIVE Architecture
[Architecture diagram: Hive Clients (Thrift client, JDBC Driver, ODBC Driver) connect to Hive Services (Hive Web UI, Hive Server, CLI, Hive Driver, Metastore), which run on top of MapReduce and HDFS.]
Hive Client
Hive allows writing applications in various languages, including Java, Python, and C++. It supports different types of clients such as:
Thrift Server - It is a cross-language service provider platform that serves requests from all programming languages that support Thrift.
JDBC Driver - It is used to establish a connection between Hive and Java applications. The JDBC Driver is present in the class org.apache.hadoop.hive.jdbc.HiveDriver.
ODBC Driver - It allows the applications that
support the ODBC protocol to connect to Hive.
6/16/2023
Hive Services
The following are the services provided by Hive:
Hive CLI - The Hive CLI (Command Line Interface) is a shell where we can execute Hive queries and commands.
Hive Web User Interface - The Hive Web UI is just an alternative to the Hive CLI. It provides a web-based GUI for executing Hive queries and commands.
Hive MetaStore - It is a central repository that stores all the structure information of the various tables and partitions in the warehouse. It also includes metadata of each column and its type, the serializers and deserializers used to read and write data, and the corresponding HDFS files where the data is stored.
Hive Server - It is referred to as the Apache Thrift Server. It accepts requests from different clients and provides them to the Hive Driver.
Hive Driver - It receives queries from different sources like the web UI, CLI, Thrift, and JDBC/ODBC driver. It transfers the queries to the compiler.
Hive Compiler - The purpose of the compiler is to parse the query and perform semantic analysis on the different query blocks and expressions. It converts HiveQL statements into MapReduce jobs.
Hive Execution Engine - The optimizer generates the logical plan in the form of a DAG (directed acyclic graph) of map-reduce tasks and HDFS tasks. In the end, the execution engine executes the incoming tasks in the order of their dependencies.
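The compile-then-execute flow above can be inspected with the EXPLAIN command, which prints the plan the compiler and optimizer produce without running the query. A minimal sketch, assuming a table named employee already exists:

hive> explain select Name, Salary from employee where Salary > 30000;

The output lists the stages (map-reduce tasks and HDFS tasks) that the execution engine would run in dependency order.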
External Table
The external table allows us to create and access a table whose data is kept externally. The external keyword is used to specify an external table, whereas the location keyword is used to determine the location of the loaded data.
As the table is external, the data is not present in the Hive warehouse directory. Therefore, if we try to drop the table, the metadata of the table will be deleted, but the data still exists.
To create an external table, follow the below steps: -
Let's create a directory on HDFS by using the following command: -
hdfs dfs -mkdir /HiveDirectory
Now, store the file in the created directory.
hdfs dfs -put hive/emp_details /HiveDirectory
Let's create an external table using the following command: -
hive> create external table emplist (Id int, Name string, Salary float)
row format delimited
fields terminated by ','
location '/HiveDirectory';
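The claim above, that dropping an external table removes only its metadata, can be checked directly. A sketch, using the directory created in the steps above:

hive> drop table emplist;
hdfs dfs -ls /HiveDirectory

After the drop, the emp_details file is still listed in /HiveDirectory; only the Metastore entry for emplist is gone.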
Partitioning in Hive
The partitioning in Hive means dividing the table into some parts based on the values of a particular column like date, course, city or country. The advantage of partitioning is that since the data is stored in slices, the query response time becomes faster.
As we know that Hadoop is used to handle huge amounts of data, it is always required to use the best approach to deal with it. The partitioning in Hive is the best example of it.
Let's assume we have data of 10 million students studying in an institute. Now, we have to fetch the students of a particular course. If we use a traditional approach, we have to go through the entire data. This leads to performance degradation. In such a case, we can adopt a better approach, i.e., partitioning in Hive, and divide the data among different datasets based on particular columns.
The partitioning in Hive can be executed in two ways -
Static partitioning
Dynamic partitioning
Static Partitioning
In static or manual partitioning, it is required to pass the values of partitioned columns manually while loading the data into the table. Hence, the data file doesn't contain the partitioned columns.
Example of Static Partitioning
First, select the database in which we want to create a table.
hive> use test;
Create the table and provide the partitioned columns by using the following command: -
hive> create table student (id int, name string, age int, institute string)
partitioned by (course string)
row format delimited
fields terminated by ',';
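With static partitioning, each load statement must name the partition value explicitly, since the data file itself does not contain the course column. A sketch, assuming a local file of records for the java course (the file path is hypothetical):

hive> load data local inpath '/home/codegyani/hive/student_details' into table student
partition(course = 'java');

Repeating the load with a different file and partition value (e.g. course = 'hadoop') creates a separate slice under the table's HDFS directory.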
Dynamic Partitioning
In dynamic partitioning, the values of partitioned columns exist within the table. So, it is not required to pass the values of partitioned columns manually.
First, select the database in which we want to create a table.
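The remaining steps can be sketched as follows. Dynamic partitioning must first be enabled, then Hive derives the partition value from the data itself during an insert (the database and table names here are illustrative):

hive> use dynamic_demo;
hive> set hive.exec.dynamic.partition=true;
hive> set hive.exec.dynamic.partition.mode=nonstrict;
hive> create table stud_part (id int, name string) partitioned by (course string);
hive> insert into stud_part partition(course)
select id, name, course from stud_source;

Unlike the static case, no course value is written in the insert statement; Hive reads it from the stud_source rows and routes each record to the matching partition.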
Bucketing in Hive
The bucketing in Hive is a data organizing technique. It is similar to partitioning in Hive with an added functionality that it divides large datasets into more manageable parts known as buckets. So, we can use bucketing in Hive when the implementation of partitioning becomes difficult. However, we can also divide partitions further into buckets.
Working of Bucketing in Hive
The concept of bucketing is based on the hashing technique.
Here, the modulo of the hash of the current column value and the number of required buckets is calculated (let's say, hash(x) % 3).
Now, based on the resulting value, the data is stored in the corresponding bucket.
Example of Bucketing in Hive
First, select the database in which we want to create a table.
hive> use showbucket;
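The rest of the example can be sketched as follows: a table clustered into a fixed number of buckets on a chosen column, populated from an existing table (emp_demo is an assumed source table holding the raw records):

hive> create table emp_bucket (Id int, Name string, Salary float)
clustered by (Id) into 3 buckets
row format delimited
fields terminated by ',';
hive> set hive.enforce.bucketing = true;
hive> insert overwrite table emp_bucket select * from emp_demo;

Each row lands in the bucket numbered hash(Id) % 3, so the table's HDFS directory ends up with three files, one per bucket. (The hive.enforce.bucketing setting is needed on older Hive versions; Hive 2.x enforces bucketing automatically.)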
HiveQL -Operators
The HiveQL operators facilitate performing various arithmetic and relational operations. Here, we are going to execute such operations on the records of the below table:
Example of Operators in Hive
Let's create a table and load the data into it by using the following steps: -
Select the database in which we want to create a table.
hive> use hql;
Create a hive table using the following command: -
hive> create table employee (Id int, Name string, Salary float)
row format delimited
fields terminated by ',';
Now, load the data into the table.
hive> load data local inpath '/home/codegyani/hive/emp_data' into table employee;
Let's fetch the loaded data by using the following command: -
hive> select * from employee;
Arithmetic Operators in Hive
Relational Operators in Hive
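A couple of sketches of each operator family against the employee table loaded above (the literal values are illustrative):

Arithmetic - add a fixed bonus to every salary:
hive> select Id, Name, Salary + 5000 from employee;

Relational - filter rows by comparing column values:
hive> select * from employee where Salary >= 25000;

Arithmetic operators (+, -, *, /, %) operate column-wise on numeric types, while relational operators (=, !=, <, <=, >, >=) reduce each row to true or false in the where clause.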
HiveQL - Functions
Hive provides various in-built functions to perform mathematical and aggregate type operations. Here, we are going to execute such functions on the records of the below table:
Example of Functions in Hive
Let's create a table and load the data into it by using the following steps: -
Select the database in which we want to create a table.
hive> use hql;
Create a hive table using the following command: -
hive> create table employee_data (Id int, Name string, Salary float)
row format delimited
fields terminated by ',';
Now, load the data into the table.
hive> load data local inpath '/home/codegyani/hive/emp_details' into table employee_data;
Let's fetch the loaded data by using the following command: -
hive> select * from employee_data;
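A few sketches of built-in functions against the employee_data table above, one mathematical and two aggregate (the column names follow the table definition; results depend on the loaded file):

hive> select Id, sqrt(Salary) from employee_data;
hive> select max(Salary) from employee_data;
hive> select avg(Salary) from employee_data;

sqrt is evaluated per row, while max and avg collapse the whole table (or each group, when combined with group by) into a single value.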