Hive
a. Creating a database, table
b. Dropping a database, table
c. Describe command, alter, insert, select
d. Group by with having, Order by
Hive Commands with Syntax and Examples
Create Database
This command will create a database.
hive> create database <database-name>;
For example: create database demo;
Drop Database
This command deletes a defined database.
hive> drop database demo;
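Create Table
Before describing and loading data, an employee table is assumed to exist in the demo database. The create table statement itself is not shown in these notes; the following is a minimal sketch of what it likely looks like, with illustrative column names and ',' as the field delimiter:
hive> create table demo.employee (Id int, Name string, Salary float)
row format delimited
fields terminated by ',';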
Here, the command also specifies that the fields in the data file are separated by ','.
Let’s see the metadata of the created table.
hive> describe demo.employee;
Hive also lets us create a new table that is a copy of an existing table.
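A sketch of the kind of statement this refers to, assuming the LIKE clause is used so that the new table copies only the schema of the existing one (the new table's name is illustrative):
hive> create table if not exists demo.employee_copy like demo.employee;
hive> describe demo.employee_copy;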
Hive — Load Data
Once the internal table has been created, the next step is to load the
data into it.
Let’s load the data of the file into the database by using the
following command: -
hive> load data local inpath '/home/<username>/hive/emp_details' into table demo.employee;
hive> select * from demo.employee;
Adding a column:
hive> alter table table_name add columns (column_name datatype);
Changing a column:
hive> alter table table_name change <old_column_name> <new_column_name> datatype;
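For instance, applied to the employee table sketched earlier (the column names here are illustrative):
hive> alter table demo.employee add columns (department string);
hive> alter table demo.employee change department dept string;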
In Hive, DML statements let us add data to a Hive table in two different ways:
Using the INSERT command
Using the LOAD DATA statement
Example:
To insert data into a table, let's first create a table named student (by default, Hive stores its tables in the default database).
Command:
CREATE TABLE IF NOT EXISTS student(
Student_Name STRING,
Student_Rollno INT,
Student_Marks FLOAT)
ROW FORMAT DELIMITED
FIELDS TERMINATED BY ',';
INSERT Query:
INSERT INTO TABLE student VALUES ('Dikshant', 1, 95), ('Akshat', 2, 96), ('Dhruv', 3, 90);
We can check the data of the student table with the help of the below
command.
SELECT * FROM student;
Note:
The LOCAL switch specifies that the data we are loading is available in our local file system. If the LOCAL switch is not used, Hive treats the location as an HDFS path.
The OVERWRITE switch allows us to overwrite the existing table data.
Let’s make a CSV (Comma Separated Values) file named data.csv, since we provided ',' as the field terminator while creating the table in Hive. We are creating this file in our local file system at ‘/home/dikshant/Documents’ for demonstration purposes.
Command:
cd /home/dikshant/Documents   # change to the directory containing data.csv
Now load the data into the student Hive table with the help of the below command.
LOAD DATA LOCAL INPATH '/home/dikshant/Documents/data.csv' INTO
TABLE student;
Let’s see the student table content to observe the effect with the help of
the below command.
SELECT * FROM student;
We can observe that we have successfully added the data to
the student table.
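If we run the load again with the OVERWRITE switch described in the note above, the existing rows of the student table are replaced rather than appended to, for example:
LOAD DATA LOCAL INPATH '/home/dikshant/Documents/data.csv' OVERWRITE INTO TABLE student;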
Hive — Partitioning
Partitioning in Hive can be done in two ways:
Static partitioning
Dynamic partitioning
Static Partitioning
In static or manual partitioning, it is required to pass the values of
partitioned columns manually while loading the data into the table.
Hence, the data file doesn’t contain the partitioned columns.
hive> use test;
hive> create table student (id int, name string, age int,
institute string)
> partitioned by (course string)
> row format delimited
> fields terminated by ',';
Load the data into the table and pass the values of partition
columns with it by using the following command: -
hive> load data local inpath '/home/<username>/hive/student_details1' into table student partition(course="python");
hive> load data local inpath '/home/<username>/hive/student_details2' into table student partition(course="Hadoop");
Dynamic Partitioning
In dynamic partitioning, the values of partitioned columns exist
within the table. So, it is not required to pass the values of
partitioned columns manually.
hive> use show;
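The statements that actually build a dynamically partitioned table are not listed in these notes; a minimal sketch, assuming a plain student table (id, name, age, institute, course) already loaded in this database and a new table student_part to hold the partitioned copy (both names are illustrative):
hive> set hive.exec.dynamic.partition=true;
hive> set hive.exec.dynamic.partition.mode=nonstrict;
hive> create table student_part (id int, name string, age int, institute string)
    > partitioned by (course string)
    > row format delimited
    > fields terminated by ',';
hive> insert into table student_part partition(course)
    > select id, name, age, institute, course from student;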
Now you can view the table data with the help of the select command.
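For example, on the hypothetical student_part table sketched above:
hive> select * from student_part;
hive> select * from student_part where course = 'python';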
HiveQL — Operators
The HiveQL operators facilitate performing various arithmetic and relational operations.
hive> use hql;
hive> create table employee (Id int, Name string , Salary float)
row format delimited
fields terminated by ',' ;
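A couple of illustrative queries on this table, the first using an arithmetic operator (+) and the second a relational operator (>=); the literal values are made up:
hive> select Id, Name, Salary + 5000 from employee;
hive> select * from employee where Salary >= 25000;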
Functions in Hive
hive> use hql;
hive> create table employee_data (Id int, Name string , Salary
float)
row format delimited
fields terminated by ',' ;
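A few of Hive's built-in functions applied to this table as an illustration, returning the upper-cased name, the square root of the salary, and the length of the name:
hive> select Id, upper(Name), sqrt(Salary) from employee_data;
hive> select Name, length(Name) from employee_data;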
Aggregate Functions
Let’s see an example to fetch the maximum/minimum salary of
an employee.
hive> select max(Salary) from employee_data;
hive> select min(Salary) from employee_data;
GROUP BY Clause
The HQL GROUP BY clause is used to group the data from multiple records based on one or more columns. It is generally used in conjunction with the aggregate functions (like SUM, COUNT, MIN, MAX and AVG) to perform an aggregation over each group.
hive> use hql;
hive> create table emp (Id int, Name string, Salary float, Department string)
row format delimited
fields terminated by ',' ;
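For example, grouping the rows by department and summing the salaries within each group (this mirrors the HAVING example below, without the filter):
hive> select department, sum(salary) from emp group by department;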
HAVING CLAUSE
The HQL HAVING clause is used with the GROUP BY clause. Its purpose is to apply constraints on the groups of data produced by the GROUP BY clause, so it only returns the groups where the condition is TRUE.
Let’s fetch the sum of employees’ salaries per department, keeping only the departments where the sum is at least 35000, by using the following command:
hive> select department, sum(salary) from emp group by department having sum(salary) >= 35000;
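The outline at the top also lists Order By, which these notes do not otherwise demonstrate; a minimal sketch on the same emp table, sorting employees by salary in descending order:
hive> select Name, Salary from emp order by Salary desc;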