ABP W11-W12 Big Data Analytics Lab-HIVE

BIG DATA ANALYTICS LAB

(A7902) (VCE-R21)

Week-11 Hive commands

a) Implement Data Definition Language (DDL) commands for
databases in the Hadoop Hive framework using Cloudera.
b) Implement Data Definition Language (DDL) commands for
tables in Hive.

• Open VirtualBox, start the Cloudera QuickStart VM, open a terminal and type "hive" to
launch the Hive shell.
11.a) DDL Commands for Databases
1) The CREATE DATABASE statement creates a database in Hive. A database in Hive is a
namespace, that is, a collection or catalog of tables.

Syntax: CREATE (DATABASE|SCHEMA) [IF NOT EXISTS] database_name
[COMMENT database_comment]
[LOCATION hdfs_path]
[WITH DBPROPERTIES (property_name=property_value, ...)];

Clauses in [ ] are optional. SCHEMA can be used in place of DATABASE in this command.
The following queries create the databases facultycse, facultyece and employee. If a
statement succeeds, Hive prints an 'OK' message; otherwise it prints a relevant error message.

Simple creation

hive> CREATE DATABASE facultycse;

Time taken: 0.033 seconds

hive> CREATE DATABASE facultyece;

Full creation

hive> CREATE DATABASE IF NOT EXISTS employee
COMMENT 'this is employee database'
LOCATION '/user/hive/warehouse/hivedir/'
WITH DBPROPERTIES ('creator'='Bhanu', 'date'='2020-12-07');

A. Bhanu Prasad, Associate Professor of CSE, VCE


2) The SHOW DATABASES statement lists all the databases present in the metastore.
Syn: SHOW (DATABASES|SCHEMAS) [LIKE 'identifier_with_wildcards'];
• The pattern may contain only two wildcards: '*' for any character(s) and '|' for a
choice. For example, 'employees', 'emp*' and 'emp*|*ees' all match the
database named 'employees'.

hive> SHOW DATABASES;
default
employee
facultycse
facultyece
hive> SHOW DATABASES LIKE '*ee';
employee
hive> SHOW DATABASES LIKE 'fac*';
facultycse
facultyece
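
The '|' choice wildcard is described above but not demonstrated. A minimal sketch, assuming the databases created in the earlier examples still exist; with them it would be expected to list employee, facultycse and facultyece:

```sql
-- Match any database whose name starts with "emp" OR starts with "fac".
SHOW DATABASES LIKE 'emp*|fac*';
```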
3) The DESCRIBE DATABASE statement shows the name of the database, its
comment (if set), its location, its owner name, its owner type and its properties.
Syn: DESCRIBE (DATABASE|SCHEMA) [EXTENDED] db_name;
• EXTENDED additionally displays the database properties.
hive> DESCRIBE DATABASE facultycse;
facultycse hdfs://quickstart.cloudera:8020/user/hive/warehouse/faculty.db cloudera USER
hive> DESCRIBE DATABASE EXTENDED employee;
employee this is employee database
hdfs://quickstart.cloudera:8020/user/hive/warehouse/ cloudera USER {date=2020-12-07, creator=Bhanu};
4) The USE statement selects the database against which all subsequent HiveQL
statements in the session are executed.
Syn: USE db_name;
hive> USE employee;
OK
5) The DROP DATABASE statement drops (deletes) a database. The default
behavior is RESTRICT, which means that the database is dropped only if it is empty.
To drop a database that still contains tables, use CASCADE.
Syn: DROP (DATABASE|SCHEMA) [IF EXISTS] db_name [RESTRICT|CASCADE];
hive> DROP DATABASE facultyece;
OK
hive> DROP DATABASE IF EXISTS facultycse CASCADE;
OK
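
The difference between RESTRICT and CASCADE is easiest to see on a non-empty database. A minimal sketch, in which the database name tempdb and the table name t1 are hypothetical and used only for illustration:

```sql
-- Hypothetical scratch database containing one table.
CREATE DATABASE IF NOT EXISTS tempdb;
CREATE TABLE IF NOT EXISTS tempdb.t1 (id INT);

-- Fails under the default RESTRICT behavior because tempdb is not empty.
DROP DATABASE tempdb;

-- Drops the contained table first and then the database itself.
DROP DATABASE IF EXISTS tempdb CASCADE;
```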


6) The ALTER DATABASE statement changes the metadata associated with a
database in Hive.
Syntax for changing database properties:
ALTER (DATABASE|SCHEMA) db_name SET DBPROPERTIES
(property_name=property_value, ...);
hive> ALTER DATABASE employee SET DBPROPERTIES ('creator'='Bhanu Prasad',
'date'='07-12-2020');
hive> DESCRIBE DATABASE EXTENDED employee;
employee this is employee database hdfs://quickstart.cloudera:8020
/user/hive/warehouse/hivedir/ cloudera USER {date=07-12-2020, creator=Bhanu Prasad};

Syntax for changing the database owner:
ALTER (DATABASE|SCHEMA) database_name SET OWNER [USER|ROLE]
user_or_role;
hive> ALTER DATABASE employee SET OWNER USER client;
hive> DESCRIBE DATABASE EXTENDED employee;
employee this is employee database hdfs://quickstart.cloudera:8020
/user/hive/warehouse/hivedir/ client USER {date=07-12-2020, creator=Bhanu Prasad};
hive> ALTER DATABASE employee SET OWNER ROLE Admin;
hive> DESCRIBE DATABASE EXTENDED employee;
employee this is employee database hdfs://quickstart.cloudera:8020
/user/hive/warehouse/hivedir/ Admin ROLE {date=07-12-2020, creator=Bhanu Prasad};

11.b) DDL Commands for Tables

1) The CREATE TABLE statement creates a table with the given name. If a
table or view with the same name already exists, an error is thrown. Use
IF NOT EXISTS to skip the error.
Syn: CREATE TABLE [IF NOT EXISTS] [db_name.]table_name
[(col_name data_type [COMMENT col_comment], ...)]
[COMMENT table_comment]
[ROW FORMAT row_format]
[STORED AS file_format]
[LOCATION hdfs_path];


hive> CREATE TABLE IF NOT EXISTS employee.emptable (
emp_id STRING COMMENT 'This is Employee ID',
emp_name STRING COMMENT 'This is Employee Name',
emp_sal FLOAT COMMENT 'This is Employee Salary')
COMMENT 'This table contains Employees Data'
ROW FORMAT DELIMITED
FIELDS TERMINATED BY ','
STORED AS TEXTFILE;

2) SHOW tables statement in Hive lists all the base tables and views in the current
database.
Syn: SHOW TABLES [IN database_name];
hive> SHOW TABLES IN employee;
OK
emptable

3) The DESCRIBE statement shows the list of columns of the specified table.
Syn: DESCRIBE [EXTENDED|FORMATTED] [db_name.]table_name[.col_name ( [.field_name] )];
hive> DESCRIBE employee.emptable;
emp_id string This is Employee ID
emp_name string This is Employee Name
emp_sal float This is Employee Salary
hive> DESCRIBE EXTENDED employee.emptable;
hive> DESCRIBE FORMATTED employee.emptable;

4) The ALTER TABLE statement changes the structure of an existing table: it can
rename the table, add columns to it, change its properties, and so on.
Syntax to rename a table:
ALTER TABLE table_name RENAME TO new_table_name;
hive> ALTER TABLE employee.emptable RENAME TO employee.facultytable;


Syntax to add columns to a table:
ALTER TABLE table_name ADD COLUMNS (col_name data_type [COMMENT col_comment], ...);
hive> ALTER TABLE employee.facultytable ADD COLUMNS (emp_post STRING
COMMENT 'This is employee post', emp_age INT COMMENT 'This is employee age');

Syntax to set table properties:
ALTER TABLE table_name SET TBLPROPERTIES
('property_key'='property_new_value');
hive> ALTER TABLE employee.facultytable SET TBLPROPERTIES ('table for'='faculty
data');
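
The sample data loaded in Week-12 has six comma-separated fields, the last being male or female, while the statements above declare only five columns. A minimal sketch of adding a sixth column so that the table matches the data; the column name emp_gender is hypothetical and does not appear in the original lab:

```sql
-- emp_gender is an assumed name for the sixth field of the sample data.
ALTER TABLE employee.facultytable ADD COLUMNS (
  emp_gender STRING COMMENT 'This is employee gender');

-- Confirm that the table now lists six columns.
DESCRIBE employee.facultytable;
```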

5) The DROP TABLE statement deletes the data of a table and removes all
metadata associated with it from the Hive metastore.
• If PURGE is not specified, the data is moved to the .Trash/current
directory.
• If PURGE is specified, the data is deleted permanently and cannot be recovered.
Syn: DROP TABLE [IF EXISTS] table_name [PURGE];
hive> DROP TABLE IF EXISTS employee.emptable PURGE;
OK

6) The TRUNCATE TABLE statement removes all the rows from a table or partition.
Syn: TRUNCATE TABLE table_name [PARTITION partition_spec];
hive> TRUNCATE TABLE employee.facultytable;
OK


Week-12 Hive commands

a) Implement Data Manipulation Language (DML) commands
for tables in Hive.
b) Perform data partitioning to split the given larger dataset
into more meaningful chunks.

• Open VirtualBox, start the Cloudera QuickStart VM, open a terminal and type "hive" to
launch the Hive shell.
12.a) DML Commands for Tables
1) The LOAD statement copies or moves data files into the locations
corresponding to Hive tables.

Syn: LOAD DATA [LOCAL] INPATH 'filepath' [OVERWRITE] INTO TABLE
tablename [PARTITION (partcol1=val1, partcol2=val2 ...)];

LOCAL specified = filepath is in the local filesystem, and the file is copied.
LOCAL not specified = filepath is in HDFS, and the file is moved.
OVERWRITE specified = the contents of the target table (or partition) are deleted and replaced by
the loaded files; otherwise the files are added to the table.

hive> LOAD DATA LOCAL INPATH '/home/cloudera/HiveDir/emptextdata' INTO
TABLE employee.facultytable;

emptextdata contents:
1,bob,25000.00,asstprof,35,male
2,mary,35000.00,assocprof,38,female
3,mike,50000.00,prof,45,male
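
The example above covers only the LOCAL form. A minimal sketch of the HDFS form with OVERWRITE, assuming the file has first been copied into HDFS; the directory /user/cloudera/ is an assumed location, not one given in the original:

```sql
-- Assumed preparation step, run from the Linux terminal rather than the Hive shell:
--   hdfs dfs -put /home/cloudera/HiveDir/emptextdata /user/cloudera/

-- Without LOCAL the path refers to HDFS and the file is moved into the table directory.
-- OVERWRITE replaces whatever the table contained before.
LOAD DATA INPATH '/user/cloudera/emptextdata'
OVERWRITE INTO TABLE employee.facultytable;
```

Because the HDFS form moves the file, /user/cloudera/emptextdata no longer exists at its original path after the statement succeeds.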

2) The SELECT statement retrieves data from a table and works much like the
SELECT statement in SQL.
Syn: SELECT * FROM tablename; // displays all records
hive> SELECT * FROM employee.facultytable;
1 bob 25000.00 asstprof 35 male


2 mary 35000.00 assocprof 38 female
3 mike 50000.00 prof 45 male

Syn: SELECT col1, col2 FROM tablename; // retrieves only the specified columns
hive> SELECT emp_name, emp_sal FROM employee.facultytable;
bob 25000.00
mary 35000.00
mike 50000.00
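
Since SELECT in Hive follows SQL closely, filtering, sorting and aggregation use the familiar clauses. A minimal sketch against the table from this lab, assuming the column names emp_sal and emp_post defined in Week-11:

```sql
-- Rows with a salary above 30000, highest salary first.
SELECT emp_name, emp_sal
FROM employee.facultytable
WHERE emp_sal > 30000
ORDER BY emp_sal DESC;

-- Number of employees and average salary per post.
SELECT emp_post, COUNT(*) AS headcount, AVG(emp_sal) AS avg_sal
FROM employee.facultytable
GROUP BY emp_post;
```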

3) a) The INSERT INTO statement appends data to the existing data in a table or partition.
Syn: INSERT INTO TABLE tablename [PARTITION (partcol1=val1, partcol2=val2
...)] VALUES (col1value, col2value, ...);
hive> INSERT INTO TABLE employee.facultytable VALUES (4, 'jessy', 45000.00,
'assocprof', 40, 'female');
hive> SELECT * FROM employee.facultytable;
4 jessy 45000.00 assocprof 40 female
1 bob 25000.00 asstprof 35 male
2 mary 35000.00 assocprof 38 female
3 mike 50000.00 prof 45 male

b) INSERT OVERWRITE table overwrites the existing data in the table or partition.
Syn: INSERT OVERWRITE TABLE tablename1 [PARTITION (partcol1=val1, ..) [IF
NOT EXISTS]] select_statement FROM from_statement;
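
No example accompanies the INSERT OVERWRITE syntax. A minimal sketch, in which the target table facultybackup is a hypothetical name introduced only for illustration:

```sql
-- Create an empty table with the same schema as facultytable (hypothetical target).
CREATE TABLE IF NOT EXISTS employee.facultybackup LIKE employee.facultytable;

-- Replace whatever facultybackup currently holds with the selected rows.
INSERT OVERWRITE TABLE employee.facultybackup
SELECT * FROM employee.facultytable WHERE emp_sal >= 35000;
```

Running the INSERT OVERWRITE a second time leaves the same rows in place, whereas INSERT INTO would append them again.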

4) The DELETE statement deletes data from a table. If a WHERE clause is specified, only
the rows that satisfy its condition are deleted; otherwise all rows are deleted. DELETE is
available from Hive 0.14 onward and only on tables that support ACID transactions.
Syn: DELETE FROM tablename [WHERE expression];
hive> DELETE FROM employee.facultytable WHERE emp_age=38;
hive> SELECT * FROM employee.facultytable;
4 jessy 45000.00 assocprof 40 female
1 bob 25000.00 asstprof 35 male
3 mike 50000.00 prof 45 male
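
DELETE and UPDATE are rejected on a plain TEXTFILE table such as the one created in Week-11. A minimal sketch of one way to prepare a transactional copy, assuming Hive 0.14 or later with ACID support available on the cluster; the table name facultyacid and the bucket count are hypothetical, and the settings shown may need to go into hive-site.xml instead (metastore-side compactor settings are not shown):

```sql
-- Session settings commonly required for ACID tables (assumed not already set).
SET hive.support.concurrency=true;
SET hive.txn.manager=org.apache.hadoop.hive.ql.lockmgr.DbTxnManager;
SET hive.enforce.bucketing=true;  -- needed on Hive 1.x; removed in Hive 2.0

-- Hypothetical transactional copy; ACID tables must be stored as ORC.
CREATE TABLE IF NOT EXISTS employee.facultyacid (
  emp_id STRING, emp_name STRING, emp_sal FLOAT, emp_post STRING, emp_age INT)
CLUSTERED BY (emp_id) INTO 2 BUCKETS
STORED AS ORC
TBLPROPERTIES ('transactional'='true');

-- Copy the existing rows, then modify them in the transactional table.
INSERT INTO TABLE employee.facultyacid
SELECT emp_id, emp_name, emp_sal, emp_post, emp_age FROM employee.facultytable;

DELETE FROM employee.facultyacid WHERE emp_age = 38;
UPDATE employee.facultyacid SET emp_name = 'mike tyson' WHERE emp_age = 45;
```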


5) The UPDATE statement updates data in a table. If a WHERE clause is specified, only
the rows that satisfy its condition are updated; otherwise all rows are updated.
Partitioning and bucketing columns cannot be updated. Like DELETE, UPDATE works only
on tables that support ACID transactions.
Syn: UPDATE tablename SET column = value [, column = value ...] [WHERE
expression];
hive> UPDATE employee.facultytable SET emp_name = 'mike tyson' WHERE
emp_age=45;
hive> SELECT * FROM employee.facultytable;
4 jessy 45000.00 assocprof 40 female
1 bob 25000.00 asstprof 35 male
3 mike tyson 50000.00 prof 45 male

6) The EXPORT statement exports table or partition data along with its metadata to the
specified output location in HDFS. The metadata is exported to a _metadata file, and the data is
exported to a subdirectory named 'data'.
Syn: EXPORT TABLE tablename [PARTITION (part_column="value"[, ...])] TO
'export_target_path' [ FOR replication('eventid') ];
hive> EXPORT TABLE employee.drivertable TO '/user/hive/warehouse';

7) The IMPORT statement imports data from a specified location into a new table or an already
existing table.
Syn: IMPORT [[EXTERNAL] TABLE new_or_original_tablename [PARTITION
(part_column="value"[, ...])]] FROM 'source_path' [LOCATION 'import_target_path'];
hive> IMPORT TABLE employee.importedtable FROM '/user/hive/warehouse';
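
EXPORT and IMPORT are normally used as a pair. A minimal round-trip sketch using the facultytable from this lab; the export directory and the table name importedfaculty are hypothetical, and the sketch assumes the export directory is empty or does not yet exist:

```sql
-- Export data and metadata to a dedicated HDFS directory (hypothetical path).
EXPORT TABLE employee.facultytable TO '/user/cloudera/export/facultytable';

-- Recreate the table under a new, hypothetical name from the exported files.
IMPORT TABLE employee.importedfaculty FROM '/user/cloudera/export/facultytable';

-- The imported table should return the same rows as the source table.
SELECT * FROM employee.importedfaculty;
```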
