0% found this document useful (0 votes)

5 views9 pages

Installation of Hive On Ubuntu

This document provides a step-by-step guide to install and configure Apache Hive on an Ubuntu system with Hadoop 3.2.1. Key steps include downloading Hive, configuring environment variables, creating necessary directories in HDFS, and initiating the Derby database. It also addresses potential guava incompatibility errors and concludes with instructions on launching the Hive client shell.

Uploaded by

eswarannihil

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views9 pages

Installation of Hive On Ubuntu

Uploaded by

eswarannihil

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

Install Apache Hive on Ubuntu

To configure Apache Hive, first you need to download and unzip Hive. Then you need to customize
the following files and settings:
• Edit .bashrc file
• Edit hive-config.sh file
• Create Hive directories in HDFS
• Configure hive-site.xml file
• Initiate Derby database

Step 1: Download and Untar Hive

Visit the Apache Hive official download page and determine which Hive version is best suited for
your Hadoop edition. Once you establish which version you need, select the Download a Release
Now! option.

The mirror link on the subsequent page leads to the directories containing available Hive tar
packages. This page also provides useful instructions on how to validate the integrity of files
retrieved from mirror sites.
The Ubuntu system presented in this guide already has Hadoop 3.2.1 installed. This Hadoop
version is compatible with the Hive 3.1.2 release.

Select the apache-hive-3.1.2-bin.tar.gz file to begin the download process.

Alternatively, access your Ubuntu command line and download the compressed Hive files using and
the wget command followed by the download path:
wget https://downloads.apache.org/hive/hive-3.1.2/apache-hive-3.1.2-bin.tar.gz

Once the download process is complete, untar the compressed Hive package:
tar xzf apache-hive-3.1.2-bin.tar.gz

The Hive binary files are now located in the apache-hive-3.1.2-bin directory.
Step 2: Configure Hive Environment Variables (bashrc)
The $HIVE_HOME environment variable needs to direct the client shell to the apache-hive-3.1.2-
bin directory. Edit the .bashrc shell configuration file using a text editor of your choice (we will be
using nano):
sudo nano .bashrc

Append the following Hive environment variables to the .bashrc file:

export HIVE_HOME= "home/hdoop/apache-hive-3.1.2-bin"
export PATH=$PATH:$HIVE_HOME/bin

The Hadoop environment variables are located within the same file.

Save and exit the .bashrc file once you add the Hive variables. Apply the changes to the current
environment with the following command:
source ~/.bashrc
Step 3: Edit hive-config.sh file
Apache Hive needs to be able to interact with the Hadoop Distributed File System. Access the hive-
config.sh file using the previously created $HIVE_HOME variable:
sudo nano $HIVE_HOME/bin/hive-config.sh

Note: The hive-config.sh file is in the bin directory within your Hive installation directory.
Add the HADOOP_HOME variable and the full path to your Hadoop directory:
export HADOOP_HOME=/home/hdoop/hadoop-3.2.1

Save the edits and exit the hive-config.sh file.

Step 4: Create Hive Directories in HDFS

Create two separate directories to store data in the HDFS layer:
• The temporary, tmp directory is going to store the intermediate results of Hive processes.
• The warehouse directory is going to store the Hive related tables.

Create tmp Directory

Create a tmp directory within the HDFS storage layer. This directory is going to store the
intermediary data Hive sends to the HDFS:
hdfs dfs -mkdir /tmp

Add write and execute permissions to tmp group members:

hdfs dfs -chmod g+w /tmp

Check if the permissions were added correctly:

hdfs dfs -ls /

The output confirms that users now have write and execute permissions.

Create warehouse Directory

Create the warehouse directory within the /user/hive/ parent directory:
hdfs dfs -mkdir -p /user/hive/warehouse
Add write and execute permissions to warehouse group members:
hdfs dfs -chmod g+w /user/hive/warehouse

Check if the permissions were added correctly:

hdfs dfs -ls /user/hive

The output confirms that users now have write and execute permissions.

Step 5: Configure hive-site.xml File (Optional)

Apache Hive distributions contain template configuration files by default. The template files are
located within the Hive conf directory and outline default Hive settings.
Use the following command to locate the correct file:
cd $HIVE_HOME/conf

List the files contained in the folder using the ls command.

Use the hive-default.xml.template to create the hive-site.xml file:

cp hive-default.xml.template hive-site.xml

Access the hive-site.xml file using the nano text editor:

sudo nano hive-site.xml

Note: The hive-site.xml file controls every aspect of Hive operations. The number of available
advanced settings can be overwhelming and highly specific. Consult the official Hive Configuration
Documentation regularly when customizing Hive and Hive Metastore settings.
Using Hive in a stand-alone mode rather than in a real-life Apache Hadoop cluster is a safe option
for newcomers. You can configure the system to use your local storage rather than the HDFS layer
by setting the hive.metastore.warehouse.dir parameter value to the location of your Hive warehouse
directory.
Step 6: Initiate Derby Database
Apache Hive uses the Derby database to store metadata. Initiate the Derby database, from the Hive
bin directory using the schematool command:
$HIVE_HOME/bin/schematool -dbType derby -initSchema

The process can take a few moments to complete.

Derby is the default metadata store for Hive. If you plan to use a different database solution, such as
MySQL or PostgreSQL, you can specify a database type in the hive-site.xml file.
How to Fix guava Incompatibility Error in Hive
If the Derby database does not successfully initiate, you might receive an error with the following
content:
“Exception in thread “main” java.lang.NoSuchMethodError:
com.google.common.base.Preconditions.checkArgument(ZLjava/lang/String;Ljava/lang/Object;)V”
This error indicates that there is most likely an incompatibility issue between Hadoop and Hive
guava versions.
Locate the guava jar file in the Hive lib directory:
ls $HIVE_HOME/lib

Locate the guava jar file in the Hadoop lib directory as well:
ls $HADOOP_HOME/share/hadoop/hdfs/lib

The two listed versions are not compatible and are causing the error. Remove the existing guava
file from the Hive lib directory:
rm $HIVE_HOME/lib/guava-19.0.jar

Copy the guava file from the Hadoop lib directory to the Hive lib directory:
cp $HADOOP_HOME/share/hadoop/hdfs/lib/guava-27.0-jre.jar $HIVE_HOME/lib/

Use the schematool command once again to initiate the Derby database:
$HIVE_HOME/bin/schematool -dbType derby -initSchema

Launch Hive Client Shell on Ubuntu

Start the Hive command-line interface using the following commands:
cd $HIVE_HOME/bin

hive

You are now able to issue SQL-like commands and directly interact with HDFS.

Conclusion
You have successfully installed and configured Hive on your Ubuntu system. Use HiveQL to query
and manage your Hadoop distributed storage and perform SQL-like tasks. Your Hadoop cluster now
has an easy-to-use gateway to previously inaccessible RDBMS.

Seat Leon (1P, 1P0,1P1) Workshop - Electrical System
67% (3)
Seat Leon (1P, 1P0,1P1) Workshop - Electrical System
365 pages
Outline Field Development & Project Management (5th Apr 22) Rev.2
No ratings yet
Outline Field Development & Project Management (5th Apr 22) Rev.2
67 pages
70T RT Tadano GR-700EX Load Charts PDF
No ratings yet
70T RT Tadano GR-700EX Load Charts PDF
12 pages
Spouses Cha Vs CA GR No. 124520
No ratings yet
Spouses Cha Vs CA GR No. 124520
2 pages
Apache Hive: Prashant Gupta
100% (1)
Apache Hive: Prashant Gupta
61 pages
Henari Security Business Profile1
100% (1)
Henari Security Business Profile1
8 pages
The Role of Chittagong Port in The Economy of Bangladesh II
100% (2)
The Role of Chittagong Port in The Economy of Bangladesh II
15 pages
Redmond Catalogo
No ratings yet
Redmond Catalogo
242 pages
Bda Unit 5 Notes
No ratings yet
Bda Unit 5 Notes
23 pages
Hive Installation On Windows 10
No ratings yet
Hive Installation On Windows 10
13 pages
Hive Tutorial PDF
0% (1)
Hive Tutorial PDF
14 pages
Chapter 5 Hive
No ratings yet
Chapter 5 Hive
69 pages
Hadoop 3 Installation
No ratings yet
Hadoop 3 Installation
10 pages
Hadoop HIVE
No ratings yet
Hadoop HIVE
41 pages
Hive Tutorial For Beginners: Learn With Examples in 3 Days
No ratings yet
Hive Tutorial For Beginners: Learn With Examples in 3 Days
3 pages
Hive Installation On Windows
No ratings yet
Hive Installation On Windows
21 pages
Project 2
No ratings yet
Project 2
7 pages
Hive PPT
No ratings yet
Hive PPT
61 pages
Guide For The IFT Approval
No ratings yet
Guide For The IFT Approval
34 pages
Chap 6 - Sale of Goods
No ratings yet
Chap 6 - Sale of Goods
35 pages
Factors That Affect Time Management of Humanities and Social Sciences Grade 11 Senior High School Students
No ratings yet
Factors That Affect Time Management of Humanities and Social Sciences Grade 11 Senior High School Students
8 pages
Artificial Intelligence in Product Management PDF
No ratings yet
Artificial Intelligence in Product Management PDF
4 pages
Hadoop - Hive
No ratings yet
Hadoop - Hive
190 pages
Web Design For Everyone Using Wordpress: Golam Morshed
No ratings yet
Web Design For Everyone Using Wordpress: Golam Morshed
31 pages
Apache Hive
No ratings yet
Apache Hive
77 pages
Film Insurance
100% (1)
Film Insurance
8 pages
Unit Iv Part - 1
No ratings yet
Unit Iv Part - 1
60 pages
Hive Is A Data Warehouse Infrastructure Tool To Process Structured Data in Hadoop
No ratings yet
Hive Is A Data Warehouse Infrastructure Tool To Process Structured Data in Hadoop
30 pages
Data Analytics 30-60
No ratings yet
Data Analytics 30-60
115 pages
Big Data
No ratings yet
Big Data
32 pages
Big Data & Analytics (CSE6005) L6
No ratings yet
Big Data & Analytics (CSE6005) L6
56 pages
Project Report On Business Intelligence
No ratings yet
Project Report On Business Intelligence
64 pages
Hive Unit VI
No ratings yet
Hive Unit VI
39 pages
Hive Crash Course: A Beginner's Guide
No ratings yet
Hive Crash Course: A Beginner's Guide
19 pages
Ludo Game Report LP
No ratings yet
Ludo Game Report LP
15 pages
Hive Tutorial
No ratings yet
Hive Tutorial
19 pages
Hive
No ratings yet
Hive
37 pages
(Final Draft) Taskap Sesdilu - M. Arief Priowahono
No ratings yet
(Final Draft) Taskap Sesdilu - M. Arief Priowahono
21 pages
BDA Unit-5
No ratings yet
BDA Unit-5
44 pages
BDA Unit-5
No ratings yet
BDA Unit-5
44 pages
2 - Installation
No ratings yet
2 - Installation
15 pages
Module 4 HIVE1ppt
No ratings yet
Module 4 HIVE1ppt
44 pages
Unit IV
No ratings yet
Unit IV
22 pages
Bda 06
No ratings yet
Bda 06
15 pages
Big Data Analytics Lab File
No ratings yet
Big Data Analytics Lab File
15 pages
Unit IV Notes
No ratings yet
Unit IV Notes
47 pages
Wa0006.
No ratings yet
Wa0006.
53 pages
Visually Pleasing Composition Amount of Information With Respect To Principles of User Interface Design
No ratings yet
Visually Pleasing Composition Amount of Information With Respect To Principles of User Interface Design
9 pages
HIVE
No ratings yet
HIVE
18 pages
BDA Unit V
No ratings yet
BDA Unit V
23 pages
Dao 2015-09
No ratings yet
Dao 2015-09
14 pages
Practical 3.6 Hive
No ratings yet
Practical 3.6 Hive
8 pages
820P 203
No ratings yet
820P 203
10 pages
Hive Updated
No ratings yet
Hive Updated
18 pages
Optimum Equipment Management Through: Life Cycle Costing
No ratings yet
Optimum Equipment Management Through: Life Cycle Costing
4 pages
Hive INstallation
No ratings yet
Hive INstallation
13 pages
CIS612 Kafka Installation Ubuntu
No ratings yet
CIS612 Kafka Installation Ubuntu
14 pages
BD U-5 (Anupam Sir)
No ratings yet
BD U-5 (Anupam Sir)
12 pages
BDA Exp-5
No ratings yet
BDA Exp-5
14 pages
637768232285587483ce 20ce33pt W3 S3 Sy
No ratings yet
637768232285587483ce 20ce33pt W3 S3 Sy
7 pages
Hadoop Installation
No ratings yet
Hadoop Installation
7 pages
Hadoop and Hive Installation
No ratings yet
Hadoop and Hive Installation
19 pages
A Routhray
No ratings yet
A Routhray
5 pages
Hive Configuration: Shashwat Shriparv
No ratings yet
Hive Configuration: Shashwat Shriparv
5 pages
Hive and Hiveql
No ratings yet
Hive and Hiveql
10 pages
Unit-4 Hive
No ratings yet
Unit-4 Hive
10 pages
Hadoop Fully Distributed Cluster
No ratings yet
Hadoop Fully Distributed Cluster
5 pages
Lsn21 NumPy
No ratings yet
Lsn21 NumPy
16 pages
Manual Hadoop HIve Installation
No ratings yet
Manual Hadoop HIve Installation
4 pages
Semi Automated Wireless Beach Cleaning Robot
No ratings yet
Semi Automated Wireless Beach Cleaning Robot
3 pages
Simple Additive Weighting Method To Determining Employee Salary Increase Rate
No ratings yet
Simple Additive Weighting Method To Determining Employee Salary Increase Rate
7 pages
Birds Nest Menu
No ratings yet
Birds Nest Menu
7 pages
Hive
No ratings yet
Hive
5 pages
Hive-1.2.1-Installation Guide-On-Hadoop-2.x
No ratings yet
Hive-1.2.1-Installation Guide-On-Hadoop-2.x
7 pages
Using Hive For Data Warehousing: Introduction To Hive
No ratings yet
Using Hive For Data Warehousing: Introduction To Hive
4 pages
Project 4
No ratings yet
Project 4
8 pages
Py Charm
No ratings yet
Py Charm
5 pages
SW 4048 120 Spec Sheet
No ratings yet
SW 4048 120 Spec Sheet
2 pages
A Common CNN Acrhitecture 365 Data Science Template - En.arabic
No ratings yet
A Common CNN Acrhitecture 365 Data Science Template - En.arabic
5 pages
Python NMFS
No ratings yet
Python NMFS
6 pages
Hive Main Installation
No ratings yet
Hive Main Installation
2 pages
Royal Ahold NV
No ratings yet
Royal Ahold NV
6 pages
Installation Steps
No ratings yet
Installation Steps
5 pages
Apache Hive Installation and Basic Usage Guide
No ratings yet
Apache Hive Installation and Basic Usage Guide
10 pages
Experiment 6
No ratings yet
Experiment 6
4 pages
HIve Installation Guide
No ratings yet
HIve Installation Guide
3 pages
Hadoop Hive
No ratings yet
Hadoop Hive
4 pages
Accessing Hadoop Data Using Hive: Hive Configuration
No ratings yet
Accessing Hadoop Data Using Hive: Hive Configuration
3 pages
Nihil Uppy
No ratings yet
Nihil Uppy
3 pages
Biostar H61MLB Spec
No ratings yet
Biostar H61MLB Spec
2 pages
Vs Code Installation
No ratings yet
Vs Code Installation
3 pages
BBS Implementation Process - Matrix
No ratings yet
BBS Implementation Process - Matrix
2 pages
Unified Case Study
No ratings yet
Unified Case Study
2 pages
Exp11 1
No ratings yet
Exp11 1
3 pages
Invoice: Invoice From Invoice To Customer Information
No ratings yet
Invoice: Invoice From Invoice To Customer Information
2 pages
Hive Properties:-: Hive - Metastore.warehouse - Dir Path You Want To Store Your Table and Database Directory and Its
No ratings yet
Hive Properties:-: Hive - Metastore.warehouse - Dir Path You Want To Store Your Table and Database Directory and Its
2 pages
Power BI Installation Instructions 1
No ratings yet
Power BI Installation Instructions 1
2 pages
Big Data Analytics
From Everand
Big Data Analytics
Nitin Kumar Yadav
No ratings yet
CONFIGURATION OF APACHE SERVER TO SUPPORT ASP
From Everand
CONFIGURATION OF APACHE SERVER TO SUPPORT ASP
DR. HIDAIA MAHMOOD ALASSOULI
No ratings yet
Firebase Storage for Angular: A reliable file upload solution for your applications
From Everand
Firebase Storage for Angular: A reliable file upload solution for your applications
Abdelfattah Ragab
No ratings yet
Configuration of Apache Server To Support ASP
From Everand
Configuration of Apache Server To Support ASP
Dr. Hedaya Mahmood Alasooly
No ratings yet
Quick Configuration of Openldap and Kerberos In Linux and Authenicating Linux to Active Directory
From Everand
Quick Configuration of Openldap and Kerberos In Linux and Authenicating Linux to Active Directory
Dr. Hidaia Mahmood Alassouli
No ratings yet

Installation of Hive On Ubuntu

Uploaded by

Installation of Hive On Ubuntu

Uploaded by

Install Apache Hive on Ubuntu

Step 1: Download and Untar Hive

Select the apache-hive-3.1.2-bin.tar.gz file to begin the download process.

Append the following Hive environment variables to the .bashrc file:

Save the edits and exit the hive-config.sh file.

Step 4: Create Hive Directories in HDFS

Create tmp Directory

Add write and execute permissions to tmp group members:

Check if the permissions were added correctly:

Create warehouse Directory

Check if the permissions were added correctly:

Step 5: Configure hive-site.xml File (Optional)

List the files contained in the folder using the ls command.

Use the hive-default.xml.template to create the hive-site.xml file:

Access the hive-site.xml file using the nano text editor:

The process can take a few moments to complete.

Launch Hive Client Shell on Ubuntu

You might also like