0% found this document useful (0 votes)

178 views3 pages

Hive Tutorial For Beginners: Learn With Examples in 3 Days

This document provides an overview and syllabus for a 3 day Hive tutorial for beginners. It introduces Apache Hive, which helps query and manage large datasets using SQL-like queries. The syllabus covers basic Hive concepts like installation, configuration, data types, and advanced topics like partitions, buckets, indexes, queries and joins. It defines what Hive is, how it provides a SQL interface to analyze data stored in Hadoop using MapReduce, and compares Hive to using MapReduce directly.

Uploaded by

Karthikeyan Perumal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

178 views3 pages

Hive Tutorial For Beginners: Learn With Examples in 3 Days

Uploaded by

Karthikeyan Perumal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

Hive Tutorial for Beginners: Learn with

Examples in 3 Days
ByDavid TaylorUpdatedApril 16, 2022

Hive Tutorial Summary

Apache Hive helps with querying and managing large datasets real fast. It is an ETL
tool for the Hadoop ecosystem. In this Apache Hive tutorial for beginners, you will
learn Hive basics and important topics like HQL queries, data extractions,
partitions, buckets, and so on. This Hive tutorials series will help you learn Hive
concepts and basics.

What should I know?

To learn this Hive query tutorial, you need basic knowledge of SQL, Hadoop and
knowledge of other databases will be of an additional help.

Hive Course Syllabus

Introduction
👉 Lesson 1 What is Hive? — Architecture & Modes

👉 Lesson 2 Download & Install HIVE — How to Download & Install HIVE on Ubuntu

👉 Lesson 3 HIVE Metastore Configuration — Why to Use MySQL?

👉 Lesson 4 Hive Data Types — Create & Drop Database in Hive

Advanced Stuff
👉 Lesson 1 Hive Create Table — Types and its Usage

👉 Lesson 2 Hive Partitions & Buckets — Learn with Example

👉 Lesson 3 Hive Indexes and View — Learn with Example

👉 Lesson 4 Hive Queries — Learn with Example

👉 Lesson 5 Hive Join & SubQuery Tutorial — Learn with Example

👉 Lesson 6 Hive Query Language Tutorial — Built-in Operators

👉 Lesson 7 Hive Function — Built-in & User Defined Functions

👉 Lesson 8 Hive ETL — Loading JSON, XML, Text Data Examples

Introduction to Hive
Hive evolved as a data warehousing solution built on top of Hadoop Map-Reduce
framework.

The size of data sets being collected and analyzed in the industry for business
intelligence is growing and in a way, it is making traditional data warehousing
solutions more expensive. Hadoop with MapReduce framework, is being used as an
alternative solution for analyzing data sets with huge size. Though, Hadoop has
proved useful for working on huge data sets, its MapReduce framework is very low
level and it requires programmers to write custom programs which are hard to
maintain and reuse. Hive comes here for rescue of programmers.

Hive engine compiles these queries into Map-Reduce jobs to be executed on

Hadoop. In addition, custom Map-Reduce scripts can also be plugged into queries.
Hive operates on data stored in tables which consists of primitive data types and
collection data types like arrays and maps.
Hive comes with a command-line shell interface which can be used to create tables
and execute queries.

Hive query language is similar to SQL wherein it supports subqueries. With Hive
query language, it is possible to take a MapReduce joins across Hive tables. It has a
support for simple SQL like functions– CONCAT, SUBSTR, ROUND etc.,
and aggregation functions– SUM, COUNT, MAX etc. It also supports GROUP BY and
SORT BY clauses. It is also possible to write user defined functions in Hive query
language.

What is Hive?
Apache Hive is a data warehouse framework for querying and analysis of data
stored in HDFS. It is developed on top of Hadoop. Hive is an open-source software
to analyze large data sets on Hadoop. It provides SQL-like declarative language,
called HiveQL, to express queries. Using Hive-QL, users associated with SQL can
perform data analysis very easily.

Hive Vs Map Reduce

Prior to choosing one of these two options, we must look at some of their features.

While choosing between Hive and Map reduce following factors are taken in
consideration;

 Type of Data
 Amount of Data
 Complexity of Code

Hive Vs Map Reduce?

Feature Hive Map Reduce

 It compiles language with two main task

It Supports SQL like query language for is map task, and another one is a reduce
Language
interaction and for Data modeling  We can define these task using Java or P

Level of abstraction Higher level of Abstraction on top of HDFS Lower level of abstraction

Efficiency in Code Comparatively lesser than Map reduce Provides High efficiency

Less number of lines code required for

Extent of code More number of lines of codes to be defined
execution

Type of Development
Less Development work required More development work needed
work required

Learn Hive in 24 Hours
From Everand
Learn Hive in 24 Hours
Alex Nordeen
No ratings yet
DBMS SUPER 25 K-Scheme
No ratings yet
DBMS SUPER 25 K-Scheme
45 pages
30 Days SQL
No ratings yet
30 Days SQL
48 pages
Five Steps To Simplify Your: Data Mart and BI Solution
No ratings yet
Five Steps To Simplify Your: Data Mart and BI Solution
43 pages
Kill Stale Version Store Connection
No ratings yet
Kill Stale Version Store Connection
3 pages
Crisp DM
No ratings yet
Crisp DM
14 pages
Ivan Bayross Book PDF
21% (14)
Ivan Bayross Book PDF
39 pages
9) 11th LESSON 11structured Query Langauge
No ratings yet
9) 11th LESSON 11structured Query Langauge
80 pages
Library Science
No ratings yet
Library Science
10 pages
LAB - Chapter 9 - Database Security
No ratings yet
LAB - Chapter 9 - Database Security
9 pages
Addbafba
No ratings yet
Addbafba
21 pages
DB2
No ratings yet
DB2
1 page
Multitenant Create and Configure Pluggable Database 12c
No ratings yet
Multitenant Create and Configure Pluggable Database 12c
40 pages
DOAG2021 DataPumpDeepDive
No ratings yet
DOAG2021 DataPumpDeepDive
61 pages
Best of Oracle 2018
No ratings yet
Best of Oracle 2018
80 pages
DBMS Lecture 4
No ratings yet
DBMS Lecture 4
27 pages
A Crash Course in Caching - Part 2 - by Alex Xu
No ratings yet
A Crash Course in Caching - Part 2 - by Alex Xu
9 pages
Mariadb Tutorial: Learn Syntax, Commands With Examples
No ratings yet
Mariadb Tutorial: Learn Syntax, Commands With Examples
39 pages
13
No ratings yet
13
296 pages
Bigdata MCQ QA Part2
No ratings yet
Bigdata MCQ QA Part2
9 pages
CHAPTER 5 - Sequence - Index - Synonyms
No ratings yet
CHAPTER 5 - Sequence - Index - Synonyms
5 pages
MongoDB Operations - Basics Guide
No ratings yet
MongoDB Operations - Basics Guide
10 pages
Zookeeper Tutorial: What Is, Architecture of Apache Zookeeper
No ratings yet
Zookeeper Tutorial: What Is, Architecture of Apache Zookeeper
10 pages
Ansible 2
No ratings yet
Ansible 2
15 pages
ISYS6508 Database System: Week 9 Semi-Structured Data and XML
No ratings yet
ISYS6508 Database System: Week 9 Semi-Structured Data and XML
40 pages
Compiler Design Tutorial For Beginners - Complete Guide
No ratings yet
Compiler Design Tutorial For Beginners - Complete Guide
3 pages
Compiler Design Tutorial For Beginners - Complete Guide
No ratings yet
Compiler Design Tutorial For Beginners - Complete Guide
3 pages
NLTK Tutorial: What Is NLTK Library in Python?
No ratings yet
NLTK Tutorial: What Is NLTK Library in Python?
3 pages
Schema Refinement and Normal Forms: Database Management Systems, 3ed, R. Ramakrishnan and J. Gehrke 1
No ratings yet
Schema Refinement and Normal Forms: Database Management Systems, 3ed, R. Ramakrishnan and J. Gehrke 1
19 pages
Dbms Notes
No ratings yet
Dbms Notes
28 pages
Keras Tutorial: What Is Keras? How To Install in Python (Example)
No ratings yet
Keras Tutorial: What Is Keras? How To Install in Python (Example)
36 pages
SQL Triggers: Prepared By: Rahim Suwal (29) Shyam Rajak
100% (1)
SQL Triggers: Prepared By: Rahim Suwal (29) Shyam Rajak
55 pages
INFORMATICA TUTORIAL: Complete Online Training
No ratings yet
INFORMATICA TUTORIAL: Complete Online Training
2 pages
Section5 Exercise2 Authoring A 3D Map
No ratings yet
Section5 Exercise2 Authoring A 3D Map
51 pages
COBOL Tutorial: What Is COBOL Programming Language?
No ratings yet
COBOL Tutorial: What Is COBOL Programming Language?
14 pages
Company Interview Question Bank
No ratings yet
Company Interview Question Bank
16 pages
JDBC
No ratings yet
JDBC
190 pages
Some Important Ques
No ratings yet
Some Important Ques
14 pages
Cassandra Tutorial For Beginners: Learn in 3 Days: What Is Apache Cassandra?
No ratings yet
Cassandra Tutorial For Beginners: Learn in 3 Days: What Is Apache Cassandra?
4 pages
Unit-2 MCQS
No ratings yet
Unit-2 MCQS
7 pages
Postgresql Tutorial For Beginners: Learn Basic PSQL in 3 Days
No ratings yet
Postgresql Tutorial For Beginners: Learn Basic PSQL in 3 Days
3 pages
Unix Lab QUESTION SET
No ratings yet
Unix Lab QUESTION SET
11 pages
Qlikview Tutorial: What Is Qlikview? How To Install Qlikview Tool
No ratings yet
Qlikview Tutorial: What Is Qlikview? How To Install Qlikview Tool
17 pages
Subject Class Test/Exam Syllabus To Be Covered in The Examination
No ratings yet
Subject Class Test/Exam Syllabus To Be Covered in The Examination
6 pages
Spark Scala Interview Question
No ratings yet
Spark Scala Interview Question
3 pages
Spark A To Z
No ratings yet
Spark A To Z
63 pages
Hive in Class Assignment Winter 2021
No ratings yet
Hive in Class Assignment Winter 2021
2 pages
15.python OS Module
No ratings yet
15.python OS Module
14 pages
Lesson 2 - Exploring Linux Command-Line Tools - Part 2 - 2-Đã M Khóa
No ratings yet
Lesson 2 - Exploring Linux Command-Line Tools - Part 2 - 2-Đã M Khóa
5 pages
Server Error This Database Cannot Be Read Due To An Invalid On Disk Structure Sending Notes Mail
100% (1)
Server Error This Database Cannot Be Read Due To An Invalid On Disk Structure Sending Notes Mail
1 page
Snowflake Setup - MD
No ratings yet
Snowflake Setup - MD
2 pages
Apache Hive: Prashant Gupta
100% (1)
Apache Hive: Prashant Gupta
61 pages
Spark in Production
No ratings yet
Spark in Production
34 pages
Impala
No ratings yet
Impala
11 pages
Create An Spark Streaming App: 1. Architecture and Abstraction
No ratings yet
Create An Spark Streaming App: 1. Architecture and Abstraction
8 pages
Python OS Module - 30 Most Useful Methods From Python OS Module
No ratings yet
Python OS Module - 30 Most Useful Methods From Python OS Module
5 pages
Tutorial-HDP-Administration V III
100% (1)
Tutorial-HDP-Administration V III
274 pages
Install Sqoop
No ratings yet
Install Sqoop
7 pages
Mendeley Teaching Presentation - 2011
No ratings yet
Mendeley Teaching Presentation - 2011
33 pages
Akka PDF
No ratings yet
Akka PDF
454 pages
DBMS SQL Practice Questions Shivani
No ratings yet
DBMS SQL Practice Questions Shivani
10 pages
IBM InfoSphere Replication Server and Data Event Publisher
From Everand
IBM InfoSphere Replication Server and Data Event Publisher
Pav Kumar-Chatterjee
No ratings yet
Beginning Microsoft SQL Server 2012 Programming
From Everand
Beginning Microsoft SQL Server 2012 Programming
Paul Atkinson
1/5 (1)
Midhun BIGDATA Curicullum
No ratings yet
Midhun BIGDATA Curicullum
17 pages
Deepshikha Agrawal Pushp B.Sc. (IT), MBA (IT) Certification-Hadoop, Spark, Scala, Python, Tableau, ML (Assistant Professor JLBS)
No ratings yet
Deepshikha Agrawal Pushp B.Sc. (IT), MBA (IT) Certification-Hadoop, Spark, Scala, Python, Tableau, ML (Assistant Professor JLBS)
74 pages
Data-Engineering Course Structure
No ratings yet
Data-Engineering Course Structure
9 pages
Docker - Part1
No ratings yet
Docker - Part1
3 pages
Transformer All Functions
100% (1)
Transformer All Functions
47 pages
Apache Airflow TRAINING12532
No ratings yet
Apache Airflow TRAINING12532
3 pages
Hive Tutorial PDF
0% (1)
Hive Tutorial PDF
14 pages
Unstructured Dataload Into Hive Database Through PySpark
No ratings yet
Unstructured Dataload Into Hive Database Through PySpark
9 pages
Python Exercises Documentation: Release 1.0
No ratings yet
Python Exercises Documentation: Release 1.0
15 pages
Interview PDF
No ratings yet
Interview PDF
100 pages
DVS SPARK Course Content PDF
No ratings yet
DVS SPARK Course Content PDF
2 pages
Sqoop Commands - Latest
No ratings yet
Sqoop Commands - Latest
4 pages
Exception Handling: 1. Syntax Errors
No ratings yet
Exception Handling: 1. Syntax Errors
28 pages
Datatypes in Hive
No ratings yet
Datatypes in Hive
31 pages
Sqoop Cammand
No ratings yet
Sqoop Cammand
8 pages
Learning Apache Spark With Python
No ratings yet
Learning Apache Spark With Python
10 pages
Hbase PDF
No ratings yet
Hbase PDF
8 pages
Elite SQL Queries For Practice PDF
0% (1)
Elite SQL Queries For Practice PDF
20 pages
SS1123 - D2T - Apache Cassandra Overview PDF
100% (1)
SS1123 - D2T - Apache Cassandra Overview PDF
45 pages
Apache Hive
No ratings yet
Apache Hive
3 pages
BD - Spark - Baladasu A - SightSpectrum
No ratings yet
BD - Spark - Baladasu A - SightSpectrum
3 pages
Exploring Reactive Integrations With: Akka Streams
No ratings yet
Exploring Reactive Integrations With: Akka Streams
66 pages
Module 7: Data Management Backup, DR, Test/Dev Environments
No ratings yet
Module 7: Data Management Backup, DR, Test/Dev Environments
9 pages
Mysql Interview Questions PDF
No ratings yet
Mysql Interview Questions PDF
5 pages
Oozie Tutorial
No ratings yet
Oozie Tutorial
84 pages
Parallel Programming With Spark: Matei Zaharia
No ratings yet
Parallel Programming With Spark: Matei Zaharia
40 pages
24 Hadoop Interview Questions & Answers For MapReduce Developers - FromDev
No ratings yet
24 Hadoop Interview Questions & Answers For MapReduce Developers - FromDev
7 pages
PostgreSQL and NoSQL
100% (7)
PostgreSQL and NoSQL
36 pages
Unix Exercise 1
No ratings yet
Unix Exercise 1
2 pages
Oracle Essbase 9 Implementation Guide
From Everand
Oracle Essbase 9 Implementation Guide
Joseph Sydney Gomez
No ratings yet
Hadoop Hive Cheat Sheet - Developer Guide For SQL To HiveQL - Qubole
No ratings yet
Hadoop Hive Cheat Sheet - Developer Guide For SQL To HiveQL - Qubole
19 pages
Hive Cheat Sheet - Quick Reference
No ratings yet
Hive Cheat Sheet - Quick Reference
19 pages
Scala Cheatsheet
No ratings yet
Scala Cheatsheet
2 pages
Hive Query Optimization Infinity
No ratings yet
Hive Query Optimization Infinity
13 pages
Sqoop User Guide
No ratings yet
Sqoop User Guide
58 pages
TCS Format & Experience 1
No ratings yet
TCS Format & Experience 1
3 pages
PL SQL Exercise by Unsw
No ratings yet
PL SQL Exercise by Unsw
5 pages
Business Intelligence DW
No ratings yet
Business Intelligence DW
17 pages

Hive Tutorial For Beginners: Learn With Examples in 3 Days

Uploaded by

Hive Tutorial For Beginners: Learn With Examples in 3 Days

Uploaded by

Hive Tutorial for Beginners: Learn with

Hive Tutorial Summary

What should I know?

Hive Course Syllabus

👉 Lesson 3 HIVE Metastore Configuration — Why to Use MySQL?

👉 Lesson 4 Hive Data Types — Create & Drop Database in Hive

👉 Lesson 2 Hive Partitions & Buckets — Learn with Example

👉 Lesson 3 Hive Indexes and View — Learn with Example

👉 Lesson 4 Hive Queries — Learn with Example

👉 Lesson 6 Hive Query Language Tutorial — Built-in Operators

👉 Lesson 7 Hive Function — Built-in & User Defined Functions

👉 Lesson 8 Hive ETL — Loading JSON, XML, Text Data Examples

Hive engine compiles these queries into Map-Reduce jobs to be executed on

Hive Vs Map Reduce

Hive Vs Map Reduce?

 It compiles language with two main task

Less number of lines code required for

You might also like