Hbase

Uploaded by

Being Gamer

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

19 views23 pages

Hbase

Uploaded by

Being Gamer

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 23

HBASE

Dr. Shivangi Shukla

Assistant Professor
Computer Science and Engineering
IIIT Pune
Contents
• HBase
• RDBMS Versus HBase
• Advantages
• Disadvantages
• Architecture
• Features

HBase 2
HBase
• HBase is a distributed column-oriented database built
on top of HDFS.
• HBase is the Hadoop application to use
• when users require real-time read/write random-
access
• to very large datasets.
• HBase can store massive amounts of data from
terabytes to petabytes.
• It is column oriented and horizontally scalable.

HBase 3
HBase..
• There are numerous strategies and implementations
available for database storage and retrieval,
• however, they are not designed for large and
distributed datasets.
• HBase is able to host very large, sparsely populated
tables on different clusters made from commodity
hardware.
• Unlike relational database systems,
• HBase does not support a structured query
language like SQL;
• hence, HBase does not store relational data.
HBase 4
Difference between HBase and
RDBMS
RDBMS HBase
RDBMS is mostly row oriented HBase is column oriented
RDBMS has fixed schema HBase facilitates addition of
columns during run time.
RDBMS is suitable for structured HBase is suitable for structured as
data well as semi-structured data
RDBMS is optimized for joins HBase is not optimized for joins
RDBMS can store only normalized HBase can have denormalized
data data
• HBase is a NoSQL, column-oriented database that is
built on top of the Hadoop ecosystem.
• It is designed to provide low-latency, high-throughput
5
access to large-scale, distributed datasets.
HBase
• HBase applications are written in Java
• HBase does support writing applications in Apache
Hadoop Project
• such as in Avro, REST and Thrift.
• HBase system is designed to scale linearly.
• HBase comprises a set of standard tables with rows and
columns, much like a traditional database.
• Each table must have an element defined as a
primary key, and all access attempts to HBase tables
must use this primary key.

HBase 6
Applications of HBase
1. Real-time Analytics
• HBase is helpful in applications that focus on real-
time analytics
• as it provides low-latency data access.
• It provides fast read and write performance and
• can handle large amounts of data, making it
suitable for real-time data analysis.

HBase 7
Applications of HBase
2. Social Media Applications
• HBase is an ideal database for social media
applications
• that require high scalability and performance.
• It can handle the large volume of data generated
by social media platforms
• and provide real-time analytics capabilities.
3. IoT Applications
• HBase is scalable in nature and provides fast write
performance,
• It is helpful in IoT applications that require low-
latency data processing. 8
HBase
Applications of HBase
4. Online Transaction Processing (OLTP)
• HBase can be used as an OLTP database, providing
high availability, consistency, and low-latency data
access.
• HBase’s distributed architecture and automatic
failover capabilities make it a good fit for OLTP
applications that require high availability.

HBase 9
Applications of HBase
5. Ad serving and clickstream analysis
• HBase can be used to store and process large
volumes of clickstream data for ad serving and
clickstream analysis.
• HBase’s column-oriented data storage and indexing
capabilities make it a good fit for these types of
applications.

HBase 10
Advantages of HBase
1. Scalability
• HBase can handle extremely large datasets that can
be distributed across a cluster of machines.
• It is designed to scale horizontally by adding more
nodes to the cluster, which allows it to handle
increasingly larger amounts of data.
2. High-performance
• HBase is optimized for low-latency, high-throughput
access to data.
• It uses a distributed architecture that allows it to
process large amounts of data in parallel, which can
result in faster query response times. 11
HBase
Advantages of HBase..
3. Flexible Data Model
• HBase’s column-oriented data model allows for
flexible schema design and supports sparse
datasets.
• This can make it easier to work with data that has a
variable or evolving schema.
4. Fault Tolerance
• HBase is designed to be fault-tolerant by replicating
data across multiple nodes in the cluster.
• This helps ensure that data is not lost in the event of
a hardware or network failure.
HBase 12
Disadvantages of HBase
1. Complexity
• HBase can be complex to set up and manage.
• It requires knowledge of the Hadoop ecosystem and
distributed systems concepts, which can be
challenging for some users.
2. Limited Query Language
• HBase’s query language, HBase Shell, is not as
feature-rich as SQL.
• This can make it difficult to perform complex
queries and analyses.

HBase 13
Disadvantages of HBase..
3. No support for transactions
• HBase does not support transactions, which can
make it difficult to maintain data consistency in
some use cases.
4. Not suitable for all use cases
• HBase is best suited for use cases where high
throughput and low-latency access to large datasets
is required.
• It may not be the best choice for applications that
require real-time processing or strong consistency
guarantees.

HBase 14
Architecture of HBase
• HBase architecture comprises of three main
components:
• HMaster,
• Region Server
• Zookeeper

HBase 15
Architecture of HBase
• HMaster
• The implementation of Master Server in HBase is
HMaster.
• It is a process in which regions are assigned to
region server as well as DDL (create, delete table)
operations.
• It monitor all Region Server instances present in the
cluster.
• In a distributed environment, Master runs several
background threads.
• HMaster has many features like controlling load
balancing, failover etc.

HBase 16
Architecture of HBase..
• Region Server
• HBase Tables are divided horizontally by row key range
into Regions.
• Regions are the basic building elements of HBase cluster
• that consists of the distribution of tables and are
comprised of Column families.
• Region Server runs on HDFS DataNode which is present
in Hadoop cluster.
• Regions of Region Server are responsible for several
things,
• like handling, managing, executing as well as reads
and writes HBase operations on that set of regions.
• The default size of a region is 256 MB.
17
Architecture of HBase..
• Zookeeper
• It is like a coordinator in HBase.
• It provides services
• like maintaining configuration information,
naming, providing distributed synchronization,
server failure notification etc.
• Clients communicate with region servers via
zookeeper.

HBase 18
Architecture of HBase

HBase 19
Features of HBase Architecture
• Distributed and Scalable
• HBase is designed to be distributed and scalable,
which means it can handle large datasets and can
scale out horizontally by adding more nodes to the
cluster.
• Column-oriented Storage
• HBase stores data in a column-oriented manner,
which means data is organized by columns rather
than rows.
• This allows for efficient data retrieval and
aggregation.
HBase 20
Features of HBase Architecture..
• Hadoop Integration
• HBase is built on top of HDFS, which means it can
leverage HDFS for storage and MapReduce for data
processing.
• Consistency and Replication
• HBase provides strong consistency guarantees for
read and write operations,
• and supports replication of data across multiple
nodes for fault tolerance.

HBase 21
Features of HBase Architecture..
• Built-in Caching
• HBase has a built-in caching mechanism that can
cache frequently accessed data in memory, which
can improve query performance.
• Compression
• HBase supports compression of data, which can
reduce storage requirements and improve query
performance.
• Flexible Schema
• HBase supports flexible schemas,
• which means the schema can be updated on the fly
without requiring a database schema migration. 22
HBase
Difference between HBase and
HDFS
HBase HDFS
HBase provides low latency HDFS provides high latency
access operations
HBase supports random HDFS supports Write once
read and writes Read Many times
HBase is accessed through HDFS is accessed through
shell commands, Java API, MapReduce jobs
REST, Avro or Thrift API

HBase 23

Topics in Abstract Algebra Herstein Solutions
No ratings yet
Topics in Abstract Algebra Herstein Solutions
93 pages
(Lynn E. Roller) in Search of God The Mother The (BookFi) PDF
100% (4)
(Lynn E. Roller) in Search of God The Mother The (BookFi) PDF
401 pages
LUNA, Luis Eduardo - Functions of The Magic Melodies or Icaros PDF
100% (1)
LUNA, Luis Eduardo - Functions of The Magic Melodies or Icaros PDF
23 pages
Reading: Based On Tricia Hedge's "Teaching and Learning in The Language Classroom"
100% (1)
Reading: Based On Tricia Hedge's "Teaching and Learning in The Language Classroom"
22 pages
Unit 5 BDA
No ratings yet
Unit 5 BDA
34 pages
Big Data Analytics Unit-5
No ratings yet
Big Data Analytics Unit-5
28 pages
I Am Discourses 01 ST Germain
100% (5)
I Am Discourses 01 ST Germain
7 pages
BDA Unit 5 HIVE HBASE
No ratings yet
BDA Unit 5 HIVE HBASE
33 pages
Bda - Unit 5
No ratings yet
Bda - Unit 5
30 pages
HBase - Tutorial
No ratings yet
HBase - Tutorial
14 pages
Where The Forest Meets The Sea Sample Lesson Plan
100% (2)
Where The Forest Meets The Sea Sample Lesson Plan
29 pages
Part 3
No ratings yet
Part 3
113 pages
Adverbial Clauses
100% (1)
Adverbial Clauses
20 pages
Hbase
No ratings yet
Hbase
6 pages
Unit 5
No ratings yet
Unit 5
10 pages
Bigdata-Chap3 Notes
No ratings yet
Bigdata-Chap3 Notes
11 pages
10 NoSQL Databases - HBase Hive Cassandra
No ratings yet
10 NoSQL Databases - HBase Hive Cassandra
74 pages
Android Securecoding en
No ratings yet
Android Securecoding en
9 pages
DBMS Unit3
No ratings yet
DBMS Unit3
28 pages
Apache HBase
No ratings yet
Apache HBase
12 pages
Cse 17CS82 M2 S4 PPT
No ratings yet
Cse 17CS82 M2 S4 PPT
19 pages
Unit 5 Hbase
No ratings yet
Unit 5 Hbase
15 pages
Module 05 HBase - Distributed NoSQL Database
No ratings yet
Module 05 HBase - Distributed NoSQL Database
54 pages
10 HBase
No ratings yet
10 HBase
13 pages
BDA1
No ratings yet
BDA1
42 pages
Hbase
No ratings yet
Hbase
15 pages
HBase
No ratings yet
HBase
27 pages
4 4HBase
No ratings yet
4 4HBase
17 pages
HBASE
No ratings yet
HBASE
11 pages
Big Data 22MSM40206
No ratings yet
Big Data 22MSM40206
9 pages
Big Data Unit 5
No ratings yet
Big Data Unit 5
18 pages
Hadoop Week 6
No ratings yet
Hadoop Week 6
38 pages
Unit-5 Notes
No ratings yet
Unit-5 Notes
61 pages
HBase
No ratings yet
HBase
30 pages
BDT Unit - V
No ratings yet
BDT Unit - V
15 pages
Hadoop HBASE
No ratings yet
Hadoop HBASE
71 pages
Big Data UNIT 5 Own
No ratings yet
Big Data UNIT 5 Own
18 pages
Unit III - Full
No ratings yet
Unit III - Full
31 pages
Hbase - in Detail: Pushpinder Singh Paxcel Technologies
No ratings yet
Hbase - in Detail: Pushpinder Singh Paxcel Technologies
32 pages
Unit 3 Hbase, Mongodb and Couch DB
No ratings yet
Unit 3 Hbase, Mongodb and Couch DB
12 pages
Conditionals: A) - Not Possible
100% (1)
Conditionals: A) - Not Possible
2 pages
HBase
No ratings yet
HBase
6 pages
BDA Unit-5
No ratings yet
BDA Unit-5
31 pages
Unit - 5 Part - 1
No ratings yet
Unit - 5 Part - 1
8 pages
Hbase What Is Hbase?
No ratings yet
Hbase What Is Hbase?
2 pages
Chapter 12 HBase
No ratings yet
Chapter 12 HBase
108 pages
Unit 5 Big Data
No ratings yet
Unit 5 Big Data
34 pages
HBASE
No ratings yet
HBASE
35 pages
Unit V Hadoop Related Tools
No ratings yet
Unit V Hadoop Related Tools
54 pages
HBASE
No ratings yet
HBASE
18 pages
BDA Unit 5
No ratings yet
BDA Unit 5
33 pages
Hbase - Quick Guide Hbase - Overview
No ratings yet
Hbase - Quick Guide Hbase - Overview
53 pages
HBase
No ratings yet
HBase
31 pages
Hbase: Q) What Is Hbase ?
No ratings yet
Hbase: Q) What Is Hbase ?
15 pages
Wa0005.
No ratings yet
Wa0005.
53 pages
Unit - IV - Notes
No ratings yet
Unit - IV - Notes
23 pages
Unit 5 Lecture No-3 (Hbase)
No ratings yet
Unit 5 Lecture No-3 (Hbase)
35 pages
HBase
No ratings yet
HBase
4 pages
2 Unit 5
No ratings yet
2 Unit 5
24 pages
NoteGPT - What Is HBase - HBase Architecture - HBase Tutorial For Beginners - Hadoop Tutorial - Simplilearn
No ratings yet
NoteGPT - What Is HBase - HBase Architecture - HBase Tutorial For Beginners - Hadoop Tutorial - Simplilearn
5 pages
Unit 5 Lecture No-3 (Hbase)
No ratings yet
Unit 5 Lecture No-3 (Hbase)
35 pages
HBase
No ratings yet
HBase
12 pages
Unit 5 Bda
No ratings yet
Unit 5 Bda
42 pages
HBase Architecture
No ratings yet
HBase Architecture
1 page
HBase Presentation
No ratings yet
HBase Presentation
23 pages
Large-Scale Data Management: Hbase
No ratings yet
Large-Scale Data Management: Hbase
36 pages
HBase
No ratings yet
HBase
14 pages
4.5 Hbase
No ratings yet
4.5 Hbase
27 pages
Plants Poem Analysis
67% (3)
Plants Poem Analysis
2 pages
Hindu Law Notes and Study Material
No ratings yet
Hindu Law Notes and Study Material
17 pages
Swetha Ashok
No ratings yet
Swetha Ashok
5 pages
Voice
No ratings yet
Voice
15 pages
Linux VI and Vim Editor: Tutorial and Advanced Features
No ratings yet
Linux VI and Vim Editor: Tutorial and Advanced Features
17 pages
2023 JHS Scheme of Learning
No ratings yet
2023 JHS Scheme of Learning
37 pages
Ops English Lang. Module
No ratings yet
Ops English Lang. Module
12 pages
ASSURE Lesson Plan
No ratings yet
ASSURE Lesson Plan
4 pages
Unit - V
No ratings yet
Unit - V
90 pages
Hi! No Doubt You Know Me. Yes, Yes I Am William: Shakespeare!
No ratings yet
Hi! No Doubt You Know Me. Yes, Yes I Am William: Shakespeare!
16 pages
I Am From - Poetry
No ratings yet
I Am From - Poetry
6 pages
b2 First Handbook - Removed - Removed
No ratings yet
b2 First Handbook - Removed - Removed
1 page
CMake Lists
No ratings yet
CMake Lists
29 pages
CC Unit 4 Notes
No ratings yet
CC Unit 4 Notes
10 pages
TIB Bwpluginftl 6.7.1 User-Guide
No ratings yet
TIB Bwpluginftl 6.7.1 User-Guide
55 pages
Q3W7
No ratings yet
Q3W7
12 pages
Top 50 SQL Server Interview Question
No ratings yet
Top 50 SQL Server Interview Question
15 pages
AMAPOLA (LAbM)
No ratings yet
AMAPOLA (LAbM)
3 pages
C++ - Convert STD - Bind To Function Pointer - Stack Overflow
No ratings yet
C++ - Convert STD - Bind To Function Pointer - Stack Overflow
7 pages
Test 1 1. Mark The Letter A, B, C or D To Indicate The Underlined Part That Needs Correction in Each of The Following Questions
No ratings yet
Test 1 1. Mark The Letter A, B, C or D To Indicate The Underlined Part That Needs Correction in Each of The Following Questions
7 pages
Downshifting Essay
No ratings yet
Downshifting Essay
1 page
Learn Hbase in 24 Hours
From Everand
Learn Hbase in 24 Hours
Alex Nordeen
No ratings yet

Hbase

Uploaded by

Hbase

Uploaded by

HBASE

Dr. Shivangi Shukla

You might also like