Unit 5 HBase
Agenda
• HBase concepts
• HBase vs RDBMS
• Advanced usage
• Schema design
• Advanced indexing
• Zookeeper
Introduction
• HBase is a data store whose data model is similar to Google's Bigtable. It is
designed to provide quick random access to huge amounts of structured data.
Limitations of Hadoop
• Hadoop supports only batch processing, and data is accessed only
sequentially. Even the simplest job therefore has to scan the entire
dataset.
• A different solution is needed to reach any single item of data in a
single unit of time (random access).
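The difference between sequential and random access can be sketched in a few lines of Python. This is only an illustration under assumed names: `records`, `sequential_lookup`, and `random_lookup` are hypothetical and are not part of Hadoop or HBase. The in-memory list stands in for a dataset read by a batch job, and the dictionary stands in for keyed access.

```python
from typing import Optional, Tuple

# Hypothetical dataset: (row_key, value) pairs in the order a batch job would read them.
records = [(f"row{i:05d}", f"value-{i}") for i in range(10_000)]


def sequential_lookup(key: str) -> Tuple[Optional[str], int]:
    """Read records in order until the key is found.

    Returns the value (or None) and the number of records that were read.
    """
    for reads, (row_key, value) in enumerate(records, start=1):
        if row_key == key:
            return value, reads
    return None, len(records)


# Keyed index over the same data: a lookup does not depend on the record's position.
index = dict(records)


def random_lookup(key: str) -> Optional[str]:
    """Fetch a value directly by its key."""
    return index.get(key)


if __name__ == "__main__":
    value, reads = sequential_lookup("row09999")
    print(f"sequential: {value} after reading {reads} records")
    print(f"random:     {random_lookup('row09999')} with one keyed lookup")
```

Looking up the last record sequentially reads every record in the list, while the keyed lookup goes straight to it.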
What is HBase?
Storage Mechanism in HBase
• HBase is a column-oriented database, and the rows of each table are
sorted by row key.
• The table schema defines only column families; the columns within a
family are stored as key-value pairs.
• A table can have multiple column families, and each column family can
have any number of columns.
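The storage layout described above can be pictured as nested maps: row key, then column family, then column, then value. The toy Python class below is a sketch of that logical model only, not of how HBase stores data on disk. The class `Table`, its methods, the family name `professional data`, and the sample value `ravi` are assumptions made for illustration.

```python
class Table:
    """Toy model of the logical layout: row key -> column family -> column -> value."""

    def __init__(self, name, families):
        self.name = name
        # Only the column families are fixed by the schema.
        self.families = set(families)
        self.rows = {}

    def put(self, row, family, column, value):
        """Store one cell; any column name is allowed inside a known family."""
        if family not in self.families:
            raise KeyError(f"unknown column family: {family}")
        self.rows.setdefault(row, {}).setdefault(family, {})[column] = value

    def get(self, row):
        """Return all cells of one row, grouped by column family."""
        return self.rows.get(row, {})

    def scan(self):
        """Yield rows in row-key order (plain string order in this sketch)."""
        for row in sorted(self.rows):
            yield row, self.rows[row]


if __name__ == "__main__":
    emp = Table("emp", ["personal data", "professional data"])
    emp.put("2", "personal data", "name", "ravi")
    emp.put("1", "personal data", "name", "raju")
    emp.put("1", "personal data", "city", "hyderabad")
    for row_key, cells in emp.scan():
        print(row_key, cells)
```

Rows come back in key order regardless of insertion order, and new columns can be added to a family without changing the schema.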
HBase Vs RDBMS
Features of HBase
Where to Use HBase
HBase Table Schema Design
Inserting Data using HBase Shell
• hbase(main):005:0> put 'emp','1','personal data:name','raju'
• hbase(main):006:0> put 'emp','1','personal data:city','hyderabad'
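Each `put` names a table, a row key, a column written as `family:column`, and a value. The Python sketch below breaks the two commands into those parts and applies them to an in-memory dictionary, to show that both writes land in the same row. It does not talk to HBase; `parse_put`, `apply_put`, and `store` are hypothetical helpers, and the parser handles only the simple four-argument form used here.

```python
import ast


def parse_put(command):
    """Split a four-argument shell-style put command into its parts (illustrative only)."""
    verb, _, args = command.strip().partition(" ")
    if verb != "put":
        raise ValueError("not a put command")
    # The quoted, comma-separated arguments happen to be valid Python literals.
    parts = ast.literal_eval(f"({args},)")
    if len(parts) != 4:
        raise ValueError("expected table, row, column, value")
    table, row, column, value = parts
    # The column argument is 'family:column'; split on the first colon.
    family, _, qualifier = column.partition(":")
    return {"table": table, "row": row, "family": family,
            "qualifier": qualifier, "value": value}


def apply_put(store, command):
    """Apply one parsed put to a nested dict: table -> row -> family -> column."""
    p = parse_put(command)
    row = store.setdefault(p["table"], {}).setdefault(p["row"], {})
    row.setdefault(p["family"], {})[p["qualifier"]] = p["value"]


if __name__ == "__main__":
    store = {}
    apply_put(store, "put 'emp','1','personal data:name','raju'")
    apply_put(store, "put 'emp','1','personal data:city','hyderabad'")
    print(store["emp"]["1"])
```

After both commands, row `1` of `emp` holds two columns, `name` and `city`, under the single family `personal data`.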
Q & A Time
We have 10 Minutes for Q&A