0% found this document useful (0 votes)
22 views17 pages

BIG DATA With HBASE

The document provides an overview of HBase, a non-relational, column-oriented database management system derived from Google's Big Table, ideal for managing large amounts of sparse data. It discusses when to use HBase, its applications, and the importance of data management practices within organizations. Additionally, it highlights the benefits of effective data management, including improved compliance, enhanced customer experience, and scalability.

Uploaded by

OBELINK DYING
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
22 views17 pages

BIG DATA With HBASE

The document provides an overview of HBase, a non-relational, column-oriented database management system derived from Google's Big Table, ideal for managing large amounts of sparse data. It discusses when to use HBase, its applications, and the importance of data management practices within organizations. Additionally, it highlights the benefits of effective data management, including improved compliance, enhanced customer experience, and scalability.

Uploaded by

OBELINK DYING
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 17

HÔM

B I G D A T A
TRÌNH

M A N A G E M E N T
CHƯƠNG

HCM
TP
/

W I T H

KHOA
HÔM NAY
NAY

BACH
H A B S E
LỚP

HOC
VỚI

DAI
ĐẾN

NGUYEN PHAN DUY


MỪNG

MINH
TRAN KHOI
CHÀO

1/15
C H
CONTENT
HBASE

HCM
• W H AT ’ S H B A S E ? 01
HBASE

TP
02
• WHEN TO USE HBASE ?

KHOA
03

À
WITH

04
• DATA MANAGEMENT

BACH
05
DATA

06
D E M O N S T R AT E H B A S E

HOC
O
BIG

DAI
• HOW TO HBASE ?

2/15
NGẮN

W H AT ’ S H B A S E ?

HCM
THIỆU

DEFINITION

TP
KHOA
GIỚI
/

A non-relational column-oriented database

BACH
NAY

management system derived from


HÔM

HOC
Google’s NoSQL database Big Table that
ĐỀ

DAI
run on top of HDFS
CHỦ

3/15
HIỂU VỀ HBASE SÂU SẮC HƠN
HƠN

HCM
SẮC

TP
KHOA
SÂU

Open-source that NoSQL database Well-suited for


HBASE

BACH
can horizontal written in JAVA which sparse data sets

HOC
scalable perfrom faster
VỀ
HIỂU

querying

DAI
Có khả năng chịu lỗi
Có khả năng mở Nhanh
rộng

4/15
HBAS RDB
E MS

• Does not have a fixed • Have a fixed schema


schema, define only • Work well with structured
columns data
• Work well with structured • Can only store normalized
data and semi-structured data
data • Built for thin table that
• Can have de-normalized can’t expand
data
• Built for large table that
can scale horizontally
HABSE

8/17
HBASE ?
WHEN TO USE

DAI HOC BACH KHOA TP HCM


W H AT ’ S H B A S E G O O D AT ?

HCM
TP
VIỆC

KHOA
LÀM

BACH
Serving a large Fast random access Write-heavy
BẢNG

amount of data application

HOC
and append-writing

DAI
(insert/overwrite)
rather than heavy
read-modify
operations

9/17
Favor consistency over
availability

HCM
TP
KHOA
Part of Hadoop system
HBASE

WHEN

BACH
HOC
DAI
Great community

10/17
A P P L I C AT I O N S

HCM
TP
KHOA
DỤ

BACH

HOC
DAI
Ví dụ Ví dụ Ví dụ
Lưu trữ hồ sơ bệnh Ngần hàng lưu trữ Lưu trữ hồ sơ, thông
án hồ sơ khách hàng tin, của e com
Lưu trữ giải trình tự
gen
11/17
NGẮN

D ATA

HCM
THIỆU

MANAGEMENT

TP
KHOA
GIỚI

Data management is the practice


/

BACH
NAY

of ingesting, processing, securing


HÔM

HOC
and storing an organization’s
ĐỀ

DAI
data, where it is then utilized for
CHỦ

strategic decision-making to
improve business outcomes.

12/17
T Y P E O F D ATA M A N A G E M E N T

HCM
NAY

TP
HÔM

KHOA
CỦA

BACH
TẮT

HOC
DATA DATA
TÓM

DATA STORAGE DATA SECURITY


PROCESSING GOVERNANCE

DAI
• What it is: The stage where raw data • What it is: How data is organized • What it is: The framework of standards • What it is: The practice of protecting
from various sources is collected, and stored for later retrieval and and processes that ensure effective data from unauthorized access,
transformed, and prepared for use in usage. and responsible data use throughout corruption, or theft.
• Key Types: • Why it's important: Data breaches can
analysis or other applications. an organization.
• How it works: ⚬ Data Warehouses: Structured • Key Focus Areas: be costly to organizations, both
⚬ Data is ingested from sources like storage for specific analysis ⚬ Data quality and consistency financially and in terms of reputation.
⚬ Data access and authorization • Key Techniques:
web APIs, apps, IoT devices, etc. needs as defined by business
⚬ It's processed via techniques like ETL ⚬ Data usability (including metadata ⚬ Encryption: Scrambling data to
users and data engineers.
(extract, transform, load) or ELT ⚬ Data Lakes: Repositories and data catalogs for accessibility) make it unreadable without a key.
⚬ Data security ⚬ Data Masking: Obscuring sensitive
(extract, load, transform) allowing both structured and
⚬ Filtering, merging, or aggregating unstructured data, often used data for testing or development
data happens to fit the intended for data science projects. 1 3/17 purposes.
analysis purpose.
B E N E F I T S O F D ATA M A N A G E M E N T

HCM
NAY

TP
HÔM

KHOA
CỦA

BACH
TẮT

HOC
TÓM

IMPROVED
ENHANCED

DAI
REDUCE COMPLIAN
D DATA CUSTOMER SCALABILIT
CE AND
SILOS EXPERIENC Y
SECURITY
E

14/17
MỌI THỨ RÕ RÀNG CHỨ?

15/17
tion
demonstra

DAI HOC BACH KHOA TP HCM


HCM
CHỨ?

TP
RÀNG

KHOA
MỌI THỨ RÕ RÀNG CHỨ?
Trước khi kết thúc, hãy thảo luận thoải mái và cởi mở

BACH
nếu có câu hỏi hay cần làm rõ vấn đề.
THỨ

HOC
MỌI

DAI
BẠN CÓ CÂU
HỎI?

16/17
B

HCM
Y

TP
T H A N K S F O R YO U R

KHOA
AT T E N T I O N
ENDING

BACH
HOC
E

DAI
17/17

You might also like