BIG DATA MANAGEMENT WITH HBASE
DAI HOC BACH KHOA TP HCM
MINH
TRAN KHOI
Welcome to today's class
CONTENT
• What's HBase?
• When to use HBase?
• Data management
• Demonstrate HBase
• How to HBase?
WHAT'S HBASE?
A brief introduction to today's topic
DEFINITION
HBase is an open-source NoSQL database, modeled on Google's Bigtable, that runs on top of HDFS.
UNDERSTANDING HBASE IN MORE DEPTH
• Scalable: scales horizontally
• Fast: written in Java, with fast querying
• Fault tolerant
• Handles sparse data sets
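To make "sparse data sets" concrete, here is a minimal sketch of a sparse, sorted key-value table. It is a toy in-memory model written for this slide, not the HBase client API: the class `SparseTable`, its methods, and the sample row keys are all hypothetical. It only illustrates two ideas: a cell that was never written takes no space, and rows kept in key order can be scanned by key range.

```python
from bisect import bisect_left, insort


class SparseTable:
    """Toy in-memory table: row key -> {"family:qualifier": value}."""

    def __init__(self):
        self._rows = {}          # row key -> dict holding only the cells that were written
        self._sorted_keys = []   # row keys kept in sorted order for range scans

    def put(self, row_key, column, value):
        if row_key not in self._rows:
            self._rows[row_key] = {}
            insort(self._sorted_keys, row_key)
        self._rows[row_key][column] = value

    def get(self, row_key, column):
        # A missing cell is simply absent from the dict; nothing is stored for it.
        return self._rows.get(row_key, {}).get(column)

    def scan(self, start_key, stop_key):
        """Yield (row_key, cells) for start_key <= row_key < stop_key."""
        i = bisect_left(self._sorted_keys, start_key)
        while i < len(self._sorted_keys) and self._sorted_keys[i] < stop_key:
            key = self._sorted_keys[i]
            yield key, dict(self._rows[key])
            i += 1


table = SparseTable()
table.put("patient#001", "info:name", "An")
table.put("patient#001", "lab:glucose", "5.4")
table.put("patient#002", "info:name", "Binh")    # no lab result stored for this row
table.put("visitor#001", "info:name", "Chi")

# Range scan over every row whose key starts with "patient#".
patients = [key for key, _ in table.scan("patient#", "patient#~")]
```

Each row stores only the columns it actually has, so the second patient costs nothing for the missing lab result, and the scan touches only the requested key range.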
HBASE vs. RDBMS
WHEN TO USE HBASE?
• Serving large amounts of data
• Fast random access
• Write-heavy workloads that mostly append (insert/overwrite) rather than perform heavy read-modify operations
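The difference between append-style writes and read-modify operations can be sketched as follows. This is a toy model under stated assumptions, not HBase code: the class `VersionedCell` and the integer counter used as a timestamp are hypothetical. It shows the general idea that a write only appends a new timestamped version, and a read returns the newest one, so a later write behaves like an overwrite without ever reading the old value.

```python
import itertools


class VersionedCell:
    """Toy cell that keeps every written version instead of editing in place."""

    def __init__(self):
        self._versions = []                # append-only list of (timestamp, value)
        self._clock = itertools.count(1)   # stand-in for a write timestamp

    def put(self, value):
        # A write never reads the previous value first; it only appends.
        self._versions.append((next(self._clock), value))

    def get(self):
        # A read returns the newest version, so a later put acts as an overwrite.
        return self._versions[-1][1] if self._versions else None

    def history(self):
        return list(self._versions)


cell = VersionedCell()
cell.put("pending")
cell.put("shipped")
latest = cell.get()
```

A workload made of such puts stays cheap as it grows, whereas a counter-style update (read the value, change it, write it back) needs an extra read on every write.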
• Favors consistency over availability
• Part of the Hadoop ecosystem
• Great community
APPLICATIONS
Examples:
• Storing medical records
• Storing gene sequencing data
• Banks storing customer records
• Storing records and information for e-commerce
DATA MANAGEMENT
A brief introduction to today's topic
Data management is the practice of collecting, organizing, and storing an organization's data, which is then used for strategic decision-making to improve business outcomes.
TYPES OF DATA MANAGEMENT
Today's summary

DATA PROCESSING
• What it is: The stage where raw data from various sources is collected, transformed, and prepared for use in analysis or other applications.
• How it works:
⚬ Data is ingested from sources like web APIs, apps, IoT devices, etc.
⚬ It's processed via techniques like ETL (extract, transform, load) or ELT (extract, load, transform).
⚬ Filtering, merging, or aggregating data happens to fit the intended analysis purpose.

DATA STORAGE
• What it is: How data is organized and stored for later retrieval and usage.
• Key Types:
⚬ Data Warehouses: Structured storage for specific analysis needs as defined by business users and data engineers.
⚬ Data Lakes: Repositories allowing both structured and unstructured data, often used for data science projects.

DATA GOVERNANCE
• What it is: The framework of standards and processes that ensure effective and responsible data use throughout an organization.
• Key Focus Areas:
⚬ Data quality and consistency
⚬ Data access and authorization
⚬ Data usability (including metadata and data catalogs for accessibility)
⚬ Data security

DATA SECURITY
• What it is: The practice of protecting data from unauthorized access, corruption, or theft.
• Why it's important: Data breaches can be costly to organizations, both financially and in terms of reputation.
• Key Techniques:
⚬ Encryption: Scrambling data to make it unreadable without a key.
⚬ Data Masking: Obscuring sensitive data for testing or development purposes.
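The ETL flow described on this slide (ingest, then filter and aggregate, then store) can be sketched with the Python standard library alone. Everything named here is an assumption made for illustration: the sample `RAW_ORDERS` data, the `customer_totals` table, and the three helper functions. An in-memory SQLite database stands in for the analysis store.

```python
import csv
import io
import sqlite3

# Hypothetical raw input, standing in for data ingested from an API or an app.
RAW_ORDERS = """customer,amount,status
alice,120.50,paid
bob,80.00,cancelled
alice,30.00,paid
carol,45.25,paid
"""


def extract(text):
    """Extract: parse raw CSV text into dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: drop unpaid orders, then aggregate totals per customer."""
    totals = {}
    for row in rows:
        if row["status"] != "paid":
            continue
        totals[row["customer"]] = totals.get(row["customer"], 0.0) + float(row["amount"])
    return sorted(totals.items())


def load(conn, totals):
    """Load: write the prepared rows into the analysis table."""
    conn.execute("CREATE TABLE customer_totals (customer TEXT PRIMARY KEY, total REAL)")
    conn.executemany("INSERT INTO customer_totals VALUES (?, ?)", totals)
    conn.commit()


conn = sqlite3.connect(":memory:")
load(conn, transform(extract(RAW_ORDERS)))
result = conn.execute(
    "SELECT customer, total FROM customer_totals ORDER BY customer"
).fetchall()
```

An ELT variant would load the raw rows into the store first and run the filtering and aggregation there, for example as a SQL query over the raw table.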
BENEFITS OF DATA MANAGEMENT
Today's summary
• REDUCED DATA SILOS
• IMPROVED COMPLIANCE AND SECURITY
• ENHANCED CUSTOMER EXPERIENCE
• SCALABILITY
DEMONSTRATION
IS EVERYTHING CLEAR?
Before we finish, feel free to discuss openly if you have any questions or need anything clarified.
DO YOU HAVE ANY QUESTIONS?
THANKS FOR YOUR ATTENTION
BYE