0% found this document useful (0 votes)
72 views1 page

Hive-Hands On - Bucketing Table

The document creates a Hive database called hive_bucket and a staging table called temp to load transaction data from a CSV file. It then creates a partitioned and clustered table called transaction_bucket to bucket the data by customer ID. Data is inserted from temp into transaction_bucket. An additional table, bucket1, is then created as a bucketed ORC table with transactional support enabled, allowing rows to be inserted and updated.

Uploaded by

Story Telling
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as TXT or PDF, or read online on Scribd
0% found this document useful (0 votes)
72 views1 page

Hive-Hands On - Bucketing Table

The document creates a Hive database called hive_bucket and a staging table called temp to load transaction data from a CSV file. It then creates a partitioned and clustered table called transaction_bucket to bucket the data by customer ID. Data is inserted from temp into transaction_bucket. An additional table, bucket1, is then created as a bucketed ORC table with transactional support enabled, allowing rows to be inserted and updated.

Uploaded by

Story Telling
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as TXT or PDF, or read online on Scribd
You are on page 1/ 1

-- Create a dedicated database for the bucketing exercise and switch to it.
-- IF NOT EXISTS makes the script safe to re-run (the original failed on a
-- second run because the database already existed).
CREATE DATABASE IF NOT EXISTS hive_bucket;

USE hive_bucket;

-- Staging table for raw transaction rows read from a comma-delimited file.
-- "skip.header.line.count" = "1" drops the CSV header line on read.
CREATE TABLE temp (
    transaction_id   STRING,
    cust_id          INT,
    tran_date        STRING,
    prod_subcat_code INT,
    prod_cat_code    INT,
    Qty              INT,
    Rate             INT,
    Tax              DOUBLE,
    total_amt        DOUBLE,
    Store_type       STRING
)
ROW FORMAT DELIMITED
FIELDS TERMINATED BY ','
LINES TERMINATED BY '\n'
TBLPROPERTIES ("skip.header.line.count" = "1");

-- Load the source CSV from the local filesystem into the staging table.
LOAD DATA LOCAL INPATH '/projects/challenge/Transactions.csv' INTO TABLE temp;

-- Show column headers in CLI query output.
set hive.cli.print.header=true;
-- Enable dynamic partition inserts (partition values taken from the data).
set hive.exec.dynamic.partition=true;
-- nonstrict: allow inserts where ALL partition columns are dynamic.
set hive.exec.dynamic.partition.mode=nonstrict;
-- NOTE(review): relaxes strict-mode query restrictions; the documented key
-- is hive.mapred.mode (strict/nonstrict) -- confirm this is the intended one.
set hive.mapred.mode=nonstrict;

-- Bucketed, partitioned copy of the staging data:
-- one partition per Store_type, rows hashed into 3 buckets on cust_id.
CREATE TABLE transaction_bucket (
    transaction_id STRING,
    cust_id        INT,
    tran_date      STRING,
    Qty            INT,
    Rate           INT,
    Tax            DOUBLE,
    total_amt      DOUBLE
)
PARTITIONED BY (Store_type STRING)
CLUSTERED BY (cust_id) INTO 3 BUCKETS
ROW FORMAT DELIMITED
FIELDS TERMINATED BY ','
LINES TERMINATED BY '\n'
TBLPROPERTIES ("skip.header.line.count" = "1");

-- Populate the bucketed table from staging via dynamic partitioning;
-- the partition column (Store_type) must be selected LAST.
-- Only rows with a positive quantity are kept.
INSERT OVERWRITE TABLE transaction_bucket PARTITION (Store_type)
SELECT
    transaction_id,
    cust_id,
    tran_date,
    Qty,
    Rate,
    Tax,
    total_amt,
    Store_type
FROM temp
WHERE Qty > 0;

-- ACID-capable bucketed table (Hive transactional tables require ORC
-- storage plus bucketing).
-- Fixes vs. the original DDL:
--   * dropped ROW FORMAT DELIMITED ... -- field/line delimiters are ignored
--     for ORC, which is a self-describing binary format;
--   * dropped "skip.header.line.count" -- only meaningful for text files;
--   * "orc.compress" normalized to the documented value "ZLIB".
CREATE TABLE bucket1 (
    transaction_id STRING,
    cust_id        INT,
    tran_date      STRING,
    Qty            INT,
    Rate           INT,
    Tax            DOUBLE,
    total_amt      DOUBLE,
    Store_type     STRING
)
CLUSTERED BY (cust_id) INTO 3 BUCKETS
STORED AS ORC
TBLPROPERTIES ("orc.compress" = "ZLIB");

-- Turn on ACID support (INSERT/UPDATE/DELETE) for the table.
ALTER TABLE bucket1 SET TBLPROPERTIES ('transactional' = 'true');

-- Session settings required for Hive ACID (transactional) tables.
-- FIX: the transaction manager class is DbTxnManager -- the original value
-- "DBTxnManager" names a class that does not exist and fails at runtime.
set hive.txn.manager=org.apache.hadoop.hive.ql.lockmgr.DbTxnManager;
-- Lock-based concurrency is required by DbTxnManager.
set hive.support.concurrency=true;
-- Allow fully dynamic partition inserts (repeated from above; harmless).
set hive.exec.dynamic.partition.mode=nonstrict;
-- Run the compaction initiator with one worker thread on this instance so
-- delta files produced by ACID writes get compacted.
set hive.compactor.initiator.on=true;
set hive.compactor.worker.threads=1;

-- Insert return transactions (negative quantities and amounts) into the
-- ACID table, one tuple per line for readability.
INSERT INTO TABLE bucket1 VALUES
    ("80712190438", 270351, "28-02-2014", -5, -772, 405.3, -4265.3, "e-Shop"),
    ("29258453508", 270384, "27-02-2014", -5, -1497, 785.925, -8270.925, "e-Shop"),
    ("93274880719", 271509, "24-02-2014", -3, -1363, 429.345, -4518.345, "e-Shop"),
    ("97439039119", 272357, "23-02-2014", -2, -824, 173.04, -1821.04, "TeleShop"),
    ("45649838090", 273667, "22-04-2014", -1, -1450, 152.25, -1602.25, "e-Shop"),
    ("22643667930", 271489, "22-02-2014", -1, -1225, 128.625, -1353.625, "TeleShop"),
    ("79792372943", 275108, "22-02-2014", -3, -908, 286.02, -3010.02, "MBR"),
    ("50076728598", 269014, "21-02-2014", -4, -581, 244, -2568.02, "e-Shop");

-- ACID UPDATE (requires the transactional settings above): rewrite the
-- transaction date for every e-Shop row.
update bucket1 set tran_date = "25-02-2014" where Store_type = "e-Shop";

You might also like