Connect Hadoop Database by Using Hive in Python - Ting Yu
On the Hadoop platform, two scripting languages simplify the code: Pig, which has its own scripting syntax, and Hive, which looks like SQL. Hive is quite easy to use and ships with a set of extension functions (called user-defined functions, or UDFs) for transforming data, such as regular-expression tools. A developer can add user-defined functions by writing them in Java. Another way to add procedural logic that complements SQL's set-based language is to use a language like Python.
In this example, we use a Python module to access a database table. Hive is used to get the data, partition it, and send the rows to Python processes created on the different cluster nodes.
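The hand-off from Hive to Python described above is typically done with Hive's TRANSFORM clause, which streams each row to a script as a tab-separated line on stdin and reads the transformed row back from stdout. A minimal sketch of such a script follows; the column layout (card number, posting date, amount) and the masking rule are assumptions for illustration, not part of the original example:

```python
#!/usr/bin/env python
# Streaming script for Hive's TRANSFORM clause: Hive pipes each row to stdin
# as tab-separated fields and reads the transformed row back from stdout.
import sys

def transform(line):
    # Hypothetical column layout: card_number, post_date, amount.
    card, post_dt, amount = line.rstrip('\n').split('\t')
    # Mask the card number, keeping only the last four digits.
    masked = '*' * (len(card) - 4) + card[-4:]
    return '\t'.join([masked, post_dt, amount])

if __name__ == '__main__':
    for line in sys.stdin:
        sys.stdout.write(transform(line) + '\n')
```

In HiveQL this would be invoked roughly as `ADD FILE mask.py;` followed by `SELECT TRANSFORM(card, post_dt, amount) USING 'mask.py' AS (card, post_dt, amount) FROM ...`.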
In addition to the standard Python installation, a few libraries need to be installed to allow Python to build the connection to the Hadoop database.
3. Thrift, Python bindings for the Apache Thrift RPC system: https://pypi.python.org/pypi/thrift/0.9.1
All the libraries are installed in the folder ~/site-packages. The installation commands are below:
unzip pyhs2-master.zip
cd pyhs2-master
python setup.py install --user
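After installation, a quick check (not part of the original steps) confirms that the modules are importable from the user site-packages directory; the same check works for sasl and thrift:

```python
def check_module(name):
    # Return a one-line status for an importable (or missing) module.
    try:
        module = __import__(name)
        return name + " found at " + getattr(module, "__file__", "(builtin)")
    except ImportError:
        return name + " is not importable; check that the user site-packages dir is on sys.path"

# Check the three libraries installed above.
for name in ("pyhs2", "sasl", "thrift"):
    print(check_module(name))
```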
#!/usr/bin/env python
import pyhs2 as hive
import getpass

DEFAULT_DB = 'default'
DEFAULT_SERVER = '10.37.40.1'
DEFAULT_PORT = 10000
DEFAULT_DOMAIN = 'PAM01-PRD01.IBM.COM'

# Get the username and password
u = raw_input('Enter PAM username: ')
s = getpass.getpass()

# Build the Hive connection (HiveServer2 with LDAP authentication)
connection = hive.connect(host=DEFAULT_SERVER,
                          port=DEFAULT_PORT,
                          authMechanism='LDAP',
                          user=u + '@' + DEFAULT_DOMAIN,
                          password=s,
                          database=DEFAULT_DB)

# Hive query statement
statement = "select * from user_yuti.Temp_CredCard where pir_post_dt = '2014-05-01' limit 100"

# Execute the query and print the returned rows
cur = connection.cursor()
cur.execute(statement)
for row in cur.fetch():
    print(row)
cur.close()
connection.close()
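The rows that pyhs2 returns from a fetch are plain Python lists, so they can be handed straight to the standard csv module. A sketch of turning query results into CSV text; the header names and sample rows below are stand-ins for real results, not data from the example table:

```python
import csv
import io

def rows_to_csv(header, rows):
    # Serialize a header plus a list-of-lists of row values into CSV text.
    buf = io.StringIO()
    writer = csv.writer(buf)
    writer.writerow(header)
    writer.writerows(rows)
    return buf.getvalue()

# Sample data standing in for a cursor's schema and fetched rows.
header = ['card_id', 'pir_post_dt', 'amount']
rows = [['A1001', '2014-05-01', '25.40'],
        ['A1002', '2014-05-01', '113.00']]
print(rows_to_csv(header, rows))
```

With a live connection, the header could be derived from the cursor's schema (in pyhs2, cur.getSchema() returns one dict per column) and the rows from cur.fetch().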
Make the script executable and run it:
chmod +x test_hive2.py
./test_hive2.py