BNI Python Training
EVERYBODY
Data Engineer / Scientist – PT XL AXIATA
• Manages data pipelines with Cloudera technology
• Supports the business with big data
• Helps data scientists build models at large scale
• Experience: 4 years in Python, 3 years in big data
PYTHON INTRODUCTION
Versions:
• 2.*
• 3.*
WHY PYTHON ?
• Simple
• Extensive Support Libraries
• Integration Feature
TOP COMPANIES THAT USE PYTHON
PYTHON TOP LIBRARIES
WHAT PYTHON CAN DO
INTERMEZZO
DATA ANALYTICS CYCLE
[Diagram: the data engineer side (traditional DW, Hadoop tech) feeds data through Python to presentation/reporting on the data scientist side]
OVERVIEW
• Variable
• Operator
• String and functions
• Conditionals
• Iterations
• List
• Tuple
• Dictionary
• File
• Error handling
BASIC PYTHON
JUPYTER NOTEBOOK
• https://colab.research.google.com/
PRIMITIVE VARIABLE
• String: message = 'And now for something completely different'
• Integer: n = 17
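The two primitive variables above can be tried directly in a notebook cell; a minimal sketch:

```python
# Primitive variables: Python infers the type from the assigned value
message = 'And now for something completely different'  # String
n = 17                                                  # Integer

print(type(message).__name__)  # str
print(type(n).__name__)        # int
```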
CONDITIONAL
• One conditional
CHAINED CONDITIONAL
• Two or more conditionals
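A minimal sketch of both forms, using a made-up value for n:

```python
n = 17

# One conditional
if n > 0:
    print('positive')

# Chained conditional: the branches are checked in order, top to bottom
if n % 2 == 0:
    parity = 'even'
elif n % 2 == 1:
    parity = 'odd'
else:
    parity = 'unknown'

print(parity)  # odd
```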
LIST
• Sequence
• Mutable
• Initialize
• Traversing a list
LIST OPERATIONS
• Append
• Extend
LIST AND FUNCTIONS
DELETING ELEMENT
• Remove
• Pop
• del
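The list operations and the three ways of deleting elements can be sketched in one snippet:

```python
t = [1, 2, 3]          # initialize a mutable sequence

t.append(4)            # add one element        -> [1, 2, 3, 4]
t.extend([5, 6])       # add another list's items -> [1, 2, 3, 4, 5, 6]

t.remove(1)            # delete by value        -> [2, 3, 4, 5, 6]
last = t.pop()         # delete by position and return it (default: last)
del t[0]               # delete by position without returning

print(t)               # [3, 4, 5]
print(last)            # 6
```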
TUPLES
• Sequence
• immutable (no append)
• Initialize
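A small sketch showing initialization and why "no append" holds:

```python
t = (1, 2, 3)          # initialize: parentheses (or just commas)

print(t[0])            # 1 -- indexing works like a list

# Tuples are immutable: no append, and items cannot be reassigned
try:
    t[0] = 99
except TypeError:
    print('tuples are immutable')
```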
LIST AND STRINGS
DICTIONARY
• Create dictionary
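A minimal sketch of creating and using a dictionary (the English-to-Spanish example is illustrative):

```python
eng2sp = {}                 # create an empty dictionary
eng2sp['one'] = 'uno'       # add key/value pairs one by one
eng2sp['two'] = 'dos'

# or create it in one step
eng2sp = {'one': 'uno', 'two': 'dos'}

print(eng2sp['one'])        # uno
print('three' in eng2sp)    # False
```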
FILES
• Reading files
• Write to file
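Both directions in one sketch ('demo.txt' is a throwaway filename):

```python
# Write to a file ('w' creates or overwrites), then read it back
with open('demo.txt', 'w') as fout:
    fout.write('hello\n')
    fout.write('world\n')

with open('demo.txt') as fin:
    lines = [line.rstrip() for line in fin]

print(lines)  # ['hello', 'world']
```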
ERROR HANDLING
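A minimal try/except sketch, catching the ValueError that int() raises on bad input:

```python
def safe_int(text):
    # try/except keeps the program running when conversion fails
    try:
        return int(text)
    except ValueError:
        return None

print(safe_int('17'))    # 17
print(safe_int('oops'))  # None
```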
SOURCE
• http://do1.dr-chuck.com/pythonlearn/EN_us/pythonlearn.pdf
PANDAS INTRODUCTION
CREATING DATAFRAME FROM SCRATCH
• Add index
• Search by index
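The steps above can be sketched as follows (the purchases data is made up):

```python
import pandas as pd

# Build a DataFrame from a dict of columns, with an explicit index
purchases = pd.DataFrame(
    {'apples': [3, 2, 0], 'oranges': [0, 3, 7]},
    index=['June', 'Robert', 'Lily'])

# Search by index with .loc
print(purchases.loc['June'])   # the row labelled 'June'
```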
COMMON FILES IN PANDAS
READ DATA FROM CSV
• Statistics columns
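Reading a CSV and getting per-column statistics; a small inline CSV stands in for the real movies file:

```python
import io
import pandas as pd

csv_text = """title,year,revenue_millions
Guardians,2014,333.13
Sing,2016,270.32
"""
df = pd.read_csv(io.StringIO(csv_text), index_col='title')

print(df.describe())                  # count/mean/std/min/max per column
print(df['revenue_millions'].mean())
```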
DATAFRAME SLICING, SELECTING, EXTRACTING
Column-wise
Row-wise
Conditional selections
Equivalent SQL: SELECT * FROM movies_df WHERE director = 'Ridley Scott' OR director = 'Christopher Nolan'
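The same selection in pandas uses boolean masks; note the |, since one film cannot have both directors at once (the toy data is made up):

```python
import pandas as pd

movies = pd.DataFrame({
    'title': ['Alien', 'Gladiator', 'Inception'],
    'director': ['Ridley Scott', 'Ridley Scott', 'Christopher Nolan']})

# Boolean masks combined with | (or)
mask = ((movies['director'] == 'Ridley Scott')
        | (movies['director'] == 'Christopher Nolan'))
print(movies[mask])

# isin() is a shorter equivalent
same = movies[movies['director'].isin(['Ridley Scott', 'Christopher Nolan'])]
```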
Apply aggregation (pivot table)
• dataframe = movies_df_gb
• index = 'year'
• columns = 'new_genre' (row values become new columns)
• values = 'count_genre' (fills the cells)
• aggfunc = aggregation function (max, min, sum, etc.)
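The parameters above map directly onto pd.pivot_table; a toy stand-in for movies_df_gb:

```python
import pandas as pd

# One row per (year, genre) with a count, as produced by the earlier groupby
movies_df_gb = pd.DataFrame({
    'year': [2015, 2015, 2016],
    'new_genre': ['Action', 'Drama', 'Action'],
    'count_genre': [4, 2, 5]})

pvt = pd.pivot_table(movies_df_gb,
                     index='year',         # one row per year
                     columns='new_genre',  # genres become columns
                     values='count_genre', # fills the cells
                     aggfunc='sum').fillna(0)
print(pvt)
```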
PANDAS JOIN
• Create dataframe
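A minimal join sketch with two made-up DataFrames; merge() is pandas' SQL-style join:

```python
import pandas as pd

ratings = pd.DataFrame({'title': ['Alien', 'Sing'], 'rating': [8.5, 7.1]})
years   = pd.DataFrame({'title': ['Alien', 'Sing'], 'year': [1979, 2016]})

# 'how' can be inner, left, right, or outer, as in SQL
joined = ratings.merge(years, on='title', how='inner')
print(joined)
```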
DATE AND TIME
• Create a date range
• Create timestamp
• Timestamp attributes: year, month, day, hour, minutes, seconds, ms, day/month name, day in week/month/year
• Exploration
• Date range (format: mm/dd/yyyy)
• Different formats
• Slicing data
• Daily aggregation
• Monthly aggregation
• Get year; year and month; year, month and day; hour; day name
• Time delta
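The date-and-time operations above can be sketched with a made-up hourly series:

```python
import pandas as pd

# Create a date range (mm/dd/yyyy format) and an hourly series over it
idx = pd.date_range('01/01/2020', periods=48, freq='h')
ts = pd.Series(range(48), index=idx)

# Create a timestamp and read its attributes
stamp = pd.Timestamp('2020-01-01 13:45:00')
print(stamp.year, stamp.month_name(), stamp.day_name())

# Slice by date string, then aggregate per day
daily = ts.resample('D').sum()
print(ts['2020-01-01'].sum(), daily.iloc[0])
```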
UNIX TIME
• https://www.unixtimestamp.com/index.php
• Source
• https://www.learndatasci.com/tutorials/python-pandas-tutorial-complete-introduction-for-beginners/
• https://towardsdatascience.com/basic-time-series-manipulation-with-pandas-4432afee64ea
PYTHON DATA VISUALIZATION
• Matplotlib
• Seaborn
TYPES OF CHARTS
• Bar chart
• Line chart
• Scatter plot
• Heatmap
SETUP JUPYTER
DATA PREPARATION
BAR CHART
Its purpose is to compare a few items.
BAR CHART DATA
LINE CHART
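Both chart types can be sketched with matplotlib; the genre counts and revenue figures below are made up:

```python
import matplotlib
matplotlib.use('Agg')          # non-interactive backend, safe outside notebooks
import matplotlib.pyplot as plt

genres = ['Action', 'Drama', 'Comedy']
counts = [12, 7, 9]
years = [2014, 2015, 2016]
revenue = [120, 150, 90]

fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(8, 3))
ax1.bar(genres, counts)                 # bar chart: compare a few items
ax1.set_title('Movies per genre')
ax2.plot(years, revenue, marker='o')    # line chart: trend over time
ax2.set_title('Revenue by year')
fig.savefig('charts.png')
```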
movies_df_gb = movies_df[['year','new_genre','revenue_millions']] \
    .groupby(['year','new_genre']) \
    .agg({'new_genre':'count', 'revenue_millions':'sum'})
movies_df_gb.columns = ['count_genre','sum_revenue_mio']
movies_df_gb = movies_df_gb.reset_index()

movies_df_gb_pvt = pd.pivot_table(movies_df_gb,
                                  values='count_genre',
                                  index=['year'], columns=['new_genre'],
                                  aggfunc=np.sum).fillna(0)
HEAT MAP
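seaborn.heatmap is the usual one-liner for a pivot table like movies_df_gb_pvt; a sketch with plain matplotlib (to keep dependencies minimal) and a made-up years-by-genres matrix:

```python
import matplotlib
matplotlib.use('Agg')
import matplotlib.pyplot as plt
import numpy as np

# A movies_df_gb_pvt-style matrix: rows = years, columns = genres
data = np.array([[4., 2., 0.],
                 [5., 1., 3.]])

fig, ax = plt.subplots()
im = ax.imshow(data, cmap='viridis')    # each cell coloured by its value
ax.set_xticks(range(3))
ax.set_xticklabels(['Action', 'Drama', 'Comedy'])
ax.set_yticks(range(2))
ax.set_yticklabels([2015, 2016])
fig.colorbar(im)
fig.savefig('heatmap.png')
```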
SPARK INTRODUCTION
• RDD
a = sc.parallelize([1,2,3,4])
• DataFrame
df = a.map(lambda x: (x,)).toDF(['a'])
START SPARK ENGINE
• Basic configuration
LOAD DATA
• From CSV
• Monitor running jobs in the Spark UI: http://localhost:4040
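The session setup and CSV load can be sketched as below. This is a sketch only: it assumes pyspark is installed and a JVM is available, and 'movies.csv' is a placeholder for the training file.

```python
def run_spark_job():
    from pyspark.sql import SparkSession

    spark = (SparkSession.builder
             .master('local[*]')           # basic configuration: run on local cores
             .appName('bni-training')
             .getOrCreate())

    df = (spark.read
          .option('header', 'true')        # first CSV row holds column names
          .option('inferSchema', 'true')   # guess column types from the data
          .csv('movies.csv'))

    df.printSchema()                       # watch the job at http://localhost:4040
    spark.stop()

# run_spark_job()  # uncomment once pyspark and the data file are available
```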
SCHEMA AND COLUMNS
• Rename columns
DATAFRAME EXPLORATION
• Filter
• aggregation
• join
• pivot
USER DEFINED FUNCTION (UDF)
TEMPORARY TABLE
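The exploration operations, a UDF, and a temporary table can be sketched in one function. Sketch only: pyspark is assumed, and the column names (director, year, revenue_millions, genre, title) are assumptions about the training data.

```python
def explore(spark, df):
    from pyspark.sql import functions as F
    from pyspark.sql.functions import udf
    from pyspark.sql.types import StringType

    # filter
    scott = df.filter(df.director == 'Ridley Scott')

    # aggregation
    per_year = df.groupBy('year').agg(F.sum('revenue_millions').alias('rev'))

    # join
    joined = df.join(per_year, on='year', how='inner')

    # pivot
    pvt = df.groupBy('year').pivot('genre').count()

    # user defined function (UDF)
    shout = udf(lambda s: s.upper(), StringType())
    upper = df.withColumn('title_upper', shout(df.title))

    # temporary table: query the DataFrame with SQL
    df.createOrReplaceTempView('movies')
    return spark.sql('SELECT title FROM movies LIMIT 5')
```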
EXPORT RESULT
Save to csv
Save to parquet
Result folder
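Saving to CSV and Parquet can be sketched as below (pyspark assumed; the result folder names are placeholders):

```python
def export_results(df):
    # coalesce(1) writes a single file into the result folder
    df.coalesce(1).write.mode('overwrite').csv('result_csv', header=True)
    df.write.mode('overwrite').parquet('result_parquet')
```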
• spark.stop()
PYTHON HBASE
• Download and extract HBase:
  https://downloads.apache.org/hbase/1.4.13/hbase-1.4.13-bin.tar.gz
• Configure conf/hbase-env.sh (add JAVA_HOME)
• Configure conf/hbase-site.xml
• Export JAVA_HOME and HBASE_HOME in ~/.bashrc:
  export JAVA_HOME="/usr/lib/jvm/java-8-openjdk-amd64"
  export PATH=$PATH:$JAVA_HOME/bin
  export HBASE_HOME=/home/adam/hduser/hbase-1.4.12
  export PATH=$PATH:$HBASE_HOME/bin
• Start HBase, the HBase shell, and the Thrift server
RETRIEVE DATA
READ CSV
1. library
2. Import csv
CREATE CONNECTION AND INSERT
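The read-CSV-then-insert flow can be sketched with the happybase Thrift client. Sketch only: the table name 'movies', the column family 'cf', and the 'id' key column are assumptions; the connection part needs `pip install happybase` and a running HBase Thrift server (`hbase thrift start`).

```python
import csv
import io

def rows_from_csv(text):
    # Turn CSV text into (row_key, {b'family:qualifier': value}) pairs
    # in the shape HBase's put() expects
    reader = csv.DictReader(io.StringIO(text))
    for rec in reader:
        key = rec.pop('id')
        yield key, {('cf:%s' % k).encode(): v.encode() for k, v in rec.items()}

def load(text, host='localhost'):
    import happybase
    conn = happybase.Connection(host)   # connects to Thrift (default port 9090)
    table = conn.table('movies')        # table name is an assumption
    for key, cells in rows_from_csv(text):
        table.put(key.encode(), cells)
    conn.close()
```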
SOURCE
• https://www.digitalocean.com/community/tutorials/how-to-install-java-with-apt-on-ubuntu-18-04
• https://www.guru99.com/hbase-installation-guide.html