SlideShare a Scribd company logo
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Data Analysis With Python
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Agenda
Python Applications
Data Life-cycle
Python For Data Analysis
What is Pandas? – Numpy, Scipy
Pandas Operations
Python for Statistics
Python for Hadoop
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Python Applications
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Python Applications
Web Scraping
Testing
Web
Development
Data Analysis
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Data Life-Cycle
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Data Life-Cycle
Data
Data
Data
Data
Data
Warehousing
Data AnalysisData AnalysisData Analysis Data Visualization
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
What is Data Analysis?
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
What is Data Analysis?
Percentage increase in unemployed
youth in Afghanistan between 2010-2011
Data of unemployed
youth across the globe
from 2010-2014
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
What is Pandas?
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Data Analysis Using Python
Pandas is a software library written for the Python programming language for data manipulation and analysis.
Numpy and Scipy
and Matplotlib
Pandas is well suited for many different kinds of data:
 Tabular data with heterogeneously-typed columns.
 Ordered and unordered time series data.
 Arbitrary matrix data with row and column labels
 Any other form of observational / statistical data sets. The data actually
need not be labeled at all to be placed into a pandas data structure
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Pandas Operations
Changing the Index Concatenation
Slicing the
DataFrame
Data conversion
Changing the
column headers
Joining and Merging
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Slicing
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Slicing
Index Int rate US GDP Thousands
2001 2 50
2002 3 55
2003 2 65
2004 2 55
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Slicing
Slicing the starting 2 rows
Slicing the last 2 rows
Index Int rate US GDP Thousands
2001 2 50
2002 3 55
2003 2 65
2004 2 55
Index Int rate US GDP Thousands
2001 2 50
2002 3 55
Index Int rate US GDP Thousands
2003 2 65
2004 2 55
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Merging
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Merging
Index HPI Int rate US GDP Thousands
2001 80 2 50
2002 85 3 55
2003 88 2 65
2004 85 2 55
Index HPI Int rate US GDP Thousands
2005 80 2 50
2006 85 3 55
2007 88 2 65
2008 85 2 55
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Merging
Index HPI Int rate US GDP Thousands
2001 80 2 50
2002 85 3 55
2003 88 2 65
2004 85 2 55
Index HPI Int rate US GDP Thousands
2005 80 2 50
2006 85 3 55
2007 88 2 65
2008 85 2 55
Merging
Index HPI Int rate US GDP
Thousands x
US GDP
Thousands y
0 80 2 50 50
1 85 3 55 55
2 88 2 65 65
3 85 2 55 55
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Joining
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Joining
Index Int rate US GDP
Thousands
2001 2 50
2002 3 55
2003 2 65
2004 2 55
Index Low tier
HPI
Unemployment
2001 50 7
2003 52 8
2004 50 9
2005 43 6
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Joining
Index Int rate US GDP
Thousands
2001 2 50
2002 3 55
2003 2 65
2004 2 55
Index Low tier
HPI
Unemployment
2001 50 7
2003 52 8
2004 50 9
2005 43 6
Index Int rate US GDP
Thousands
Low tier
HPI
Unemployment
2001 2.0 50.0 50.0 7.0
2002 3.0 55.0 NaN NaN
2003 2.0 65.0 52.0 8.0
2004 2.0 55.0 50.0 9.0
2005 NaN NaN 53.0 6.0
Joining
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Changing the Index and Column Headers
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Changing the Index and Column Headers
Index Int rate US GDP Thousands
2001 2 50
2002 3 55
2003 2 65
2004 2 55
Index US GDP Thousands
2001 50
2002 55
2003 65
2004 55
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Changing the Index and Column Headers
Index Int rate US GDP Thousands
2001 2 50
2002 3 55
2003 2 65
2004 2 55
Index US GDP Thousands
2001 50
2002 55
2003 65
2004 55
Index US GDP Thousands
2 50
3 55
2 65
2 55
Index GDP
2001 50
2002 55
2003 65
2004 55
Changing the Index
Changing the
column headers
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Concatenation
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Concatenation
Student
Name:
Age:
Sex:
Phone number:
Student Data
Concatenate
E-mail
Student
Name:
Age:
Sex:
Phone number:
E-mail:
Concatenation
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Data Munging
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Data Munging
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Use-Case
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Example: Youth Unemployment Data
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Example: Youth Unemployment Data
Problem Statement
Find the change in percentage of unemployed youth for every country from 2010-2011
There is approx. 3.1%
increase in unemployed
youth in ‘Arab World’
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Example: Youth Unemployment Data
Column 1 – Country Name
Column 2 – Country Code
Column 3 – 2010
Column 4 – 2011
Column 5 – 2012
Column 6 – 2013
Column 7 – 2014
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Python For Statistics
from statistics import mean
print(mean([1,1,1,1,3,4,4,4,5,2]))
Mean
Median
from statistics import median
print(median([1,1,1,1,3,4,4,4,5,2]))
High Median
Low Median
from statistics import mode
print(mode([1,1,1,1,3,4,4,4,5,2]))
Mode
from statistics import mode
print(mode([1,1,1,1,3,4,4,4,5,2]))
Variance
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Python For Hadoop : Pydoop
Pydoop is a Python interface to Hadoop that allows you to write MapReduce applications and interact with HDFS
in pure Python.
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING
Python For Hadoop : Pydoop
Python Applications What Is Data Analysis
Pandas Operations Data Analysis Use-Case
What Is Pandas
Python For Statistics And
Python For Hadoop
www.edureka.co/pythonEDUREKA PYTHON CERTIFICATION TRAINING

More Related Content

PDF
Introduction to NumPy (PyData SV 2013)
PyData
 
PPTX
Data Science With Python | Python For Data Science | Python Data Science Cour...
Simplilearn
 
PPTX
Introduction to matplotlib
Piyush rai
 
PDF
Introduction to Python Pandas for Data Analytics
Phoenix
 
PPTX
Python - Numpy/Pandas/Matplot Machine Learning Libraries
Andrew Ferlitsch
 
PDF
Python Matplotlib Tutorial | Matplotlib Tutorial | Python Tutorial | Python T...
Edureka!
 
PPTX
Python Scipy Numpy
Girish Khanzode
 
Introduction to NumPy (PyData SV 2013)
PyData
 
Data Science With Python | Python For Data Science | Python Data Science Cour...
Simplilearn
 
Introduction to matplotlib
Piyush rai
 
Introduction to Python Pandas for Data Analytics
Phoenix
 
Python - Numpy/Pandas/Matplot Machine Learning Libraries
Andrew Ferlitsch
 
Python Matplotlib Tutorial | Matplotlib Tutorial | Python Tutorial | Python T...
Edureka!
 
Python Scipy Numpy
Girish Khanzode
 

What's hot (20)

PPTX
Python Seaborn Data Visualization
Sourabh Sahu
 
PDF
Introduction to Python for Data Science
Arc & Codementor
 
PDF
Data Visualization in Python
Jagriti Goswami
 
PDF
Python NumPy Tutorial | NumPy Array | Edureka
Edureka!
 
ODP
Data Analysis in Python
Richard Herrell
 
PPTX
Data Analysis in Python-NumPy
Devashish Kumar
 
PDF
Python Basics
tusharpanda88
 
PPTX
Introduction to numpy
Gaurav Aggarwal
 
PDF
The matplotlib Library
Haim Michael
 
PPTX
Python for data science
Tanzeel Ahmad Mujahid
 
ODP
Python Modules
Nitin Reddy Katkam
 
PPTX
Data Analysis with Python Pandas
Neeru Mittal
 
PDF
Data visualization in Python
Marc Garcia
 
PDF
pandas - Python Data Analysis
Andrew Henshaw
 
PDF
Data Science With Python
Mosky Liu
 
PPTX
Birch Algorithm With Solved Example
kailash shaw
 
PPTX
Artificial Neural Network | Deep Neural Network Explained | Artificial Neural...
Simplilearn
 
PPTX
Naive bayes
Ashraf Uddin
 
PPTX
Looping statement in python
RaginiJain21
 
PPT
2.3 bayesian classification
Krish_ver2
 
Python Seaborn Data Visualization
Sourabh Sahu
 
Introduction to Python for Data Science
Arc & Codementor
 
Data Visualization in Python
Jagriti Goswami
 
Python NumPy Tutorial | NumPy Array | Edureka
Edureka!
 
Data Analysis in Python
Richard Herrell
 
Data Analysis in Python-NumPy
Devashish Kumar
 
Python Basics
tusharpanda88
 
Introduction to numpy
Gaurav Aggarwal
 
The matplotlib Library
Haim Michael
 
Python for data science
Tanzeel Ahmad Mujahid
 
Python Modules
Nitin Reddy Katkam
 
Data Analysis with Python Pandas
Neeru Mittal
 
Data visualization in Python
Marc Garcia
 
pandas - Python Data Analysis
Andrew Henshaw
 
Data Science With Python
Mosky Liu
 
Birch Algorithm With Solved Example
kailash shaw
 
Artificial Neural Network | Deep Neural Network Explained | Artificial Neural...
Simplilearn
 
Naive bayes
Ashraf Uddin
 
Looping statement in python
RaginiJain21
 
2.3 bayesian classification
Krish_ver2
 
Ad

Similar to Python For Data Analysis | Python Pandas Tutorial | Learn Python | Python Training | Edureka (20)

PDF
Python Certification | Data Science with Python Certification | Python Online...
Edureka!
 
PPTX
Make Your Reports Over the Counter
TIBCO Jaspersoft
 
PDF
Supervised vs Unsupervised vs Reinforcement Learning | Edureka
Edureka!
 
PPTX
Tackling Python: What is it and How Can it Help with Technical SEO? | TechSEO...
Ruth Everett
 
PDF
SearchLeeds 2018 - Stephen Kenwright - Branded3 - Customer-centric search: se...
Branded3
 
PDF
AI in Software for Augmenting Intelligence Across the Enterprise
The Hive
 
PPTX
Set Your Course for Change with Real-Time Analytics and Insights
TIBCO Software Inc.
 
PDF
product internationalization.pdf
priyatai1
 
PDF
Statistics Using Python | Statistics Python Tutorial | Python Certification T...
Edureka!
 
PDF
Prueba de audio
cristian1255
 
PPT
RAPD Presentation
Course Creators
 
PDF
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...
Edureka!
 
PDF
Data Analytics Certification Program for Beginners | IABAC
Seenivasan
 
PDF
Pmp 3 chapter
Hisham Ali,PMP
 
PPTX
Actionable Insights with Google Analytics - Ben Rogers - Attacat Internet Mar...
Attacat Internet Marketing
 
PDF
Making Data Science accessible to a wider audience
Lou Bajuk
 
PDF
LIVErtising 2015 3 Listenomics
Jean Pierre Ranschaert
 
PPTX
CMW2023 EEAT & AI Blueprint for content scaling.pptx
Carlos Meza
 
PPTX
Session 3.pptx trainin of trainer AASt for thos who wants to be trainers
MarioAshraf10
 
PDF
Enabling a Culture of Self-Service Analytics
Precisely
 
Python Certification | Data Science with Python Certification | Python Online...
Edureka!
 
Make Your Reports Over the Counter
TIBCO Jaspersoft
 
Supervised vs Unsupervised vs Reinforcement Learning | Edureka
Edureka!
 
Tackling Python: What is it and How Can it Help with Technical SEO? | TechSEO...
Ruth Everett
 
SearchLeeds 2018 - Stephen Kenwright - Branded3 - Customer-centric search: se...
Branded3
 
AI in Software for Augmenting Intelligence Across the Enterprise
The Hive
 
Set Your Course for Change with Real-Time Analytics and Insights
TIBCO Software Inc.
 
product internationalization.pdf
priyatai1
 
Statistics Using Python | Statistics Python Tutorial | Python Certification T...
Edureka!
 
Prueba de audio
cristian1255
 
RAPD Presentation
Course Creators
 
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...
Edureka!
 
Data Analytics Certification Program for Beginners | IABAC
Seenivasan
 
Pmp 3 chapter
Hisham Ali,PMP
 
Actionable Insights with Google Analytics - Ben Rogers - Attacat Internet Mar...
Attacat Internet Marketing
 
Making Data Science accessible to a wider audience
Lou Bajuk
 
LIVErtising 2015 3 Listenomics
Jean Pierre Ranschaert
 
CMW2023 EEAT & AI Blueprint for content scaling.pptx
Carlos Meza
 
Session 3.pptx trainin of trainer AASt for thos who wants to be trainers
MarioAshraf10
 
Enabling a Culture of Self-Service Analytics
Precisely
 
Ad

More from Edureka! (20)

PDF
What to learn during the 21 days Lockdown | Edureka
Edureka!
 
PDF
Top 10 Dying Programming Languages in 2020 | Edureka
Edureka!
 
PDF
Top 5 Trending Business Intelligence Tools | Edureka
Edureka!
 
PDF
Tableau Tutorial for Data Science | Edureka
Edureka!
 
PDF
Python Programming Tutorial | Edureka
Edureka!
 
PDF
Top 5 PMP Certifications | Edureka
Edureka!
 
PDF
Top Maven Interview Questions in 2020 | Edureka
Edureka!
 
PDF
Linux Mint Tutorial | Edureka
Edureka!
 
PDF
How to Deploy Java Web App in AWS| Edureka
Edureka!
 
PDF
Importance of Digital Marketing | Edureka
Edureka!
 
PDF
RPA in 2020 | Edureka
Edureka!
 
PDF
Email Notifications in Jenkins | Edureka
Edureka!
 
PDF
EA Algorithm in Machine Learning | Edureka
Edureka!
 
PDF
Cognitive AI Tutorial | Edureka
Edureka!
 
PDF
AWS Cloud Practitioner Tutorial | Edureka
Edureka!
 
PDF
Blue Prism Top Interview Questions | Edureka
Edureka!
 
PDF
Big Data on AWS Tutorial | Edureka
Edureka!
 
PDF
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
Edureka!
 
PDF
Kubernetes Installation on Ubuntu | Edureka
Edureka!
 
PDF
Introduction to DevOps | Edureka
Edureka!
 
What to learn during the 21 days Lockdown | Edureka
Edureka!
 
Top 10 Dying Programming Languages in 2020 | Edureka
Edureka!
 
Top 5 Trending Business Intelligence Tools | Edureka
Edureka!
 
Tableau Tutorial for Data Science | Edureka
Edureka!
 
Python Programming Tutorial | Edureka
Edureka!
 
Top 5 PMP Certifications | Edureka
Edureka!
 
Top Maven Interview Questions in 2020 | Edureka
Edureka!
 
Linux Mint Tutorial | Edureka
Edureka!
 
How to Deploy Java Web App in AWS| Edureka
Edureka!
 
Importance of Digital Marketing | Edureka
Edureka!
 
RPA in 2020 | Edureka
Edureka!
 
Email Notifications in Jenkins | Edureka
Edureka!
 
EA Algorithm in Machine Learning | Edureka
Edureka!
 
Cognitive AI Tutorial | Edureka
Edureka!
 
AWS Cloud Practitioner Tutorial | Edureka
Edureka!
 
Blue Prism Top Interview Questions | Edureka
Edureka!
 
Big Data on AWS Tutorial | Edureka
Edureka!
 
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
Edureka!
 
Kubernetes Installation on Ubuntu | Edureka
Edureka!
 
Introduction to DevOps | Edureka
Edureka!
 

Recently uploaded (20)

PPTX
IoT Sensor Integration 2025 Powering Smart Tech and Industrial Automation.pptx
Rejig Digital
 
PDF
Brief History of Internet - Early Days of Internet
sutharharshit158
 
PDF
Doc9.....................................
SofiaCollazos
 
PDF
Presentation about Hardware and Software in Computer
snehamodhawadiya
 
PPTX
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
PDF
BLW VOCATIONAL TRAINING SUMMER INTERNSHIP REPORT
codernjn73
 
PPTX
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
PDF
Beyond Automation: The Role of IoT Sensor Integration in Next-Gen Industries
Rejig Digital
 
PDF
Software Development Company | KodekX
KodekX
 
PDF
Trying to figure out MCP by actually building an app from scratch with open s...
Julien SIMON
 
PDF
A Strategic Analysis of the MVNO Wave in Emerging Markets.pdf
IPLOOK Networks
 
PPTX
The-Ethical-Hackers-Imperative-Safeguarding-the-Digital-Frontier.pptx
sujalchauhan1305
 
PPTX
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
PDF
How Open Source Changed My Career by abdelrahman ismail
a0m0rajab1
 
PPTX
Comunidade Salesforce SĂŁo Paulo - Desmistificando o Omnistudio (Vlocity)
Francisco Vieira JĂșnior
 
PPTX
AI and Robotics for Human Well-being.pptx
JAYMIN SUTHAR
 
PPTX
Applied-Statistics-Mastering-Data-Driven-Decisions.pptx
parmaryashparmaryash
 
PDF
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
PDF
Orbitly Pitch DeckA Mission-Driven Platform for Side Project Collaboration (...
zz41354899
 
PDF
The Evolution of KM Roles (Presented at Knowledge Summit Dublin 2025)
Enterprise Knowledge
 
IoT Sensor Integration 2025 Powering Smart Tech and Industrial Automation.pptx
Rejig Digital
 
Brief History of Internet - Early Days of Internet
sutharharshit158
 
Doc9.....................................
SofiaCollazos
 
Presentation about Hardware and Software in Computer
snehamodhawadiya
 
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
BLW VOCATIONAL TRAINING SUMMER INTERNSHIP REPORT
codernjn73
 
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
Beyond Automation: The Role of IoT Sensor Integration in Next-Gen Industries
Rejig Digital
 
Software Development Company | KodekX
KodekX
 
Trying to figure out MCP by actually building an app from scratch with open s...
Julien SIMON
 
A Strategic Analysis of the MVNO Wave in Emerging Markets.pdf
IPLOOK Networks
 
The-Ethical-Hackers-Imperative-Safeguarding-the-Digital-Frontier.pptx
sujalchauhan1305
 
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
How Open Source Changed My Career by abdelrahman ismail
a0m0rajab1
 
Comunidade Salesforce SĂŁo Paulo - Desmistificando o Omnistudio (Vlocity)
Francisco Vieira JĂșnior
 
AI and Robotics for Human Well-being.pptx
JAYMIN SUTHAR
 
Applied-Statistics-Mastering-Data-Driven-Decisions.pptx
parmaryashparmaryash
 
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
Orbitly Pitch DeckA Mission-Driven Platform for Side Project Collaboration (...
zz41354899
 
The Evolution of KM Roles (Presented at Knowledge Summit Dublin 2025)
Enterprise Knowledge
 

Python For Data Analysis | Python Pandas Tutorial | Learn Python | Python Training | Edureka