0% found this document useful (0 votes)
23 views

Python For Data Analysis Lecture 1

Uploaded by

yangbn2008
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
23 views

Python For Data Analysis Lecture 1

Uploaded by

yangbn2008
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 44

PYTHON FOR

D ATA A N A LY S I S 1 / 7

Ø Python 数据分析 – 就业市场

Ø Python 数据分析 – 简历准备

Ø Python 数据分析 – 相关技能

Ø Python 数据分析 – 对比 R |SAS |VBA |Java | C++

Ø Python 数据分析 – 学习计划

Ø Python 数据分析 – 第一步: 读取数据

Y A N G L I U

M A R 2 5 , 2 0 2 1
My background
• Master and Bachelor in Computer Science
Education • MBA (Rotman School of Management, Finance Specialization)

• Banking: Capital One Bank, Scotiabank and TD Bank


Companies • Healthcare: UHN (Toronto General Hospital, Princess Margaret Hospital and Toronto Western Hospital)
• Others: IBM Global Services, Sandals Marketing, University of Toronto and Start-ups

• Architect: System Architect (IBM), SAS Architect and Enterprise Data Lake/Cloud Architect (Scotiabank)
Roles • Senior Manager: Senior Management, Data Analysis (EDW) and Senior Manager, Stress Testing (GRM)
• Consulting: Senior Consultants at various companies including Capital One Bank and TD Bank

• SAS, Python (ETL, Risk Management Analytics and Modeling)


Technical Skills • R (Risk Management Modelling and Stress testing)
• VBA automation … many more including Web/Mobile programming …

Aspiration • Leverage the latest technologies to bring the best data analysis learning experiences
Python Developer vs Data Analyst with Python skills
•Web Development (Django, Flask)
•Game Development
•Machine Learning and Artificial Intelligence
•Data Analysis and Analytics
•Basic ETL (Extract Transform Load) ELT (Extract Load
Transform)
•ETL Developer
•Data Science and Data Visualization
•Desktop GUI
•Web Scraping Applications
•Business Applications (Web, Mobile, iPad and Desktop)
•Audio and Video Applications
•CAD Applications
•Embedded Applications
Resume Enhancement

Manager:
1. Can he/she work?
Make sure practice them first 2. Do I like him/her?
ü Python3 3. Can I take him/her to meetings?
ü Anaconda
ü Jupyter notebook You:
ü JupyterLab
ü Qt Console 1. How I am different?
ü Ipython 2. Can I answer the technical
ü Powershell (Windows only) questions?
ü Shell/Terminal 3. Can I tell my background and
ü Pandas experiences in stories align with
ü …. business needs.
DO NOT WAIT

Many
opportunities
in this area
DEVELOP THE RIGHT SKILLS – HARD SKILLS
Banking (CFA/FRM/CSC)

Telecom
Domain Healthcare (HL7)
Data analysis concepts
Knowledge
Online Retail (Google Analytics)
Data analysis
methodology
Potential
Statistics Anova, R2
Candidates
Database
Data Analysis Technical SQL (NoSQL)
Knowledge Knowledge Python
SAS
R
VBA
Cloud 8
DEVELOP THE RIGHT SKILLS – SOFT SKILLS

Attention to
details

Communication
Good skills
Candidate
Interview skills

Personal skills

9
TECHNICAL SKILLS TO LEARN – THE MORE “IT” YOU ARE …

• Excel/VBA Financial Analyst


• SAS Business/Finance/Accounting
Business Marketing Analyst

• Python/Pandas Statistical Modeling


Math, Stats, Physics
Stats/Math •R Data Analytics/Data Science

• Java Computer Science, Computer ELT/Cloud


Computer • C/C++ Engineer, Electronic Engineer Software Development
Science
Python Basics:

1. I know how to write if/else if/else

2. I know List [ ] Tuple ( ) Dictionary { }

3. I know which objects are mutable/immutable

4. I know how to write for and while loop

5. I know how to write a function

6. Optional: I know Boolean (True/False), Set (no duplicates)

and Class (see image, Class is covered in an after-class video)


Python Developer vs Data Analyst with Python skills
https://mcs.utm.utoronto.ca/~108s19/#lectures
https://mcs.utm.utoronto.ca/~108s19/#lectures
PYTHON FUNCTIONS
How Python works
• The PVM

4**0.5
Python History

Python 2.0 in 2000 Python 2.7 in 2010

Python 0.9 Python 2.7 ends

1991 2008 2020

Python 3.0 Python 3.8


Python 3.9 (Oct 2020)
Python 3.10 (Oct 2021)
Hands-on Exercise: Open your Editor/IDE or use Editor online
https://pynative.com/online-python-code-editor-to-execute-python-code/
https://www.w3schools.com/python/python_compiler.asp
Jupyter Pronunciation and its Creator
Creators of Jupyter, Python and Perl

Source: Images are copies from Wikipedia at https://en.wikipedia.org/


Basic Concepts Jupyter Suite
Command line
Jupyter Suite
Shell
Jupyter
Terminal Jupyter Notebook JupyterLab
Qt Console Console
Console
Terminal

Kernel Kernel IPython


Ipython
Notebook Python Terminal
Hard-
ware Windows Mac/Linux/Unix
Command Line Command Line
Powershell Shell
Command Prompt Terminal
Windows CMD and PowerShell
Windows Shell, Terminal and Console

1. Shell is the “Engine” you see in 1. Terminal and Console run on the top Kernel.
terminal. 2. Kernel is the central core of OS (operating system): Controls system and
2. Shell is used interactively via a communicates with hardware.
terminal. 3. Kernel manages files/devices/network/processes, allocates CPU/Memory/Hard
3. Different shells: Bash, Ksh, Zsh … disk resources and controls which part of memory a program/process can access.
Terminal on Mac
Python terminal Vs IPython
Ipython vs Qt Console
Jupyter notebook and JupyterLab Dashboard
Database Installation and Connection
TXT / CSV
JSON/XML + (XSLT: My worst programming experience)

https://www.guru99.com/json-vs-xml-difference.html
Reading XML
Reading

JSON

With

Python

Reading JSON With Pandas


Homework will be posted on shared drive and answer
will be provided before the next lecture

Business Technical Technical


Processing
Scenarios Inputs Outputs

- Real Business Python


- Data inputs - Report
Scenarios
- Various files types - Charts
- Use-cases based SAS
- Reading directories - Various files formats
- Story telling in the
interviews - Reading databases - Store for future use
R

VBA

41
Independent Business Scenario #101: Level: Hard Interview Work

Business Technical
Technical Inputs Processing
Scenarios Outputs
Python
Early in the morning, SAS R VBA

your manager sent you

an email and ask you to

merge several customer

data files into a single

file in Excel before 2pm

afternoon meeting.

42
The journey of learning (400 - 1000 Scenarios)

Practice is the best of all instructors.


Publius Syrus, Circa 42 B.C.

We all learned by doing, by experimenting (and often failing), and by


asking questions.
Jay Jacob Wind
15 minutes Q & A

44

You might also like