SlideShare a Scribd company logo
Intro to Python for DSIntro to Python for DS
2 birds. One stone.
 
WIFI: MakeO ces 5Ghz
Password: Internet!23
 
http://bit.ly/thinkful-dc-python
TJ Stalcup
Lead DC Mentor @Thinkful
API Evangelist @540
Pokemon Master
About UsAbout Us
***see slide***
Speaker notes
What's your name?
What do you do?
Why are you interested in data science or python?
About youAbout you
Online Bootcamp since 2012. We have worked
with over 6000 students around the world
paired up with over 400 mentors. 
 
We get you ready for a career and guarantee
your rst job
 
92% success rate
About ThinkfulAbout Thinkful
Local DC Crew
 
Learn why DS is a thing
 
What is Python
 
How do we use it with a real world project?
 
How do I learn more?
TONIGHT: Learn Python by DoingTONIGHT: Learn Python by Doing
What is a Data Scientist?What is a Data Scientist?
Example: LinkedIn 2006Example: LinkedIn 2006
“[LinkedIn] was like arriving at a conference reception
and realizing you don’t know anyone. So you just stand
in the corner sipping your drink—and you probably
leave early.”
-LinkedIn Manager, June 2006
Enter: Jonathan GoldmanEnter: Jonathan Goldman
Data Scientist
Joined LinkedIn in 2006, only 8M users (450M in 2016)
Started experiments to predict people’s networks
Engineers were dismissive: “you can already import your address
book”
DS ProcessDS Process
Frame the question
Collect the raw data
Process the data
Explore the data
Communicate results
Frame the QuestionFrame the Question
What questions do we want to answer?
What connections (type and number) lead to higher
user engagement?
Which connections do people want to make but are
currently limited from making?
How might we predict these types of connections with
limited data from the user?
Collect the DataCollect the Data
What data do we need to answer these questions?
Connection data (who is who connected to?)
Demographic data (what is the pro le of the
connection)
Engagement data (how do they use the site)
Process the DataProcess the Data
How is the data “dirty” and how can we clean it?
‱ User input
‱ Redundancies
‱ Feature changes
‱ Data model changes
Explore the DataExplore the Data
What are the meaningful patterns in the data?
‱ Triangle closing
‱ Time overlaps
‱ Geographic overlaps
Communicate FindingsCommunicate Findings
How do we communicate this? To whom?
Marketing - this will enable us to sell X more ad space. Results in X
more impressions per day
Product - this will allow us to build X more features
Development - this will allow us to grow our team by X
Sales - this will attract X more premium accounts
C-Level - this will result in $$$ more revenue
8M - 450M in 10 years
The ResultThe Result
 
 
Career Whack-A-Mole
Why DS now?Why DS now?
Big Data: datasets whose size is beyond the
ability of typical database software tools to
capture, store, manage, and analyze
Big DataBig Data
Trend "started" in 2005
Web 2.0 - Majority of content is created by users
Mobile accelerates this — data/person skyrockets
The Data ProblemThe Data Problem
We are generating more data
every year than existed
before.........
The SolutionThe Solution
There goes my hero....
 
watch 'em as they code....
Just need to do everything....Just need to do everything....
Just need to do everything....Just need to do everything....
Knowledge of statistics, algorithms, & software
Comfort with languages & tools (Python, SQL, Tableau)
Inquisitiveness and intellectual curiosity
Strong communication skills
It’s all Teachable!
Coming Soon....Coming Soon....
Intro to SQL
Intro to Tableau
Intro to Statistics
 
http://meetup.com/Thinkful-DC
Let's Learn Python TonightLet's Learn Python Tonight
Python for Programming
Great for Data Science
Robotics
Web Development (Python/Django)
Automation
Let's Learn Python TonightLet's Learn Python Tonight
firstName = 'TJ'
lastName = "Stalcup"
age = 34 // wow, much old
print firstName // TJ
print firstName + lastName // TJStalcup
print firstName + ' ' + lastName // TJ Stalcup
print lastName + ', ' + firstName // Stalcup, TJ
print age * 2 // 68, hopefully retired
def greet(name):
print 'Hello', name
greet('Jack') // Hello, Jack
greet('Jill') // Hello, Jill
greet('Bob') // Hello, Bob
greet(firstName) // Hello, TJ
greet(firstName + ' ' + lastName) // Hello, TJ Stalcup
The ModelThe Model
Our model is going to be a Decision Tree.
 
Decision trees predict the most likely outcome based on input.
 
You can think of it like a computer building a version of 20 questions.
Decision Trees - Golf?Decision Trees - Golf?
The NotebookThe Notebook
We're going to use a Google hosted Python to build this
model. This app is called Colaboratory (Collaboration + Laboratory)
 
http://colab.research.google.com
 
New Notebook > New Python3 Notebook
notebook
ShortcomingsShortcomings
Our model has a few weaknesses:
 
-Limited inputs
-Assumptions
Data Science @ ThinkfulData Science @ Thinkful
Flexible, project-based curriculum to help you become the data
scientist you want to be
You don’t just learn skills, you get to make things
Mentor support from experts in the industry
Also, there's a job guarantee
Link for the third party audit jobs report:
https://www.thinkful.com/bootcamp-jobs-stats
Thinkful Graduates 92% Job Placement Rate
Learning Mentor
Career MentorProgram Manager
Local Community
You
Unprecedented SupportUnprecedented Support
http://bit.ly/dc-ds-trial
Initial 2-week trial course
Start with Python and Statistics
Unlimited Q&A Sessions
Option to continue with full bootcamp
Financing & scholarships available
O er valid for tonight only
Aaron Lamphere
Trial Program Manager
 
Thinkful Two Week TrialThinkful Two Week Trial

More Related Content

PDF
Introduction to Python for Data Science
Arc & Codementor
 
PDF
Python for Data Science
Gabriel Moreira
 
PPTX
Data Science using Python
ShapeMySkills Pvt Ltd
 
PDF
Using hadoop for big data
Data Science Thailand
 
PDF
Python in Data Science Work
Rick. Bahague
 
PPTX
Python for data science
Tanzeel Ahmad Mujahid
 
PDF
Be a Data Scientist in 8 steps!
PromptCloud
 
PDF
Life of a data scientist (pub)
Buhwan Jeong
 
Introduction to Python for Data Science
Arc & Codementor
 
Python for Data Science
Gabriel Moreira
 
Data Science using Python
ShapeMySkills Pvt Ltd
 
Using hadoop for big data
Data Science Thailand
 
Python in Data Science Work
Rick. Bahague
 
Python for data science
Tanzeel Ahmad Mujahid
 
Be a Data Scientist in 8 steps!
PromptCloud
 
Life of a data scientist (pub)
Buhwan Jeong
 

What's hot (20)

PDF
Data science presentation 2nd CI day
Mohammed Barakat
 
PDF
What is Big Data?
CodePolitan
 
PDF
Unit 3 part 2
MohammadAsharAshraf
 
PDF
Introduction to Data Science (Data Science Thailand Meetup #1)
Data Science Thailand
 
PDF
How to become a Data Scientist?
HackerEarth
 
PPS
Big Data Science: Intro and Benefits
Chandan Rajah
 
PPTX
Data Science: Past, Present, and Future
Gregory Piatetsky-Shapiro
 
PDF
Agile Data Science
Dhiana Deva
 
PPTX
Visualization of Information (ProQuest)
Michael Adcock
 
PPTX
Data Science: Not Just For Big Data
Revolution Analytics
 
PDF
Big Data and Data Science for traditional Swiss companies
Swiss Big Data User Group
 
PDF
Data science and_analytics_for_ordinary_people_ebook
Jeffrey Strickland, Ph.D., CMSP
 
PPTX
Session 10 handling bigger data
bodaceacat
 
PPTX
Katharine Jarmul, Founder at Kjamistan - "Learn Data Wrangling with Python"
Dataconomy Media
 
PDF
Cheat sheets for data scientists
Ajay Ohri
 
PDF
Introduction to Data Science
Edureka!
 
PDF
Promoting a Data Driven Culture in a Microservices Environment
PyData
 
PPTX
Reproducible Research with R, The Tidyverse, Notebooks, and Spark
Adaryl "Bob" Wakefield, MBA
 
PPTX
Session 01 designing and scoping a data science project
bodaceacat
 
PDF
Embracing the Monolith
Leon Sasson
 
Data science presentation 2nd CI day
Mohammed Barakat
 
What is Big Data?
CodePolitan
 
Unit 3 part 2
MohammadAsharAshraf
 
Introduction to Data Science (Data Science Thailand Meetup #1)
Data Science Thailand
 
How to become a Data Scientist?
HackerEarth
 
Big Data Science: Intro and Benefits
Chandan Rajah
 
Data Science: Past, Present, and Future
Gregory Piatetsky-Shapiro
 
Agile Data Science
Dhiana Deva
 
Visualization of Information (ProQuest)
Michael Adcock
 
Data Science: Not Just For Big Data
Revolution Analytics
 
Big Data and Data Science for traditional Swiss companies
Swiss Big Data User Group
 
Data science and_analytics_for_ordinary_people_ebook
Jeffrey Strickland, Ph.D., CMSP
 
Session 10 handling bigger data
bodaceacat
 
Katharine Jarmul, Founder at Kjamistan - "Learn Data Wrangling with Python"
Dataconomy Media
 
Cheat sheets for data scientists
Ajay Ohri
 
Introduction to Data Science
Edureka!
 
Promoting a Data Driven Culture in a Microservices Environment
PyData
 
Reproducible Research with R, The Tidyverse, Notebooks, and Spark
Adaryl "Bob" Wakefield, MBA
 
Session 01 designing and scoping a data science project
bodaceacat
 
Embracing the Monolith
Leon Sasson
 
Ad

Similar to Intro to Python for Data Science (20)

PDF
Intro to Python for Data Science
TJ Stalcup
 
PDF
Tf itpbapm
Shannon Gallagher
 
PDF
Thinkful DC - Intro to Data Science
TJ Stalcup
 
PDF
Getting started in data science (4:3)
Thinkful
 
PDF
Getting started in data science (4:3)
Thinkful
 
PDF
Intro to Data Science
TJ Stalcup
 
PDF
2017 06-14-getting started with data science
Thinkful
 
PDF
Career in Data Science (July 2017, DTLA)
Thinkful
 
PPTX
Building Data Scientists
Mitch Sanders
 
PDF
Data Science with Python - WeCloudData
WeCloudData
 
PDF
Getting Started in Data Science
Thinkful
 
PDF
Tf itpbapm
Shannon Gallagher
 
PDF
Getting started in Data Science (April 2017, Los Angeles)
Thinkful
 
PDF
Tf gsds
Shannon Gallagher
 
PPTX
Data scientist roadmap
Sonu Kumar
 
PPT
PDS Unit - 1 Introdiction to DS.ppt
ssuser52a19e
 
PDF
Class 01 - Intro.pdf
JonathanArp3
 
PDF
Startds9.19.17sd
Thinkful
 
PDF
Python's slippy path and Tao of thick Pandas: give my data, Rrrrr...
Alexey Zinoviev
 
PPTX
Data Science_Unit-1.2 part - 2 of intro.pptx
sagarrathore52204
 
Intro to Python for Data Science
TJ Stalcup
 
Tf itpbapm
Shannon Gallagher
 
Thinkful DC - Intro to Data Science
TJ Stalcup
 
Getting started in data science (4:3)
Thinkful
 
Getting started in data science (4:3)
Thinkful
 
Intro to Data Science
TJ Stalcup
 
2017 06-14-getting started with data science
Thinkful
 
Career in Data Science (July 2017, DTLA)
Thinkful
 
Building Data Scientists
Mitch Sanders
 
Data Science with Python - WeCloudData
WeCloudData
 
Getting Started in Data Science
Thinkful
 
Tf itpbapm
Shannon Gallagher
 
Getting started in Data Science (April 2017, Los Angeles)
Thinkful
 
Data scientist roadmap
Sonu Kumar
 
PDS Unit - 1 Introdiction to DS.ppt
ssuser52a19e
 
Class 01 - Intro.pdf
JonathanArp3
 
Startds9.19.17sd
Thinkful
 
Python's slippy path and Tao of thick Pandas: give my data, Rrrrr...
Alexey Zinoviev
 
Data Science_Unit-1.2 part - 2 of intro.pptx
sagarrathore52204
 
Ad

More from TJ Stalcup (20)

PDF
Intro to JavaScript - Thinkful DC
TJ Stalcup
 
PDF
Frontend Crash Course
TJ Stalcup
 
PDF
Build Your Own Website - Intro to HTML & CSS
TJ Stalcup
 
PDF
Intro to Python
TJ Stalcup
 
PDF
Intro to Python
TJ Stalcup
 
PDF
Predict the Oscars using Data Science
TJ Stalcup
 
PDF
Thinkful DC - Intro to JavaScript
TJ Stalcup
 
PDF
Data Science Your Vacation
TJ Stalcup
 
PDF
Data Science Your Vacation
TJ Stalcup
 
PDF
Build a Game with Javascript
TJ Stalcup
 
PDF
Thinkful DC FrontEnd Crash Course - HTML & CSS
TJ Stalcup
 
PDF
Build Your Own Instagram Filters
TJ Stalcup
 
PDF
Choosing a Programming Language
TJ Stalcup
 
PDF
Frontend Crash Course
TJ Stalcup
 
PDF
Thinkful FrontEnd Crash Course - HTML & CSS
TJ Stalcup
 
PDF
Thinkful FrontEnd Crash Course - HTML & CSS
TJ Stalcup
 
PDF
Build a Virtual Pet with JavaScript
TJ Stalcup
 
PDF
Intro to Javascript
TJ Stalcup
 
PDF
DC jQuery App
TJ Stalcup
 
PDF
Thinkful DC - Intro to JavaScript
TJ Stalcup
 
Intro to JavaScript - Thinkful DC
TJ Stalcup
 
Frontend Crash Course
TJ Stalcup
 
Build Your Own Website - Intro to HTML & CSS
TJ Stalcup
 
Intro to Python
TJ Stalcup
 
Intro to Python
TJ Stalcup
 
Predict the Oscars using Data Science
TJ Stalcup
 
Thinkful DC - Intro to JavaScript
TJ Stalcup
 
Data Science Your Vacation
TJ Stalcup
 
Data Science Your Vacation
TJ Stalcup
 
Build a Game with Javascript
TJ Stalcup
 
Thinkful DC FrontEnd Crash Course - HTML & CSS
TJ Stalcup
 
Build Your Own Instagram Filters
TJ Stalcup
 
Choosing a Programming Language
TJ Stalcup
 
Frontend Crash Course
TJ Stalcup
 
Thinkful FrontEnd Crash Course - HTML & CSS
TJ Stalcup
 
Thinkful FrontEnd Crash Course - HTML & CSS
TJ Stalcup
 
Build a Virtual Pet with JavaScript
TJ Stalcup
 
Intro to Javascript
TJ Stalcup
 
DC jQuery App
TJ Stalcup
 
Thinkful DC - Intro to JavaScript
TJ Stalcup
 

Recently uploaded (20)

PDF
How-Cloud-Computing-Impacts-Businesses-in-2025-and-Beyond.pdf
Artjoker Software Development Company
 
PDF
Revolutionize Operations with Intelligent IoT Monitoring and Control
Rejig Digital
 
PPTX
How to Build a Scalable Micro-Investing Platform in 2025 - A Founder’s Guide ...
Third Rock Techkno
 
PDF
This slide provides an overview Technology
mineshkharadi333
 
PDF
Doc9.....................................
SofiaCollazos
 
PDF
Presentation about Hardware and Software in Computer
snehamodhawadiya
 
PPTX
Comunidade Salesforce SĂŁo Paulo - Desmistificando o Omnistudio (Vlocity)
Francisco Vieira JĂșnior
 
PDF
CIFDAQ'S Market Insight: BTC to ETH money in motion
CIFDAQ
 
PDF
Automating ArcGIS Content Discovery with FME: A Real World Use Case
Safe Software
 
PDF
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
PDF
Tea4chat - another LLM Project by Kerem Atam
a0m0rajab1
 
PDF
Google I/O Extended 2025 Baku - all ppts
HusseinMalikMammadli
 
PDF
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
PDF
Accelerating Oracle Database 23ai Troubleshooting with Oracle AHF Fleet Insig...
Sandesh Rao
 
PPTX
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
PDF
Structs to JSON: How Go Powers REST APIs
Emily Achieng
 
PPTX
Coupa-Overview _Assumptions presentation
annapureddyn
 
PDF
Software Development Company | KodekX
KodekX
 
PPTX
ChatGPT's Deck on The Enduring Legacy of Fax Machines
Greg Swan
 
PDF
Orbitly Pitch DeckA Mission-Driven Platform for Side Project Collaboration (...
zz41354899
 
How-Cloud-Computing-Impacts-Businesses-in-2025-and-Beyond.pdf
Artjoker Software Development Company
 
Revolutionize Operations with Intelligent IoT Monitoring and Control
Rejig Digital
 
How to Build a Scalable Micro-Investing Platform in 2025 - A Founder’s Guide ...
Third Rock Techkno
 
This slide provides an overview Technology
mineshkharadi333
 
Doc9.....................................
SofiaCollazos
 
Presentation about Hardware and Software in Computer
snehamodhawadiya
 
Comunidade Salesforce SĂŁo Paulo - Desmistificando o Omnistudio (Vlocity)
Francisco Vieira JĂșnior
 
CIFDAQ'S Market Insight: BTC to ETH money in motion
CIFDAQ
 
Automating ArcGIS Content Discovery with FME: A Real World Use Case
Safe Software
 
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
Tea4chat - another LLM Project by Kerem Atam
a0m0rajab1
 
Google I/O Extended 2025 Baku - all ppts
HusseinMalikMammadli
 
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
Accelerating Oracle Database 23ai Troubleshooting with Oracle AHF Fleet Insig...
Sandesh Rao
 
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
Structs to JSON: How Go Powers REST APIs
Emily Achieng
 
Coupa-Overview _Assumptions presentation
annapureddyn
 
Software Development Company | KodekX
KodekX
 
ChatGPT's Deck on The Enduring Legacy of Fax Machines
Greg Swan
 
Orbitly Pitch DeckA Mission-Driven Platform for Side Project Collaboration (...
zz41354899
 

Intro to Python for Data Science

  • 1. Intro to Python for DSIntro to Python for DS 2 birds. One stone.   WIFI: MakeO ces 5Ghz Password: Internet!23   http://bit.ly/thinkful-dc-python
  • 2. TJ Stalcup Lead DC Mentor @Thinkful API Evangelist @540 Pokemon Master About UsAbout Us
  • 4. What's your name? What do you do? Why are you interested in data science or python? About youAbout you
  • 5. Online Bootcamp since 2012. We have worked with over 6000 students around the world paired up with over 400 mentors.    We get you ready for a career and guarantee your rst job   92% success rate About ThinkfulAbout Thinkful Local DC Crew
  • 6.   Learn why DS is a thing   What is Python   How do we use it with a real world project?   How do I learn more? TONIGHT: Learn Python by DoingTONIGHT: Learn Python by Doing
  • 7. What is a Data Scientist?What is a Data Scientist?
  • 8. Example: LinkedIn 2006Example: LinkedIn 2006 “[LinkedIn] was like arriving at a conference reception and realizing you don’t know anyone. So you just stand in the corner sipping your drink—and you probably leave early.” -LinkedIn Manager, June 2006
  • 9. Enter: Jonathan GoldmanEnter: Jonathan Goldman Data Scientist Joined LinkedIn in 2006, only 8M users (450M in 2016) Started experiments to predict people’s networks Engineers were dismissive: “you can already import your address book”
  • 10. DS ProcessDS Process Frame the question Collect the raw data Process the data Explore the data Communicate results
  • 11. Frame the QuestionFrame the Question What questions do we want to answer? What connections (type and number) lead to higher user engagement? Which connections do people want to make but are currently limited from making? How might we predict these types of connections with limited data from the user?
  • 12. Collect the DataCollect the Data What data do we need to answer these questions? Connection data (who is who connected to?) Demographic data (what is the pro le of the connection) Engagement data (how do they use the site)
  • 13. Process the DataProcess the Data How is the data “dirty” and how can we clean it? ‱ User input ‱ Redundancies ‱ Feature changes ‱ Data model changes
  • 14. Explore the DataExplore the Data What are the meaningful patterns in the data? ‱ Triangle closing ‱ Time overlaps ‱ Geographic overlaps
  • 15. Communicate FindingsCommunicate Findings How do we communicate this? To whom? Marketing - this will enable us to sell X more ad space. Results in X more impressions per day Product - this will allow us to build X more features Development - this will allow us to grow our team by X Sales - this will attract X more premium accounts C-Level - this will result in $$$ more revenue 8M - 450M in 10 years
  • 17. Why DS now?Why DS now? Big Data: datasets whose size is beyond the ability of typical database software tools to capture, store, manage, and analyze
  • 18. Big DataBig Data Trend "started" in 2005 Web 2.0 - Majority of content is created by users Mobile accelerates this — data/person skyrockets
  • 19. The Data ProblemThe Data Problem We are generating more data every year than existed before.........
  • 20. The SolutionThe Solution There goes my hero....   watch 'em as they code....
  • 21. Just need to do everything....Just need to do everything....
  • 22. Just need to do everything....Just need to do everything.... Knowledge of statistics, algorithms, & software Comfort with languages & tools (Python, SQL, Tableau) Inquisitiveness and intellectual curiosity Strong communication skills It’s all Teachable!
  • 23. Coming Soon....Coming Soon.... Intro to SQL Intro to Tableau Intro to Statistics   http://meetup.com/Thinkful-DC
  • 24. Let's Learn Python TonightLet's Learn Python Tonight Python for Programming Great for Data Science Robotics Web Development (Python/Django) Automation
  • 25. Let's Learn Python TonightLet's Learn Python Tonight firstName = 'TJ' lastName = "Stalcup" age = 34 // wow, much old print firstName // TJ print firstName + lastName // TJStalcup print firstName + ' ' + lastName // TJ Stalcup print lastName + ', ' + firstName // Stalcup, TJ print age * 2 // 68, hopefully retired def greet(name): print 'Hello', name greet('Jack') // Hello, Jack greet('Jill') // Hello, Jill greet('Bob') // Hello, Bob greet(firstName) // Hello, TJ greet(firstName + ' ' + lastName) // Hello, TJ Stalcup
  • 26. The ModelThe Model Our model is going to be a Decision Tree.   Decision trees predict the most likely outcome based on input.   You can think of it like a computer building a version of 20 questions.
  • 27. Decision Trees - Golf?Decision Trees - Golf?
  • 28. The NotebookThe Notebook We're going to use a Google hosted Python to build this model. This app is called Colaboratory (Collaboration + Laboratory)   http://colab.research.google.com   New Notebook > New Python3 Notebook notebook
  • 29. ShortcomingsShortcomings Our model has a few weaknesses:   -Limited inputs -Assumptions
  • 30. Data Science @ ThinkfulData Science @ Thinkful Flexible, project-based curriculum to help you become the data scientist you want to be You don’t just learn skills, you get to make things Mentor support from experts in the industry Also, there's a job guarantee
  • 31. Link for the third party audit jobs report: https://www.thinkful.com/bootcamp-jobs-stats Thinkful Graduates 92% Job Placement Rate
  • 32. Learning Mentor Career MentorProgram Manager Local Community You Unprecedented SupportUnprecedented Support
  • 33. http://bit.ly/dc-ds-trial Initial 2-week trial course Start with Python and Statistics Unlimited Q&A Sessions Option to continue with full bootcamp Financing & scholarships available O er valid for tonight only Aaron Lamphere Trial Program Manager   Thinkful Two Week TrialThinkful Two Week Trial