Data Science - Module 1_ Data Science Fundamentals
Introduction to Python and Data Science
Reminder
At the start of all classes, please rename yourselves to: Name + last 3 digits and letter of your NRIC.
Example: John Tan (123A)
Agenda ● Introduction To The Course
● Python Fundamentals
Overview for cleaning and aggregating data, and understand the basic
statistical testing values of your data and more.
Overview patterns from data sets, tune data parameters for advanced
model evaluation.
You will also begin working on your capstone project to solve a real-world
problem related to finance.
Project (100% of final grade)
(must be finance-related). You will acquire a real-world finance data set,
form a hypothesis about it, and then clean, parse, and apply modelling
techniques and data science principles.
● The capstone project will culminate your learning by applying the new
tools and concepts learnt to create a report that includes:
1. How is your Wifi connection? Latest version of Zoom is required if you’re having trouble with Zoom.
2. Laptop is required. Mobile or tablet screens are too small, and the screen will be different from the trainer demo if you use your mobile or tablet.
3. Recording and PDF slides are provided to you in your learners’ portal on Vertical Institute’s Website.
Things to take note…✍
To receive the funding support, please take note of the following:
● Minimum of 75% attendance (this means that you must attend at least 6 out of 7 lessons).
● Achieve at least a PASS for the Capstone Project.
● The Capstone Project has to be submitted by the deadline given (1 week from the end of the bootcamp).
a lesson?🗓
To arrange for a make-up class, you can contact the
Teaching Assistant or Admin and they will be able to
arrange for a make-up class nearer to the date.
Hello there! 👋
Instructor
Introduction 👋
● Name
● Occupation/School
FAANG companies love Python and use it for their real-world applications.
https://www.wsj.com/video/series/inside-tiktoks-highly-secretive-algorithm/investigation-how-tiktok-algorithm-figures-out-your-deepest-desires/6C0C2040-FF25-4827-8528-2BD6612E3796
Big data, on the other hand, is… bigger than traditional data, and not in the trivial sense.
From variety (numbers, text, but also images, audio, mobile data, etc.), to velocity
(retrieved and computed in real time), to volume (measured in tera-, peta-, exa-bytes),
big data is usually distributed across a network of computers.
https://www.kdnuggets.com/2018/06/what-where-how-data-science.html
Float: Short for floating-point number, usually used for numbers with decimals. Examples: -2.1, 2.8, 3.14159
Boolean: Takes the value True or False. Commonly used for controlling the flow of a program. Examples: True, False
Tuples: Tuples are another standard sequence data type. However, a tuple is immutable, meaning that once it is defined, you cannot delete, add, or edit any values inside it. Example: ('a','b','c','d','e')
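A minimal sketch of the three types described above. The variable names (pi_approx, is_large, letters) are illustrative and do not come from the slides.

```python
# Float: a number with a decimal point
pi_approx = 3.14159
print(type(pi_approx))

# Boolean: True or False, often used to control program flow
is_large = pi_approx > 3
if is_large:
    print("pi_approx is greater than 3")

# Tuple: an immutable sequence
letters = ('a', 'b', 'c', 'd', 'e')
try:
    letters[0] = 'z'  # tuples do not support item assignment
except TypeError as error:
    tuple_error = str(error)
    print("Cannot modify a tuple:", tuple_error)
```

Trying to change an element of the tuple raises a TypeError, which is what "immutable" means in practice.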
a = 123  # number
b = 'Hello'  # string
c = [1, 2, 3]  # list
d = {"1": "A", "2": "B"}  # dictionary
1. Python variable name can contain small case letters (a-z), upper case letters (A-Z),
numbers (0-9), and underscore (_).
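One way to check the rule above is the built-in string method isidentifier(). The candidate names below are made up for illustration. Note that isidentifier() applies Python's full identifier rules, which go slightly beyond the rule stated here (for example, it also accepts some non-English letters).

```python
# Hypothetical candidate names, chosen only to illustrate the rule
candidate_names = ["total_amount", "Rate2024", "_hidden", "interest-rate", "tax rate"]

# str.isidentifier() returns True if the string is a valid Python name
validity = {name: name.isidentifier() for name in candidate_names}

for name, is_valid in validity.items():
    print(name, "->", is_valid)
```

The first three names use only letters, numbers, and underscores, so they are valid. The last two contain a hyphen and a space, so they are not.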
● sorted()
● len()
● set()
● list()
● print()
● type()
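The built-in functions listed above can be tried on a small list. The list of amounts is made up for illustration.

```python
amounts = [250, 100, 250, 75]  # illustrative values only

print(type(amounts))            # shows that amounts is a list
count = len(amounts)            # number of items, duplicates included
ordered = sorted(amounts)       # new list in ascending order
unique = set(amounts)           # duplicates removed, order not guaranteed
unique_list = sorted(list(unique))  # back to a list, then sorted

print(count, ordered, unique_list)
```

sorted() returns a new list and leaves the original list unchanged.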
1. Syntax errors
2. Logical errors (exceptions)
AttributeError: raised when you try to access an attribute or call a method that the object's type does not support
ZeroDivisionError: raised when a number is divided by zero, or when the modulo of a number by zero is taken
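A short sketch that triggers both exceptions and handles them with try/except. The variable names are illustrative.

```python
caught = []  # names of the exceptions that were raised

numbers = [1, 2, 3]
try:
    numbers.upper()  # lists have no upper() method
except AttributeError as error:
    caught.append(type(error).__name__)
    print("AttributeError:", error)

try:
    result = 10 / 0  # division by zero
except ZeroDivisionError as error:
    caught.append(type(error).__name__)
    print("ZeroDivisionError:", error)

try:
    result = 10 % 0  # modulo by zero
except ZeroDivisionError as error:
    caught.append(type(error).__name__)
    print("ZeroDivisionError:", error)
```

Without the try/except blocks, the first error would stop the program.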
Operator Associativity (the order in which Python evaluates an expression containing multiple operators
of the same precedence)
1. Left associativity means that the expression is evaluated from left-to-right (almost all operators)
2. Right associativity means the expression is evaluated from right-to-left
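The two cases above can be seen with subtraction (left associative) and the exponent operator ** (right associative). The numbers are chosen only for illustration.

```python
# Left associativity: 10 - 4 - 3 is evaluated as (10 - 4) - 3
left_result = 10 - 4 - 3
left_grouped = (10 - 4) - 3
other_grouping = 10 - (4 - 3)  # a different answer if grouped from the right

# Right associativity: 2 ** 3 ** 2 is evaluated as 2 ** (3 ** 2)
right_result = 2 ** 3 ** 2
right_grouped = 2 ** (3 ** 2)
left_to_right = (2 ** 3) ** 2  # a different answer if grouped from the left

print(left_result, other_grouping)
print(right_result, left_to_right)
```

When in doubt, adding parentheses makes the intended order explicit.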
CRUD is an acronym that comes from the world of computing and refers to the four
functions (Create, Read, Update, and Delete) that are considered necessary to implement a persistent storage application.
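A minimal sketch of the four CRUD operations using Python's built-in sqlite3 module. The table name, column names, and values are hypothetical. The database here lives in memory for simplicity; passing a file name to sqlite3.connect() instead would store the data on disk.

```python
import sqlite3

# In-memory database, used here only for demonstration
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT)")

# Create: add a new row
conn.execute("INSERT INTO customers (id, name) VALUES (?, ?)", (1, "John Tan"))

# Read: fetch the row back
row_after_create = conn.execute(
    "SELECT id, name FROM customers WHERE id = ?", (1,)
).fetchone()

# Update: change the name in the existing row
conn.execute("UPDATE customers SET name = ? WHERE id = ?", ("John T.", 1))
row_after_update = conn.execute(
    "SELECT id, name FROM customers WHERE id = ?", (1,)
).fetchone()

# Delete: remove the row
conn.execute("DELETE FROM customers WHERE id = ?", (1,))
remaining = conn.execute("SELECT COUNT(*) FROM customers").fetchone()[0]

conn.close()
print(row_after_create, row_after_update, remaining)
```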
A financial institution maintains multiple databases that help it manage and keep track of existing
customers, financial products and spending patterns. Below are some of the common financial tables:
● A Customer Data Table includes attributes such as first and last name, personal identification
number, contact number, home address, work location, and any other relevant personal details.
● A Product Table includes the company’s financial products such as credit cards, loans and
trading activities.
● A Transaction Table contains data at the transaction level for each of the customers, including
frequency, amount and recency.
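As a rough sketch, frequency, amount and recency can be worked out per customer from transaction-level rows. All records, field names, and the as_of date below are made up for illustration and do not come from a real data set.

```python
from datetime import date

# Hypothetical transaction rows, one dictionary per transaction
transactions = [
    {"customer_id": 1, "amount": 120.0, "date": date(2024, 1, 5)},
    {"customer_id": 1, "amount": 80.0, "date": date(2024, 1, 20)},
    {"customer_id": 2, "amount": 300.0, "date": date(2024, 1, 10)},
]
as_of = date(2024, 1, 31)  # assumed reporting date

summary = {}
for t in transactions:
    entry = summary.setdefault(
        t["customer_id"],
        {"frequency": 0, "amount": 0.0, "last_date": t["date"]},
    )
    entry["frequency"] += 1                 # number of transactions
    entry["amount"] += t["amount"]          # total amount spent
    entry["last_date"] = max(entry["last_date"], t["date"])

# Recency: days since the customer's most recent transaction
for entry in summary.values():
    entry["recency_days"] = (as_of - entry["last_date"]).days

print(summary)
```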
Thank you!