0% found this document useful (0 votes)
13 views24 pages

DS231 Week 4

Uploaded by

Abdu 77
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
13 views24 pages

DS231 Week 4

Uploaded by

Abdu 77
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 24

‫الجامعة السعودية االلكترونية‬

‫الجامعة السعودية االلكترونية‬

‫‪26/12/2021‬‬
College of Computing and
Informatics
Bachelor of Data Science
DS-231
Introduction to Data Science
Programming

2
DS-231
Introduction to Data Science Programming
Week 4
Machine Learning Means … Using a
Machine to Learn from Data
Contents

1. Defining Machine Learning and Its Processes

2. Considering Learning Styles

3. Seeing What You Can Do

4
Weekly Learning
Outcomes
1. Grasping the machine learning process

2. Exploring machine learning styles and algorithms

3. Overviewing algorithms, deep learning, and


Apache Spark

5
Required Reading
1. Chapter 3. Machine Learning Means … Using a
Machine to Learn from Data (Lillian Pierson,
Data Science, 3rd Edition, 2021)

Videos
• Machine Learning Basics | What Is Machine Learning? | Introduction To Machine
Learning
https://youtu.be/ukzFI9rgwfU
• Supervised vs Unsupervised vs Reinforcement Learning
https://youtu.be/1FZ0A1QCMWc

6
1. Defining Machine Learning and Its Processes
Defining Machine Learning and Its Processes

• Machine learning is the practice of applying algorithmic


models to data over and over again so that your computer
discovers hidden patterns or trends that you can use to make
predictions. It’s also called algorithmic learning.
• Machine learning has a vast and ever-expanding assortment
of use cases, including Real-time Internet advertising, Spam
filtering, Recommendation engines, Natural language
processing and sentiment analysis, Automatic facial
recognition, etc.
Defining Machine Learning and Its Processes
Walking through the steps of the machine learning process
• Three main steps are involved in machine learning: setup,
learning, and application.
• Setup involves acquiring data, preprocessing it, selecting the
most appropriate variables for the task at hand (called
feature selection), and breaking the data into training and
test datasets. You use the training data to train the model,
and the test data to test the accuracy of the model’s
predictions.
Defining Machine Learning and Its Processes
Walking through the steps of the machine learning process
• The learning step involves model experimentation, training,
building, and testing.
• The application step involves model deployment and
prediction.
• Here’s a rule of thumb for breaking data into test and training
sets: Apply random sampling to two-thirds of the original
dataset in order to use that sample to train the model. Use
the remaining one-third of the data as test data, for
evaluating the model’s predictions.
Defining Machine Learning and Its Processes
Becoming familiar with machine learning terms
• Because machine learning is a branch of both traditional
statistics and computer science, it has adopted terms from
both fields and added a few of its own. Here is what you need
to know:
Instance: The same as row (in a data table), an observation (in
statistics), and a data point. Machine learning practitioners are also
known to call an instance case.
Feature: The same as a column or field (in a data table) and a
variable (in statistics). In regression methods, a feature is also called
an independent variable (IV).
Defining Machine Learning and Its Processes
Becoming familiar with machine learning terms
Target variable: The same as a predictant or dependent variable
(DV) in statistics.
• Although machine learning is often referred to in context of
data science and artificial intelligence, these terms are all
separate and distinct. Machine learning is a practice within
data science, but there is more to data science than just
machine learning
2. Considering Learning
Styles
Considering Learning Styles

• Machine learning can be applied in three main styles:


supervised, unsupervised, and semisupervised.
Supervised and unsupervised methods are behind most
modern machine learning applications, and
semisupervised learning is an up and coming star.
Considering Learning Styles
Learning with supervised algorithms
• Supervised learning algorithms require that input data has
labeled features.
• These algorithms learn from known features of that data
to produce an output model that successfully predicts
labels for new incoming, unlabeled data points.
• You use supervised learning when you have a labeled
dataset composed of historical values that are good
predictors of future events.
• Logistic regression is a type of supervised learning
Considering Learning Styles
Learning with unsupervised algorithms
• Unsupervised learning algorithms accept unlabeled data
and attempt to group observations into categories based
on underlying similarities in input features.
• Principal component analysis, k-means clustering, and
singular value decomposition are all examples of
unsupervised machine learning algorithms.
Considering Learning Styles
Learning with reinforcement
• Reinforcement learning is a behavior-based learning
model.
• It’s based on a mechanic like how humans and animals
learn.
• The model is given “rewards” based on how it behaves,
and it subsequently learns to maximize the sum of its
rewards by adapting the decisions it makes to earn as
many rewards as possible.
3. Seeing What You Can
Do
Seeing What You Can Do
Selecting algorithms based on function
• When you need to choose a class of machine learning
algorithms, it’s helpful to consider each model class based
on its functionality. For the most part, algorithmic
functionality falls into the categories shown in the
following Figure.
Machine
FIGURE 3-3:

learning
algorithms can
be broken down
.by function
Seeing What You Can Do
Selecting algorithms based on function
• Example of Deep learning algorithm: If you use Gmail, you
must be enjoying its autoreply functionality. You know —
the three 1-line messages from which you can choose an
autoreply to a message someone sent you? Well, this
autoreply functionality within Gmail is called Smart Reply,
and it is built on deep learning algorithms.
• Another innovation built on deep learning is Facebook
DeepFace, the Facebook feature that automatically
recognizes and suggests tags for the people who appear in
your Facebook photos.
Seeing What You Can Do
Using Spark to generate real-time big data analytics
• Apache Spark is an in-memory distributed computing
application that you can use to deploy machine learning
algorithms on big data sources in near-real-time to
generate analytics from streaming big data sources.
• In-memory refers to processing data within the computer’s
memory, without reading and writing its computational
results onto the disk. In-memory computing provides its
results a lot faster but cannot process much data per
processing interval.
Thank
You

You might also like