Module 1

Uploaded by

anushaj

The Machine Learning Landscape

• Common Misconception: Machine Learning = Robots (helpful or harmful)
• Reality: Machine Learning is already here (decades old)
• Examples of Existing Machine Learning Applications:
– Optical Character Recognition (OCR)
– Spam Filters (1990s)
• Machine Learning in Everyday Products & Features:
– Improved Recommendations (e.g., online shopping)
– Voice Search
Introduction
• What is Machine Learning?
– Not simply downloading data
– Machine Learning Exploration:
– Understanding Core Concepts
• Key Regions & Landmarks:
– Supervised vs Unsupervised Learning
– Online vs Batch Learning
– Instance-based vs Model-based Learning
What Is Machine Learning?
• Science (and art) of programming computers so they can learn
from data
• General Definition: Give computers ability to learn without explicit
programming (Arthur Samuel, 1959)
• Engineering Definition: Program learns from experience to
improve performance on a specific task (Tom Mitchell, 1997)
• Machine Learning in Action - Spam Filter Example
– Task (T): Flag spam emails
– Experience (E): Training data (examples of spam & non-spam
emails)
– Performance Measure (P): Accuracy (ratio of correctly
classified emails)
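The performance measure P above can be sketched in a few lines of Python. This is only an illustration: the function name `accuracy` and the five example results are assumptions, not part of the original slides.

```python
def accuracy(predictions, labels):
    """Ratio of correctly classified emails (performance measure P)."""
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    correct = sum(p == y for p, y in zip(predictions, labels))
    return correct / len(labels)


# Hypothetical results for five emails: True means "flagged as spam".
predicted = [True, False, True, False, False]
actual = [True, False, False, False, True]

# Three of the five predictions match the true labels.
print(accuracy(predicted, actual))
```

A filter "learns" in Mitchell's sense if this ratio tends to go up as it sees more training examples (experience E).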
Cont...
• Machine Learning vs. Just Downloading Data
– Downloading data (e.g., Wikipedia) doesn't make a computer
learn or improve at tasks
– Machine Learning requires using data to improve performance
on a specific task
Why Use Machine Learning?
The traditional approach - a spam filter using traditional
programming technique
1. First you would look at what spam
typically looks like. You might notice that
some words or phrases (such as “4U,”
“credit card,” “free,” and “amazing”) tend to
come up a lot in the subject. Perhaps you
would also notice a few other patterns
in the sender’s name, the email’s body,
and so on.
2. You would write a detection algorithm for
each of the patterns that you noticed,
and your program would flag emails as
spam if a number of these patterns are
detected.
3. You would test your program, and repeat
steps 1 and 2 until it is good enough.
Problem: your program will likely
become a long list of complex rules,
pretty hard to maintain.
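A rough sketch of what such a hand-written filter might look like is given here. The phrase list comes from the slide; the function name `is_spam_rule_based` and the threshold of 2 matching patterns are assumptions made for illustration.

```python
# Hand-picked patterns, taken from the phrases mentioned above.
SUSPICIOUS_PHRASES = ["4u", "credit card", "free", "amazing"]


def is_spam_rule_based(subject, threshold=2):
    """Flag an email when at least `threshold` hand-written patterns match."""
    text = subject.lower()
    hits = sum(phrase in text for phrase in SUSPICIOUS_PHRASES)
    return hits >= threshold


print(is_spam_rule_based("Amazing FREE credit card 4U"))  # True: four patterns match
print(is_spam_rule_based("Meeting notes for Tuesday"))    # False: no pattern matches
print(is_spam_rule_based("Amazing offer, just For U"))    # False: only "amazing" matches
```

The last call shows the maintenance problem: a small change in wording slips past the rules until someone adds a new pattern by hand.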
Traditional Programming:
• Pros:
– Precise control over program logic and functionality
– Easier to understand and interpret the code
– More efficient for well-defined tasks with clear rules
– Often faster for simpler tasks
• Cons:
– Requires manual coding for every specific task
– Can be inflexible for adapting to new data or situations
– Difficulty in handling complex or large datasets
– Time-consuming to modify or update code for changing requirements
Machine Learning approach

The program is much shorter, easier to maintain,
and most likely more accurate.

A spam filter based on Machine Learning
techniques automatically learns which words and
phrases are good predictors of spam by detecting
unusually frequent patterns of words in the spam
examples.

If spammers notice that all their emails
containing “4U” are blocked, they might start
writing “For U” instead. A spam filter using
traditional programming techniques would need
to be updated to flag “For U” emails. If spammers
keep working around your spam filter, you will
need to keep writing new rules forever.
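To make "learning which words are good predictors" concrete, here is a minimal sketch that picks out words appearing much more often in spam than in non-spam. It is a toy stand-in, not a real spam-filtering algorithm: the function `learn_spam_words`, the `min_ratio` cutoff, and all example subjects are assumptions.

```python
from collections import Counter


def learn_spam_words(spam_subjects, ham_subjects, min_ratio=2.0):
    """Return words that are much more frequent in spam than in ham."""
    spam_counts = Counter(w for s in spam_subjects for w in s.lower().split())
    ham_counts = Counter(w for s in ham_subjects for w in s.lower().split())
    learned = set()
    for word, n_spam in spam_counts.items():
        # Add one to the ham count so unseen words do not divide by zero.
        ratio = n_spam / (ham_counts[word] + 1)
        if ratio >= min_ratio:
            learned.add(word)
    return learned


# Hypothetical training data.
spam = ["free gift 4u", "win free money", "free offer 4u"]
ham = ["lunch on friday", "free time for a call", "project update"]

before = learn_spam_words(spam, ham)
print(sorted(before))  # ['4u']

# Spammers switch to "For U"; users flag the new emails as spam.
spam.extend(["deal for u", "prize for u", "bonus for u"])
after = learn_spam_words(spam, ham)
print(sorted(after))  # ['4u', 'u']
```

No rule was rewritten between the two calls; only the training data changed. Note that "free" is not learned here because it also shows up in a legitimate email.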
Machine Learning Approach:
• Pros:
– Learns from data, improving performance over time
– Can identify patterns and make predictions in complex data
– Adapts to new data and situations without explicit programming
– Efficient for handling large and evolving datasets
• Cons:
– Can be less interpretable ("black box") - understanding how the
model arrives at a decision can be difficult
– Requires expertise in Machine Learning and data preparation
– Training data can be time-consuming and expensive to collect
and label
– Performance can be unpredictable and may require ongoing fine-tuning
Automatically adapting to change
• A spam filter based on Machine
Learning techniques automatically
notices that “For U” has become
unusually frequent in spam flagged
by users, and it starts flagging
such emails without your intervention
• Speech recognition - e.g., telling the
spoken words “one” and “two” apart
• There is no known simple algorithm, and
hand-written rules become too complex
• So Machine Learning can be
used by providing numerous example
recordings of each word.
Machine Learning can help humans learn
• ML algorithms can be inspected to see what
they have learned (although for some
algorithms this can be tricky).
• For instance, once the spam filter has been
trained on enough spam, it can easily be
inspected to reveal the list of words and
combinations of words that it believes are the
best predictors of spam. Sometimes this will
reveal unsuspected correlations or new
trends, and thereby lead to a better
understanding of the problem.
• Applying ML techniques to dig into large
amounts of data can help discover patterns that
were not immediately apparent. This is called
data mining.
Summary:
• Traditional programming is ideal for well-defined tasks
with clear rules and where precise control is needed.
• Machine Learning is a powerful tool for complex problems
with large datasets, where the ability to learn and adapt is
crucial.
Machine Learning is great for:
• Problems for which existing solutions require a lot of hand-tuning
or long lists of rules: one Machine Learning algorithm can often
simplify code and perform better.
• Complex problems for which there is no good solution at all using
a traditional approach: the best Machine Learning techniques can
find a solution.
• Fluctuating environments: a Machine Learning system can adapt
to new data.
• Getting insights about complex problems and large amounts of
data
Types of Machine Learning Systems
• Classify them in broad categories based on:
– Whether or not they are trained with human
supervision (supervised, unsupervised,
semisupervised, and Reinforcement Learning)
– Whether or not they can learn incrementally on the fly
(online versus batch learning)
– Whether they work by simply comparing new data
points to known data points, or instead detect patterns in
the training data and build a predictive model, much like
scientists do (instance-based versus model-based
learning)
Supervised/Unsupervised Learning

• There are four major categories according to the
amount and type of supervision they get during
training
– supervised learning,
– unsupervised learning,
– semisupervised learning, and
– Reinforcement Learning
Supervised learning
Machine Learning systems can be classified according to the amount and type of supervision
they get during training. In supervised learning, the training data includes the desired
solutions, called labels.
A typical supervised learning task is classification. The spam filter is a good example of this: it
is trained with many example emails along with their class (spam or ham), and it must learn
how to classify new emails.
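As an illustration of classification, the sketch below uses k-Nearest Neighbors (one of the supervised algorithms this module lists) on a made-up training set. The two features per email, the data points, and the function name `knn_classify` are assumptions chosen for the example.

```python
import math
from collections import Counter


def knn_classify(train, query, k=3):
    """Predict the majority class among the k training points nearest to query."""
    nearest = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Hypothetical features per email: (suspicious words, links in the body).
training_set = [
    ((5, 4), "spam"), ((4, 6), "spam"), ((6, 5), "spam"),
    ((0, 1), "ham"), ((1, 0), "ham"), ((0, 0), "ham"),
]

print(knn_classify(training_set, (5, 5)))  # spam
print(knn_classify(training_set, (1, 1)))  # ham
```

The labels ("spam" or "ham") in the training set are what makes this supervised: the algorithm is told the right answer for every training example.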
Supervised learning and Type - Regression
Another typical task is to predict a target numeric value, such as the price of a car, given a set of
features (mileage, age, brand, etc.) called predictors. This sort of task is called regression. To
train the system, you need to give it many examples of cars, including both their predictors and
their labels (i.e., their prices).
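A minimal regression sketch, assuming a single predictor (mileage) and an ordinary least-squares straight line. The car data is invented and deliberately lies on an exact line so the result is easy to check; `fit_line` is a hypothetical helper name.

```python
def fit_line(xs, ys):
    """Least-squares fit of y = slope * x + intercept for one predictor."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical cars: mileage in thousands of km (predictor),
# price in thousands (label).
mileage = [10, 30, 50, 70, 90]
price = [19, 17, 15, 13, 11]

slope, intercept = fit_line(mileage, price)
predicted_price = slope * 60 + intercept  # price estimate for a car at 60k km
print(slope, intercept, predicted_price)
```

With this data the fitted slope is about -0.1 and the intercept about 20, so the estimate for 60 thousand km comes out close to 14. Real data would add more predictors (age, brand, etc.) and would not fit a line exactly.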
Supervised learning and Types
• Classification problems ask the algorithm to predict a discrete
value that can identify the input data as a member of a particular
class or group. Taking up the animal photos dataset, each photo
has been labeled as a dog, a cat, etc., and then the algorithm has
to classify the new images into any of these labeled categories.
• Regression problems deal with continuous target values, e.g.,
predicting the price of a piece of land in a city, given the area,
location, etc. Here, the input is sent to the machine for predicting
the price according to previous instances, and the machine
determines a function that maps the inputs to the prices. If the
results are not accurate enough, the model's parameters are adjusted
(in neural networks, via backpropagation) and training is repeated
until the results are satisfactory.
supervised learning algorithms
• important supervised learning algorithms
– k-Nearest Neighbors
– Linear Regression
– Logistic Regression
– Support Vector Machines (SVMs)
– Decision Trees and Random Forests
– Neural networks
Unsupervised learning

In unsupervised learning, as you might guess, the training data is unlabeled. The system tries
to learn without a teacher.
Unsupervised learning
Types of unsupervised learning

There are four types of unsupervised learning tasks:

– clustering,
– Anomaly detection and novelty detection
– association rules, and
– visualization and dimensionality reduction.
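Clustering, the first task in the list above, can be sketched with a bare-bones one-dimensional k-means. This is a simplified illustration under stated assumptions: the visitor data, the starting centroids, and the function name `kmeans_1d` are all made up, and real implementations handle initialization and convergence more carefully.

```python
def kmeans_1d(values, centroids, iterations=10):
    """Alternate between assigning points to the nearest centroid and re-averaging."""
    centroids = list(centroids)
    for _ in range(iterations):
        clusters = [[] for _ in centroids]
        for v in values:
            nearest = min(range(len(centroids)), key=lambda i: abs(v - centroids[i]))
            clusters[nearest].append(v)
        # Keep the old centroid if a cluster ends up empty.
        centroids = [sum(c) / len(c) if c else centroids[i]
                     for i, c in enumerate(clusters)]
    return centroids, clusters


# Hypothetical unlabeled data: minutes per visit for eight blog visitors.
visits = [1, 2, 3, 2, 20, 22, 21, 19]
centers, groups = kmeans_1d(visits, centroids=[0, 10])
print(centers)  # [2.0, 20.5]
print(groups)   # [[1, 2, 3, 2], [20, 22, 21, 19]]
```

No labels are given anywhere: the two groups (short visits and long visits) emerge from the data alone.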
Cont..

• Clustering
• Visualization
• Anomaly Detection
• Association rules

Reinforcement Learning
Comparison
Semi - Supervised Learning
Batch and Online Learning
• Another criterion used to classify Machine Learning systems is
whether or not the system can learn incrementally from a stream of
incoming data.
• Types -
• Batch learning
• Online learning
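The difference between the two can be sketched with a deliberately tiny "model" whose only parameter is a mean. The batch version needs the whole dataset at once; the online version updates its estimate one instance at a time and never stores past data. The class `OnlineMean`, its method `learn_one`, and the stream values are assumptions for illustration only.

```python
def batch_mean(values):
    """Batch learning: use all available data in a single training run."""
    return sum(values) / len(values)


class OnlineMean:
    """Online learning: update the estimate as each new instance arrives."""

    def __init__(self):
        self.count = 0
        self.mean = 0.0

    def learn_one(self, value):
        self.count += 1
        # Move the current estimate a step toward the new value.
        self.mean += (value - self.mean) / self.count
        return self.mean


stream = [4, 8, 6, 2]  # hypothetical incoming data

model = OnlineMean()
running = [model.learn_one(x) for x in stream]
print(running)             # [4.0, 6.0, 6.0, 5.0]
print(batch_mean(stream))  # 5.0
```

Both approaches end at the same estimate here, but the online model had a usable estimate after every instance, whereas the batch model must be retrained from scratch on the full dataset whenever new data arrives.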
Instance-Based Versus Model-Based Learning
• Instance-based learning • Model-based learning
Main Challenges of Machine Learning
• Two things can go wrong: “bad algorithm” and “bad data.”
• BAD DATA
– Insufficient Quantity of Training Data
– Nonrepresentative Training Data
– Poor-Quality Data
– Irrelevant Features
• BAD ALGORITHM
– Overfitting the Training Data
– Underfitting the Training Data
Insufficient Quantity of Training Data
The Unreasonable Effectiveness of Data
Nonrepresentative Training Data
Overfitting the Training Data
Cont..
