AI-Lecture 8 (Machine Learning Overview)
Machine learning approach
• Traditional Programming: Data + Program → Computer → Output
• Machine Learning: Data + Output → Computer → Program
Machine Learning (ML)
• ML is a branch of artificial intelligence:
  • Uses computing-based systems to make sense of data
  • Extracting patterns, fitting data to functions, classifying data, etc.
• ML systems can learn and improve
  • With historical data, time, and experience
• Bridges theoretical computer science and real, noisy data.
ML in real-life
ML in a Nutshell
• Tens of thousands of machine learning algorithms
• Every machine learning algorithm has three components:
  – Representation
  – Evaluation
  – Optimization
ML Components
• Representation
  – Numerical functions
    ● Linear regression
    ● Neural networks
    ● Support vector machines
  – Symbolic functions
    ● Decision trees
    ● Sets of rules / Logic programs
  – Instance-based functions
    ● Nearest-neighbor
    ● Case-based
  – Probabilistic Graphical Models
    ● Naïve Bayes
    ● Bayesian networks
    ● Hidden Markov Models (HMMs)
    ● Probabilistic Context-Free Grammars (PCFGs)
    ● Markov networks
ML Components
• Various Search/Optimization Algorithms
  – Gradient descent (see the sketch after this list)
    ● Perceptron
    ● Backpropagation
  – Dynamic Programming
    ● HMM learning
    ● PCFG learning
  – Divide and Conquer
    ● Decision tree induction
    ● Rule learning
  – Evolutionary Computation
    ● Genetic Algorithms (GAs)
    ● Genetic Programming (GP)
    ● Neuro-evolution
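The perceptron rule from the list above can be read as stochastic gradient descent applied only to misclassified examples. A minimal training sketch in Python; the AND data, learning rate, and epoch count are made up for illustration:

import numpy as np

def perceptron(X, y, lr=1.0, epochs=20):
    """Labels y in {-1, +1}; update weights only on misclassified examples."""
    w = np.zeros(X.shape[1])
    b = 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            if yi * (np.dot(w, xi) + b) <= 0:  # misclassified (or on the boundary)
                w += lr * yi * xi              # nudge the boundary toward xi
                b += lr * yi
    return w, b

# Usage: learn the AND function with -1/+1 labels.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([-1, -1, -1, 1])
w, b = perceptron(X, y)
print(np.sign(X @ w + b))  # [-1. -1. -1.  1.]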
ML Components
• Evaluation
– Accuracy
– Precision and recall
– Squared error
– Likelihood
– Posterior probability
– Cost / Utility
– Margin
– Entropy
– K-L divergence
– Etc.
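Several of the metrics above are one-liners in scikit-learn (which appears later in these slides). A minimal sketch on made-up labels:

from sklearn.metrics import (accuracy_score, precision_score,
                             recall_score, mean_squared_error)

y_true = [1, 0, 1, 1, 0, 1]  # illustrative ground truth
y_pred = [1, 0, 0, 1, 0, 1]  # illustrative predictions

print(accuracy_score(y_true, y_pred))      # 5/6 correct
print(precision_score(y_true, y_pred))     # TP / (TP + FP) = 3/3
print(recall_score(y_true, y_pred))        # TP / (TP + FN) = 3/4
print(mean_squared_error(y_true, y_pred))  # squared error = 1/6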
Types of Learning
• Supervised (inductive) learning
  – Training data includes desired outputs
• Unsupervised learning
  – Training data does not include desired outputs
• Reinforcement learning
  – Learning from rewards for sequences of actions
Reinforcement learning
• Learning to play Breakout
• https://www.youtube.com/watch?v=V1eYniJ0Rnk
Clustering
• Crime prediction using k-means clustering
• http://www.grdjournals.com/uploads/article/GRDJE/V02/I05/0176/GRDJEV02I050176.pdf
Machine learning algorithms
• Regression:
  Ridge regression, Support Vector Machines, Random Forest, Multilayer Neural Networks, Deep Neural Networks, ...
• Classification:
  Naive Bayes, Support Vector Machines, Random Forest, Multilayer Neural Networks, Deep Neural Networks, ...
• Clustering:
  k-Means, Hierarchical Clustering, ...
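As a small illustration of the clustering entry above, a minimal sketch with scikit-learn's KMeans; the 2-D points are made up:

from sklearn.cluster import KMeans
import numpy as np

# Toy 2-D points forming two loose groups.
X = np.array([[1.0, 1.0], [1.2, 0.8], [0.9, 1.1],
              [8.0, 8.0], [8.2, 7.9], [7.8, 8.1]])

kmeans = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)
print(kmeans.labels_)           # cluster index assigned to each point
print(kmeans.cluster_centers_)  # one centroid per cluster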
Issues
• Many machine learning/AI projects fail (Gartner claims 85%)
Reasons for failure
• Asking the wrong question
• Trying to solve the wrong problem
• Not having enough data
• Not having the right data
• Having too much data
• Hiring the wrong people
• Using the wrong tools
• Not having the right model
• Not having the right yardstick
Frameworks
• Programming languages (fast-evolving ecosystem!)
  – Python
  – R
  – C++
  – ...
• Many libraries
  – scikit-learn ("classic" machine learning)
  – PyTorch, TensorFlow, Keras (deep learning frameworks)
  – …
scikit-learn
• Nice end-to-end framework
  – data exploration (+ pandas + holoviews)
  – data preprocessing (+ pandas)
    ● cleaning/missing values
    ● normalization
  – training
  – testing
  – application
• "Classic" machine learning only
• https://scikit-learn.org/stable/
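A minimal end-to-end sketch of the stages above (splitting, preprocessing, training, testing) with the scikit-learn API; the choice of the bundled iris dataset and logistic regression is illustrative:

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LogisticRegression

X, y = load_iris(return_X_y=True)

# Hold out a test set for later evaluation.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0)

# Preprocessing (normalization) and training chained in one pipeline.
model = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))
model.fit(X_train, y_train)

# Testing: accuracy on the held-out data.
print(model.score(X_test, y_test))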
Keras
• High-level framework for deep learning
• TensorFlow backend
• Layer types
– dense
– convolutional
– pooling
– embedding
– recurrent
– activation
– …
• https://keras.io/
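A minimal Keras sketch wiring up a few of the layer types listed above (dense layers with activations); the layer sizes are illustrative, not from the slides:

from tensorflow import keras

# Small fully-connected network: 4 inputs -> 8 hidden units -> 3 classes.
model = keras.Sequential([
    keras.layers.Dense(8, activation="relu", input_shape=(4,)),
    keras.layers.Dense(3, activation="softmax"),
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
model.summary()
# model.fit(X_train, y_train, epochs=10)  # train on your own data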
Supervised and Unsupervised Learning
• Unsupervised Learning
  • There is no predefined, known set of outcomes
  • Look for hidden patterns and relations in the data
  • A typical example: Clustering
[Figure: k-means clusters of the iris data (irisCluster$cluster), Petal.Width vs. Petal.Length]
Supervised and Unsupervised Learning
• Supervised Learning
  • For every example in the data there is always a predefined outcome
  • Models the relations between a set of descriptive features and a target (fits data to a function)
  • Two groups of problems:
    • Classification
    • Regression
Supervised Learning
• Classification
  • Predicts which class a given sample of data (sample of descriptive features) is part of (a discrete value).
[Figure: confusion matrix (percent) for iris classification; e.g. 96.0% of virginica and 96.0% of versicolor samples predicted correctly]
• Regression
  • Predicts a continuous (real) value.
Machine Learning as a Process
• Define Objectives
  – Define measurable and quantifiable goals
  – Use this stage to learn about the problem
• Data Preparation
  – Normalization
  – Transformation
  – Missing values
  – Outliers
• Model Deployment
ML as a Process: Data Preparation
• Needed for several reasons
  • Some models have strict data requirements
    • Scale of the data, data point intervals, etc.
  • Some characteristics of the data may have a dramatic impact on model performance
• Time spent on data preparation should not be underestimated
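A minimal sketch of two common preparation steps named above (filling missing values and normalizing scale), using scikit-learn's SimpleImputer and StandardScaler on made-up data:

import numpy as np
from sklearn.impute import SimpleImputer
from sklearn.preprocessing import StandardScaler

# Illustrative data with one missing value (np.nan).
X = np.array([[1.0, 2.0], [np.nan, 3.0], [7.0, 6.0]])

X = SimpleImputer(strategy="mean").fit_transform(X)  # replace NaN with column mean
X = StandardScaler().fit_transform(X)                # zero mean, unit variance
print(X)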
ML as a Process: Feature engineering
• Determining the predictors (features) to be used is one of the most critical questions
• Sometimes we need to add predictors
• Reduce the number of predictors:
  • Fewer predictors: a more interpretable and less costly model
  • Most models are affected by high dimensionality, especially by non-informative predictors
• Selection approaches:
  – Wrappers: algorithms that use multiple models, adding and removing parameters, with models as input and a performance measure as output (e.g. Genetic Algorithms)
  – Filters: evaluate the relevance of each predictor, normally based on correlations
• Binning predictors
View of Std ML Datasets
• A single table (2D array): one row per example
• Columns: Feature 1, Feature 2, ..., Feature N, and the Output (Category)
• Data Splitting
  • Allocate data to different tasks
    • model training
    • performance evaluation
  • Define Training, Validation and Test sets
• Feature Selection (review the decisions made previously)
• Estimating Performance
  • Visualization of results – discovering interesting areas of the problem space
  • Statistics and performance measures
• Evaluation and Model selection
  • The 'no free lunch' theorem: no a priori assumptions can be made
  • Avoid simply reaching for a favorite model
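A minimal sketch of data splitting plus cross-validated model selection with the scikit-learn API; the candidate models and dataset are illustrative:

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split, cross_val_score
from sklearn.tree import DecisionTreeClassifier
from sklearn.neighbors import KNeighborsClassifier

X, y = load_iris(return_X_y=True)

# Keep a test set untouched for the final performance estimate.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0)

# Compare candidate models on the training data via 5-fold cross-validation.
for model in (DecisionTreeClassifier(), KNeighborsClassifier()):
    scores = cross_val_score(model, X_train, y_train, cv=5)
    print(type(model).__name__, scores.mean())
# Only the finally chosen model gets evaluated on the held-out test set.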
Nearest Neighbors: Basic Algorithm for Classification
• Find the K nearest neighbors to the test-set example
  • Or find all examples within radius R
• Combine their 'votes'
  – Most common category
  – Average value (real-valued prediction)
  – Can also weight votes by distance
  – Lots of variations on the basic theme
[Figure: 2-D scatter of + and - training points with a query point marked ?]
Simple Example: 1-NN
(1-NN ≡ one nearest neighbor)
Training Set
1. a=0, b=0, c=1 → +
2. a=0, b=0, c=0 → -
3. a=1, b=1, c=1 → -
Test Example
a=0, b=1, c=0 → ?
"Hamming Distance" (# of differing bits)
Ex 1 = 2
Ex 2 = 1
Ex 3 = 2
Ex 2 is nearest, so output -
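The same 1-NN computation as a minimal Python sketch:

# Training set from the slide: ((a, b, c), label) pairs.
train = [((0, 0, 1), "+"), ((0, 0, 0), "-"), ((1, 1, 1), "-")]
test = (0, 1, 0)

def hamming(u, v):
    """Number of differing bits."""
    return sum(a != b for a, b in zip(u, v))

# 1-NN: take the label of the single closest training example.
nearest = min(train, key=lambda ex: hamming(ex[0], test))
print(nearest[1])  # "-" (Ex 2 is at distance 1)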
From neurons to ANNs
• Inspiration: the biological neuron
• Inputs $x_1, x_2, \ldots, x_N$ with weights $w_1, w_2, \ldots, w_N$ and a bias $b$ (drawn as a $+1$ input)
• Output: $y = \sigma\left(\sum_{i=1}^{N} w_i x_i + b\right)$, where $\sigma(x)$ is the activation function
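A minimal numpy sketch of the formula above with a sigmoid activation; the input and weight values are illustrative:

import numpy as np

def neuron(x, w, b):
    """Single artificial neuron: y = sigma(sum_i w_i x_i + b), sigmoid sigma."""
    z = np.dot(w, x) + b
    return 1.0 / (1.0 + np.exp(-z))

x = np.array([1.0, 0.5, -1.0])   # inputs x_1..x_3
w = np.array([0.2, -0.4, 0.1])   # weights w_1..w_3
b = 0.3                          # bias
print(neuron(x, w, b))  # sigma(0.2) ≈ 0.55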
Multilayer network
• How to determine the weights?
Training: backpropagation
• Initialize weights "randomly"
• For all training epochs
  • for all input-output pairs in the training set
    • using the input, compute the output (forward)
    • compare the computed output with the training output
    • adapt the weights (backward) to improve the output
• if accuracy is good enough, stop
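A minimal sketch of this loop for a single sigmoid unit (logistic regression); a real multilayer network would propagate the error backward through every layer. The OR data and hyperparameters are illustrative:

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train(X, y, lr=0.5, epochs=1000):
    """One sigmoid unit trained by gradient descent on cross-entropy loss."""
    w = np.zeros(X.shape[1])
    b = 0.0
    for _ in range(epochs):
        out = sigmoid(X @ w + b)        # forward: compute output
        err = out - y                   # compare with training output
        w -= lr * (X.T @ err) / len(y)  # backward: adapt weights
        b -= lr * err.mean()
    return w, b

# Usage: learn the OR function.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([0, 1, 1, 1], dtype=float)
w, b = train(X, y)
print(np.round(sigmoid(X @ w + b)))  # [0. 1. 1. 1.]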
Task: handwritten digit recognition
• Input data
  • grayscale image
• Output data
  • digit 0, 1, ..., 9
• Training examples
• Test examples
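A minimal sketch of loading a standard handwritten-digit dataset (MNIST) through Keras; keras.datasets.mnist is part of the real Keras API:

from tensorflow import keras

# 28x28 grayscale images with labels 0..9, pre-split into train/test sets.
(x_train, y_train), (x_test, y_test) = keras.datasets.mnist.load_data()
print(x_train.shape, y_train.shape)  # (60000, 28, 28) (60000,)
print(x_test.shape, y_test.shape)    # (10000, 28, 28) (10000,)

x_train = x_train / 255.0  # scale pixel values to [0, 1]
x_test = x_test / 255.0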
Deep neural networks
• Many layers
• Features are learned, not given
• Low-level features combined into high-level features
Convolution examples
[Figure: example convolution kernels shown as 0/1 matrices]
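A minimal sketch of what a convolution does, applying a small edge-detecting kernel with scipy.signal.convolve2d (a real scipy function); the image and kernel are illustrative:

import numpy as np
from scipy.signal import convolve2d

# Tiny "image" with a vertical edge between columns 1 and 2.
image = np.array([[0, 0, 1, 1],
                  [0, 0, 1, 1],
                  [0, 0, 1, 1],
                  [0, 0, 1, 1]], dtype=float)

# 1x2 kernel that responds where neighboring pixels differ.
kernel = np.array([[1.0, -1.0]])

print(convolve2d(image, kernel, mode="valid"))  # nonzero only at the edge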
Task: sentiment analysis
• Training examples
• Test examples
Sample review: "<start> this film was just brilliant casting location ... myself so i loved the fact there was a real connection with this film the witty remarks throughout the film were great it was just brilliant so much that i bought the film as soon as it ..."
Word embedding
• Represent words as one-hot vectors
  • length = vocabulary size
  • Issues:
    • unwieldy
    • no semantics
• Word embeddings
  • dense vector
  • vector distance ≈ semantic distance
• Training
  • use context
  • discover relations with surrounding words
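A minimal sketch contrasting one-hot vectors with a trainable dense embedding, via Keras's real Embedding layer; the tiny vocabulary and dimensions are illustrative:

import numpy as np
from tensorflow import keras

vocab = ["the", "film", "was", "brilliant"]
vocab_size = len(vocab)

# One-hot: one dimension per word; all words are equally far apart.
one_hot = np.eye(vocab_size)
print(one_hot[vocab.index("film")])  # [0. 1. 0. 0.]

# Dense embedding: each word id maps to a short trainable vector.
embedding = keras.layers.Embedding(input_dim=vocab_size, output_dim=3)
word_ids = np.array([[vocab.index("film")]])
print(embedding(word_ids).numpy().shape)  # (1, 1, 3)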
End