Module 01 - Introduction (1)
• Two categories:
• Classification: A classification problem is one where the output variable is a
category, such as “red” or “blue”, or “disease” and “no disease”.
• Regression: A regression problem is one where the output variable is a real value,
such as “dollars” or “weight”.
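To make the distinction concrete, here is a minimal sketch, assuming scikit-learn is available; the features and targets are invented for illustration.

```python
# Minimal sketch: the same inputs can feed either a classifier or a regressor,
# depending on whether the target is a category or a real value.
from sklearn.linear_model import LogisticRegression, LinearRegression

X = [[120, 0, 0], [100, 120, 0], [1000, 600, 300], [300, 100, 0]]

# Classification: the target is a category ("good" / "bad")
y_class = ["good", "good", "bad", "bad"]
clf = LogisticRegression().fit(X, y_class)
print(clf.predict([[150, 30, 0]]))    # -> a category label

# Regression: the target is a real value (e.g., dollars)
y_reg = [120.0, 95.5, 10.0, 60.0]
reg = LinearRegression().fit(X, y_reg)
print(reg.predict([[150, 30, 0]]))    # -> a real number
```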
Classification
Example (figure omitted; ref: Western Digital)
Typical Supervised Learning Techniques
• K Nearest Neighbors
• Linear Regression
• Logistic Regression
• Support Vector Machines (SVMs)
• Decision Trees
• Random Forests
• Certain Types of Neural Networks
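Most of these techniques are available in scikit-learn with the same fit/predict interface; a minimal sketch, assuming scikit-learn and a toy dataset invented for illustration:

```python
# Sketch: several of the listed supervised techniques, trained on toy data.
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import RandomForestClassifier

X = [[0, 0], [1, 1], [2, 2], [3, 3]]   # toy features
y = [0, 0, 1, 1]                       # toy labels

models = {
    "kNN": KNeighborsClassifier(n_neighbors=3),
    "SVM": SVC(),
    "Decision Tree": DecisionTreeClassifier(),
    "Random Forest": RandomForestClassifier(n_estimators=10),
}
for name, model in models.items():
    model.fit(X, y)                    # supervised: learn from labeled pairs
    print(name, model.predict([[1.5, 1.5]]))
```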
Unsupervised learning
• Create an internal representation of the input, capturing
regularities/structure in data
• Example:
Clustering: Discover groups of similar inputs (documents, images, etc)
What is ML (Unsupervised)?
Unsupervised
• Training uses information that is neither classified nor labeled.
• The algorithm acts on that information without guidance, grouping
unsorted inputs according to their similarities, patterns, and
differences.
• No labeled examples are given to the machine.
• The algorithm is left to discover the hidden structure in unlabeled data on its own.
What is ML?
Two categories of algorithms:
• Clustering: discovering the inherent groupings in the data, such as
grouping customers by purchasing behavior.
• Association: discovering rules that describe large portions of the data,
such as “people who buy X also tend to buy Y”.
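A minimal sketch of the clustering category, assuming scikit-learn; the customer numbers (visits per month, average spend) are made up:

```python
# Cluster customers by purchasing behavior without any labels.
from sklearn.cluster import KMeans

customers = [[2, 10], [3, 12], [20, 200], [22, 180], [21, 190]]
labels = KMeans(n_clusters=2, n_init=10).fit_predict(customers)
print(labels)   # two discovered groups, e.g. [0, 0, 1, 1, 1]
```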
Unsupervised Learning Techniques
What is ML?
Example:
• Document Clustering
• Finding fraudulent transactions
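The fraud example can be framed as unsupervised anomaly detection; one possible sketch, assuming scikit-learn's IsolationForest and invented transaction amounts:

```python
# Flag transactions that look unlike the rest -- no labels are needed.
from sklearn.ensemble import IsolationForest

amounts = [[12.0], [15.5], [9.9], [14.2], [13.1], [950.0]]   # one odd amount
detector = IsolationForest(contamination=0.2, random_state=0).fit(amounts)
print(detector.predict(amounts))   # -1 marks the suspected outlier(s)
```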
Clustering
Example (figures omitted; ref: Western Digital)
Example: Association Rules
• Stores can use customers’ purchase
histories to determine their
shopping patterns
• If someone buys certain combinations of
products, they are more likely to also
buy certain other products
• Useful for placing items in stores and
targeting ads
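A rough, hand-rolled sketch of this idea (not the Apriori or Eclat algorithms listed next); the baskets are invented for illustration:

```python
# Estimate the confidence of a rule "x -> y": of the customers who bought x,
# what fraction also bought y?
baskets = [
    {"bread", "butter", "jam"},
    {"bread", "butter"},
    {"bread"},
    {"beer", "chips"},
]

def confidence(x, y, baskets):
    with_x = [b for b in baskets if x in b]
    return sum(y in b for b in with_x) / len(with_x)

print(confidence("bread", "butter", baskets))   # 2/3 of bread buyers also buy butter
```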
Typical Unsupervised Learning Techniques
• Clustering
• k-Means
• Hierarchical Cluster Analysis (HCA)
• Expectation Maximization
• Visualization and dimensionality reduction
• Principal Component Analysis (PCA) (see the sketch after this list)
• Kernel PCA
• Locally-Linear Embedding (LLE)
• t-distributed Stochastic Neighbor Embedding (t-SNE)
• Association rule learning
• Apriori
• Eclat
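As one concrete example from this list, a minimal PCA sketch, assuming scikit-learn; the exam scores mirror the illustrative table on the next slide:

```python
# Reduce five exam-score dimensions to two components for visualization.
from sklearn.decomposition import PCA

scores = [
    [70, 75, 90, 95, 93],
    [77, 79, 85, 83, 81],
    [90, 95, 75, 80, 73],
    [90, 90, 95, 90, 95],
]
reduced = PCA(n_components=2).fit_transform(scores)
print(reduced.shape)   # (4, 2): each student is now a 2-D point
```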
Supervised vs. Unsupervised Learning
Supervised example (Classification, Regression):

#Account | Balance | 3-month past due | 6-month past due | Outcome
3        | 120     | 0                | 0                | Good
1        | 100     | 120              | 0                | Good
5        | 1000    | 600              | 300              | Bad
3        | 300     | 100              | 0                | Bad

Unsupervised example (Clustering, Dimension Reduction, Association Rules):

History | Literature | Math | Chemistry | Physics | Group
70      | 75         | 90   | 95        | 93      | Good at Science
77      | 79         | 85   | 83        | 81      | Average at both
90      | 95         | 75   | 80        | 73      | Good at Social
90      | 90         | 95   | 90        | 95      | Good at both
The machine learning framework
y = f(x), where y is the output (prediction), f is the prediction function, and x is the image feature
• Training: given a training set of labeled examples {(x1,y1), …, (xN,yN)}, estimate the
prediction function f by minimizing the prediction error on the training set
• Testing: apply f to a never-before-seen test example x and output the predicted
value y = f(x)
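A minimal sketch of this train-then-test loop, assuming scikit-learn; the numbers are invented for illustration:

```python
# Fit a prediction function f on labeled pairs, then apply it to a new x.
from sklearn.linear_model import LinearRegression

X_train = [[1], [2], [3], [4]]     # training inputs x_i
y_train = [2.1, 3.9, 6.2, 8.1]     # training labels y_i

f = LinearRegression().fit(X_train, y_train)   # minimize error on the training set
x_test = [[5]]                                 # a never-before-seen example
print(f.predict(x_test))                       # predicted value y = f(x)
```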
Testing pipeline (figure): Test Image → Image Features → Learned model → Prediction
Slide credit: D. Hoiem and L. Lazebnik
Types of testing
• Evaluate performance by testing on data NOT used for training
(both training and test sets should be randomly sampled)
• Use cross-validation methods for small data sets
• The more (relevant) data, the better.
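A minimal sketch of a hold-out split and cross-validation, assuming scikit-learn and a synthetic dataset:

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split, cross_val_score
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=200, n_features=5, random_state=0)

# Hold-out split: evaluate on data NOT used for training.
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)
model = LogisticRegression().fit(X_train, y_train)
print("hold-out accuracy:", model.score(X_test, y_test))

# Cross-validation: a more reliable estimate for small data sets.
print("5-fold CV accuracy:", cross_val_score(LogisticRegression(), X, y, cv=5).mean())
```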
Testing
• How well does the learned system work?
• Generalization
• Performance on unseen or unknown scenarios or data
• Brittle vs robust performance
Evaluation
• Given some data, how can we tell if a function is “good”?
• Accuracy
• Precision and recall
• Squared error
• Likelihood
• Posterior probability
• Cost / Utility
• Margin
• Entropy
• K-L divergence
• Etc.
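A small sketch of a few of these measures, assuming scikit-learn's metrics module; the labels and values below are made up:

```python
from sklearn.metrics import accuracy_score, precision_score, recall_score, mean_squared_error

# Classification-style measures on made-up binary labels.
y_true = [1, 0, 1, 1, 0, 1]
y_pred = [1, 0, 0, 1, 0, 1]
print("accuracy :", accuracy_score(y_true, y_pred))
print("precision:", precision_score(y_true, y_pred))
print("recall   :", recall_score(y_true, y_pred))

# Squared error is the natural measure for regression outputs.
print("MSE:", mean_squared_error([2.0, 3.5, 5.0], [2.2, 3.0, 5.4]))
```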