Machine Learning with MATLAB
Abhishek Gupta
Sr. Application Engineer
Machine Learning
Characteristics and Examples
Characteristics
– Lots of data (many variables)
– System too complex to know the governing equation (e.g., black-box modeling)
Examples
– Pattern recognition (speech, images)
        AAA      AA       A        BBB      BB       B        CCC      D
AAA     93.68%   5.55%    0.59%    0.18%    0.00%    0.00%    0.00%    0.00%
AA               92.60%   4.03%    0.73%    0.15%    0.00%    0.00%    0.06%
A       0.14%    4.18%    91.02%   3.90%    0.60%    0.08%    0.00%    0.08%
BBB              0.23%    7.49%    87.86%   3.78%    0.39%    0.06%    0.16%
BB      0.03%    0.12%    0.73%    8.27%    86.74%   3.28%    0.18%    0.64%
CCC              0.00%    0.00%    0.37%    1.84%    6.24%    81.88%   9.67%
D       0.00%    0.00%    0.00%    0.00%    0.00%    0.00%    0.00%    100.00%
Challenges – Machine Learning
Overview – Machine Learning
Unsupervised Learning
– Clustering
Supervised Learning
– Develop predictive model based on both input and output data
– Classification
– Regression
Unsupervised Learning
Clustering
– k-Means, Fuzzy C-Means
– Hierarchical
– Neural Networks
– Gaussian Mixture
– Hidden Markov Model
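To make the first technique on this slide concrete, here is a minimal k-means sketch (Lloyd's algorithm). It is plain Python for illustration, not MATLAB toolbox code; the function name `kmeans`, the choice to seed centroids with the first k points, and the toy data are all assumptions made for this example.

```python
import math

def kmeans(points, k, iterations=100):
    """Lloyd's algorithm on a list of equal-length numeric tuples."""
    # Assumption: seed with the first k points so the run is deterministic.
    centroids = [tuple(p) for p in points[:k]]
    labels = None
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        new_labels = [
            min(range(k), key=lambda j: math.dist(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing, so the centroids are final
        labels = new_labels
        # Update step: each centroid moves to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # keep the old centroid if a cluster is empty
                centroids[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return labels, centroids

# Made-up data: two tight groups far apart.
points = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1),
          (5.0, 5.0), (5.1, 5.0), (5.0, 5.1)]
labels, centroids = kmeans(points, 2)
print(labels, centroids)
```

Real implementations usually pick starting centroids more carefully and restart several times, since the result of this algorithm depends on the initial seeds.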
Supervised Learning
Regression
Classification
Supervised Learning - Workflow
Speed up Computations
Select Model
Measure Accuracy
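The "Select Model" and "Measure Accuracy" steps can be sketched as a hold-out evaluation: fit a model on part of the data and score it on the rest. This is a plain Python illustration, not the workflow's actual tooling; the helper names (`holdout_split`, `predict_1nn`, `accuracy`), the 1-nearest-neighbor stand-in model, and the toy data are assumptions.

```python
import math
import random

def holdout_split(rows, test_fraction=0.3, seed=0):
    """Shuffle (features, label) rows and split them into train and test sets."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    n_test = max(1, int(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]

def predict_1nn(train, features):
    """Stand-in model: return the label of the closest training row."""
    _, label = min(train, key=lambda row: math.dist(row[0], features))
    return label

def accuracy(train, test):
    """Fraction of held-out rows whose predicted label matches the true one."""
    hits = sum(predict_1nn(train, x) == y for x, y in test)
    return hits / len(test)

# Toy, made-up data: two well-separated groups.
rows = [((i * 0.1, 0.0), "no") for i in range(10)] + \
       [((5 + i * 0.1, 5.0), "yes") for i in range(10)]
train, test = holdout_split(rows)
print(f"held-out accuracy: {accuracy(train, test):.2f}")
```

Comparing candidate models then amounts to swapping `predict_1nn` for another predictor and keeping the same split, so every model is scored on the same held-out rows.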
Example – Bank Marketing Campaign
Goal:
– Predict if customer would subscribe to bank term deposit based on different attributes
Approach:
– Reduce model complexity
– Use classifier for prediction
[Figure: Bank Marketing Campaign Misclassification Rate. Bar chart; y-axis: Percentage (0 to 100); series: No Misclassified, Yes Misclassified; x-axis: Neural Net, Logistic Regression, Discriminant Analysis, k-nearest Neighbors, Naive Bayes, Support VM, Decision Trees, TreeBagger, Reduced TB]
– [...] aid discovery
– Quick prototyping; Focus on modeling not programming
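The misclassification chart reports a separate percentage for the "No" and "Yes" classes. A minimal sketch of that calculation follows, in plain Python rather than MATLAB. It assumes each bar is the share of samples of a true class that the classifier predicted wrongly; the function name `misclassification_rates` and the labels are made up for illustration and are not the bank data set.

```python
from collections import Counter

def misclassification_rates(true_labels, predicted_labels):
    """Return, per true class, the percentage of its samples predicted wrongly."""
    totals = Counter(true_labels)
    wrong = Counter(t for t, p in zip(true_labels, predicted_labels) if t != p)
    return {cls: 100.0 * wrong[cls] / totals[cls] for cls in totals}

# Made-up labels for illustration only.
truth     = ["no", "no", "no",  "no", "yes", "yes", "no", "yes"]
predicted = ["no", "no", "yes", "no", "no",  "yes", "no", "no"]
rates = misclassification_rates(truth, predicted)
print(rates)
```

Reporting the two classes separately matters when one class is rare: a classifier that always answers "no" can have a low overall error while misclassifying every "yes" customer.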
Clustering
Overview
Why use clustering?
– Identify outliers
– Resulting groups may be the matter of interest
[Figure: example plot; x-axis ticks from -0.1 to 0.6]
Example – Clustering Corporate Bonds
Goal:
– [...] together
Approach:
– [...] techniques
– Evaluate clusters for validity
[Figure: Hierarchical Clustering. Two panels, "Dist Metric:spearman" and "Dist Metric:cosine"; both axes: Data Point # (ticks from 500 to 4000)]
Example – Clustering Corporate Bonds
– Numerous clustering functions with [...]
– Interactive visualizations to aid [...]
[Figure: two panels, "Hierarchical Clustering" (Dist Metric:spearman) and "k-Means Clustering" (Dist Metric:cosine); axis: Data Point #]
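The figure panels name two distance metrics, spearman and cosine. The sketch below shows one common way to compute them, assuming the usual definitions: cosine distance as one minus the cosine of the angle between two vectors, and Spearman distance as one minus the rank correlation. It is plain Python for illustration; the function names are hypothetical and this is not the code behind the slides.

```python
import math

def cosine_distance(u, v):
    """One minus the cosine of the angle between vectors u and v."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)

def average_ranks(values):
    """Rank values from 1, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1
        for i in order[start:end + 1]:
            ranks[i] = mean_rank
        start = end + 1
    return ranks

def spearman_distance(u, v):
    """One minus the Pearson correlation of the rank-transformed vectors."""
    ru, rv = average_ranks(u), average_ranks(v)
    mu, mv = sum(ru) / len(ru), sum(rv) / len(rv)
    cov = sum((a - mu) * (b - mv) for a, b in zip(ru, rv))
    var_u = sum((a - mu) ** 2 for a in ru)
    var_v = sum((b - mv) ** 2 for b in rv)
    return 1.0 - cov / math.sqrt(var_u * var_v)
```

Both distances range from 0 to 2. Cosine distance ignores vector length, and Spearman distance depends only on the ordering of the values, so the two can group the same data quite differently.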
Short-term Load Forecaster
Goal:
– Develop a tool for Excel users to generate next-day electricity demand predictions
Requirements:
– Easy to use interface
– Accurate predictive model
Deploying MATLAB Applications to Excel
[Diagram: (1) MATLAB Desktop with Toolboxes; (2) MATLAB Compiler and MATLAB Builder EX, producing .dll and .bas files; (3) End-User Machine]
Deployment Highlights
[Diagram labels: Database Servers, Desktop Applications, .exe, Excel Spreadsheets, HADOOP]
Royalty-free deployment
Point-and-click workflow
MATLAB for Machine Learning
Extract value from data
– Machine learning, Video, Image, and Financial
– Depth and breadth of algorithms in classification, clustering, and regression
Computation speed
– Fast training and computation
– Parallel computation, Optimized libraries
Learn More: Machine Learning with MATLAB
mathworks.com/machine-learning
Training Services
Exploit the full potential of MathWorks products
Consulting Services
Accelerating return on investment
A global team of experts supporting every stage of tool and process integration
– Continuous Improvement
– Automation
– Process and Technology Standardization
– Full Application Deployment
– Process Assessment
– Component Deployment
– Advisory Services
– Jumpstart
– Migration Planning
Technical Support
Resources
Over 100 support engineers
– All with MS degrees (EE, ME, CS)
– Local support in North America, Europe, and Asia
Comprehensive, product-specific Web support resources
MATLAB Central
Community for MATLAB and Simulink users
Over 1 million visits per month
File Exchange
– Upload/download access to free files including MATLAB code, Simulink models, and documents
– Ability to rate files, comment, and ask questions
– More than 12,500 contributed files, 300 submissions per month, 50,000 downloads per month
Newsgroup
– Web forum for technical discussions about MathWorks products
– More than 300 posts per day
Blogs
– Commentary from engineers who design, build, and support MathWorks products
– Open conversation at blogs.mathworks.com