PLSR Using MATLAB

The document provides a detailed guide on performing Partial Least-Squares Regression (PLSR) using MATLAB R2018a, including steps for importing data, normalizing it, and executing the regression. It outlines the necessary code snippets for data manipulation, regression analysis, and visualization of results, such as percent variance explained and scores plots. Additionally, it covers cross-validation techniques to assess the model's predictive capability.

Lazzara Lab

Samantha Clayton
April 14, 2019

Partial Least-Squares Regression (PLSR) in MATLAB R2018a

Importing Data into MATLAB

1. Click the Home tab in MATLAB, press the “Import Data” button, and select the dataset you would like to use.

2. The dataset opens in the Import Tool. Select the data you would like to use, then press the “Import Selection” button.
The output type options are table, column vectors, numeric matrix, cell array, or string array, depending on what you
are importing. I suggest using a string array for this process to retain the column labels (if your dataset has them) and
to manipulate them more easily. If you select something other than a table, make sure that all the rows and columns
you wish to import are selected, as MATLAB may not select them by default.

a. You can also click the arrow on the Import Selection button to open a drop-down menu. Choose “Generate Script”
or “Generate Function” to automate the import if you need to run it again.
b. Alternatively, save the workspace after importing all of the files you need, then load the workspace instead of
importing the data again the next time you run the analysis.

save('plsr_workspace') % saves all workspace variables to plsr_workspace.mat

load('plsr_workspace.mat') % loads the saved variables back into the workspace

3. The dataset will be imported into MATLAB as the output type you selected, with the same name as the original
file. If the data is in two separate files, repeat the previous steps for the second file. The remaining steps and sample
code below assume that the data was imported as string arrays, here named Xmatrix and Ymatrix, with column labels in
the first row and row labels in the first column. Separate the data and labels:
X = str2double(Xmatrix(2:end, 2:end));
Y = str2double(Ymatrix(2:end, 2:end));
X_col_labels = cellstr(Xmatrix(1,2:end));
X_row_labels = cellstr(Xmatrix(2:end,1));
Y_col_labels = cellstr(Ymatrix(1,2:end));
Y_row_labels = cellstr(Ymatrix(2:end,1));
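
As a quick check of the indexing above, the following sketch builds a small string array and separates it the same way. The sample names, variable names, and values are made up for illustration and do not come from any real dataset.

```matlab
% Hypothetical 3-by-3 string array standing in for an imported file:
% first row = column labels, first column = row labels.
Xmatrix = ["",        "pERK", "pAKT";
           "sample1", "0.8",  "1.2";
           "sample2", "1.5",  "0.4"];

X = str2double(Xmatrix(2:end, 2:end));      % 2-by-2 numeric matrix
X_col_labels = cellstr(Xmatrix(1, 2:end));  % {'pERK', 'pAKT'}
X_row_labels = cellstr(Xmatrix(2:end, 1));  % {'sample1'; 'sample2'}

% str2double returns NaN for any entry that is not a valid number, so
% counting NaNs is a cheap way to spot cells that did not import cleanly.
n_bad = nnz(isnan(X));
```

If n_bad is greater than zero, inspect the original file before continuing, since those entries will otherwise be treated as missing values.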

4. If the X and Y data are in the same table (here a string array named data_matrix), split the X and Y portions into two separate arrays:
Data = str2double(data_matrix(2:end,2:end));
X = Data(:,1:last_x_col);
Y = Data(:,first_y_col:end); %last_x_col and first_y_col are column indices into Data; define them based on your input
X_col_labels = cellstr(data_matrix(1,2:last_x_col+1)); %+1 because data_matrix still contains the row-label column
Y_col_labels = cellstr(data_matrix(1,first_y_col+1:end));
X_row_labels = cellstr(data_matrix(2:end,1));
Y_row_labels = X_row_labels; %X and Y share the same rows (observations)

Performing the Regression

5. The data needs to be normalized before using PLSR. You can use the zscore function to do this.
z_x = zscore(X);
z_y = zscore(Y);
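
To make the normalization concrete, here is a minimal sketch comparing zscore with the calculation written out by hand: each column has its mean subtracted and is divided by its sample standard deviation. The matrix is made up for illustration, and the hand-written line relies on implicit expansion (R2016b or later).

```matlab
% Made-up data: 4 observations, 2 variables on very different scales.
X = [1 10; 2 20; 3 30; 4 40];

z_x = zscore(X);                            % Statistics and Machine Learning Toolbox
z_manual = (X - mean(X)) ./ std(X);         % same calculation written out

max_diff  = max(abs(z_x(:) - z_manual(:))); % should be at round-off level
col_means = mean(z_x);                      % approximately [0 0]
col_stds  = std(z_x);                       % approximately [1 1]
```

After normalization every variable has the same variance, so a variable measured in large units does not dominate the PLS components.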

6. Code for the PLSR function:

ncomp = 5; %use only 5 components → change this to desired number of components
[x_loadings, y_loadings, x_scores, y_scores, beta, pctvar, mse, stats] = plsregress(z_x, z_y, ncomp);
- If there are any missing values in the matrix, they must be removed before proceeding to this step.
- If ncomp is omitted, its default value is min(size(X,1)-1,size(X,2)).
- You can also specify parameter name/value pairs for cross-validation, ‘cv’ or ‘mcreps’ (See step 10 for sample
code used to perform cross-validation).
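
One way to handle missing values is to drop every observation (row) that has a NaN in either X or Y before normalizing in step 5. The sketch below uses made-up data and assumes that discarding whole rows is acceptable for your analysis; dropping the affected variable (column) instead is an equally valid choice that this sketch does not cover.

```matlab
% Made-up data with one missing predictor value and one missing response.
X = [1 2; NaN 4; 5 6; 7 8];
Y = [1; 2; NaN; 4];
X_row_labels = {'s1'; 's2'; 's3'; 's4'};

% Keep only the rows that are complete in both X and Y.
keep = ~any(isnan(X), 2) & ~any(isnan(Y), 2);
X = X(keep, :);
Y = Y(keep, :);
X_row_labels = X_row_labels(keep);   % keep the labels aligned with the data
```

Filtering the label arrays with the same logical index keeps the point labels in the scores plot matched to the correct observations.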

From the MathWorks documentation for plsregress:

● plsregress computes a partial least-squares (PLS) regression of Y on X, using ncomp PLS components, and
returns the predictor and response loadings in XL and YL, respectively.
● X is an n-by-p matrix of predictor variables, with rows corresponding to observations and columns to variables.
● Y is an n-by-m response matrix.
● XL is a p-by-ncomp matrix of predictor loadings, where each row contains coefficients that define a linear
combination of PLS components that approximate the original predictor variables.
● YL is an m-by-ncomp matrix of response loadings, where each row contains coefficients that define a linear
combination of PLS components that approximate the original response variables.
● XS represents the predictor scores, that is, the PLS components that are linear combinations of the variables in X.
XS is an n-by-ncomp orthonormal matrix with rows corresponding to observations and columns to components.
● YS represents the linear combinations of the responses with which the PLS components XS have maximum
covariance. YS is an n-by-ncomp matrix with rows corresponding to observations and columns to components.
YS is neither orthogonal nor normalized.
● PCTVAR is a 2-by-ncomp matrix containing the percentage of variance explained by the model. The first row of
PCTVAR contains the percentage of variance explained in X by each PLS component, and the second row
contains the percentage of variance explained in Y. (These values are R2Y values.)
● MSE is a 2-by-(ncomp+1) matrix containing estimated mean-squared errors for PLS models with 0:ncomp
components. The first row of MSE contains mean-squared errors for the predictor variables in X, and the second
row contains mean-squared errors for the response variables in Y.
● stats is a structure with fields: W (a p-by-ncomp matrix of PLS weights, such that XS = X0*W), T2 (the T² statistic
for each point in XS), Xresiduals (the predictor residuals, that is, X0 – XS*XL'), and Yresiduals (the response
residuals, that is, Y0 – XS*YL'), where X0 and Y0 are the centered X and Y.
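
The list above does not describe the beta output. As I understand the MathWorks documentation, beta is a (p+1)-by-m matrix of regression coefficients whose first row holds the intercepts, so fitted responses are [ones(n,1), X]*beta. The sketch below checks this, along with the orthonormality of XS, on random made-up data; the sizes and variable names are arbitrary choices for illustration.

```matlab
rng(0);                                  % reproducible made-up data
n = 20; p = 4; m = 2; ncomp = 2;
X = randn(n, p);
Y = X * randn(p, m) + 0.1 * randn(n, m);

[XL, YL, XS, YS, beta, pctvar, mse, stats] = plsregress(X, Y, ncomp);

size_beta = size(beta);                  % expected to be [p+1, m]
Y_fit = [ones(n, 1), X] * beta;          % fitted responses (intercept in row 1 of beta)

% Fitted values plus the returned residuals should reproduce Y.
recon_err = max(max(abs(Y - (Y_fit + stats.Yresiduals))));

% The columns of XS are orthonormal, so XS'*XS should be the identity.
orth_err = max(max(abs(XS' * XS - eye(ncomp))));
```

Both recon_err and orth_err should be at round-off level. The same [ones(n,1), X]*beta expression can be applied to new observations, provided they are scaled the same way as the training data.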

Displaying outputs:

7. Use the example code below to plot the cumulative percent variance in Y explained by the components (R2Y):

figure;
plot(1:ncomp,cumsum(100*pctvar(2,:)),'-bo');
xlabel('Components')
ylabel('Percent Variance Explained in Y')

8. Use the example code below to create a scores plot (shown here for the first two components):

figure;
hold on
x = x_scores(:,1); % scores on component 1
y = x_scores(:,2); % scores on component 2
scatter(x,y);
xlabel('Component 1')
ylabel('Component 2')
xlim([-1 1]); % the columns of x_scores have unit length, so every score lies within [-1, 1]
ylim([-1 1]);
box on
hax = gca;
line([0 0],get(hax,'YLim'),'Color','k','LineStyle','--') % dashed vertical line at x = 0
hline = refline([0 0]); hline.Color = 'k'; hline.LineStyle = '--'; % dashed horizontal line at y = 0
labels = X_row_labels;
dx = 0.02; dy = 0.02; % displacement so the text does not overlay the data points
text(x+dx, y+dy, labels, 'Fontsize', 10, 'Interpreter', 'none'); % labeling points

9. Follow the example code provided below to create a loadings plot:

This portion handles loadings with identical values (points that would overlap exactly on the plot) before labeling. It is not necessary if there are none.

[C,ia] = unique(x_loadings(:,1:2),'rows'); % unique (loading 1, loading 2) pairs; ia = row index of the first occurrence of each
duplicate_ind = setdiff(1:size(x_loadings, 1), ia); % indices of rows that repeat an earlier row
duplicate_value = x_loadings(duplicate_ind, 1:2); % the repeated values (empty if there are no duplicates)

%% Plotting loading values

figure;
x1 = C(:,1); %if portion above is not used, replace C with x_loadings
y1 = C(:,2);
x_dup = x_loadings(duplicate_ind,1);
y_dup = x_loadings(duplicate_ind,2);
scatter(x1,y1,'o')
xlabel('Loading 1')
ylabel('Loading 2')
xlim([-3 3]); ylim([-3 3]); %adjust these limits as necessary to include all data points on your plot
box on
hax = gca;
line([0 0],get(hax,'YLim'),'Color','k','LineStyle','--')
hline = refline([0 0]); hline.Color = 'k'; hline.LineStyle = '--';
cell_col = X_col_labels(ia); %if portion above is not used, remove (ia) and just use X_col_labels
dx = max(x_loadings(:,1))/50;
dy = max(x_loadings(:,2))/50; % displacement so the text does not overlay the points
text(x1+dx, y1+dy, cell_col, 'Fontsize', 10, 'Interpreter', 'none','Color','k');

The following portion adds shifted labels for points whose values duplicate another point. If there are no duplicate
values, this portion is again not necessary.

for i = 1:numel(duplicate_ind) % label every duplicate, each shifted further down so the labels do not overlap
text(x_dup(i)+dx, y_dup(i)+dy-2.5*i*dy, X_col_labels(duplicate_ind(i)), 'Fontsize', 10, 'Interpreter', ...
'none','Color','k')
end

Cross-validation

10. To perform a PLSR calculation with cross-validation, modify the plsregress command accordingly:

ncomp = 5; %use only 5 components → change this to desired number of components

ncv = 5; %perform PLSR with 5-fold cross-validation -- this is an integer value, up to the total number of
%rows in the input data sets (for this sample data set, can be up to 8)
mcreps = 1; %number of Monte-Carlo repetitions used during cross-validation

[x_loadings, y_loadings, x_scores, y_scores, beta, pctvar, mse, stats] = plsregress(z_x, z_y, ncomp, 'cv', ...
ncv, 'mcreps', mcreps);

%Calculate Q2Y values for models with 1 to ncomp components using mean squared errors from cross-validation.
%For this particular data set, the Q2Y values will be negative, indicating that the PLSR model developed here
%is not a predictive one.
Q2Y = 1 - mse(2,2:end)/sum(sum((z_y-mean(z_y)).^2)./size(z_y,1));
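
Written out as an equation, the Q2Y line above computes the following for each model size k. The symbols are my own notation rather than the author's: n is the number of observations, m the number of response variables, y_ij the normalized responses in z_y, and MSE_Y^CV(k) the cross-validated value stored in mse(2,k+1).

```latex
Q^2_Y(k) \;=\; 1 \;-\;
\frac{\mathrm{MSE}^{\mathrm{CV}}_{Y}(k)}
     {\dfrac{1}{n}\displaystyle\sum_{i=1}^{n}\sum_{j=1}^{m}\left(y_{ij}-\bar{y}_{j}\right)^{2}},
\qquad k = 1,\dots,\mathrm{ncomp}
```

Because z_y has already been z-scored, each column has a sample variance of 1, so the denominator reduces to m(n-1)/n. A Q2Y value at or below zero means that the cross-validated error is no smaller than the total variance of the normalized responses, which is why negative values indicate a non-predictive model.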
