Principal Component Analysis
Often a dataset has many columns: tens, hundreds, thousands or more. Modeling data
with many features is challenging, and models built from data that include irrelevant features
are often less skillful than models trained on the most relevant data. It is hard to know which
features of the data are relevant and which are not. Methods for automatically reducing the
number of columns of a dataset are called dimensionality reduction, and perhaps the most
popular method is principal component analysis, or PCA for short. This method is
used in machine learning to create projections of high-dimensional data, both for visualization
and for training models. The core of the PCA method is a matrix factorization method from
linear algebra: the eigendecomposition can be used, and more robust implementations may use
the singular-value decomposition, or SVD.
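The projection described above can be sketched in a few lines of NumPy. This is a minimal illustration, not a production implementation; the data values are made up, and the steps are: center the columns, factor with the SVD, and project onto the leading principal directions.

```python
import numpy as np

# Toy dataset: 5 samples, 3 features (hypothetical values for illustration)
X = np.array([
    [2.5, 2.4, 0.5],
    [0.5, 0.7, 1.9],
    [2.2, 2.9, 0.8],
    [1.9, 2.2, 0.6],
    [3.1, 3.0, 0.4],
])

# Center the data by subtracting the per-column mean
Xc = X - X.mean(axis=0)

# SVD of the centered data; the rows of Vt are the principal directions
U, s, Vt = np.linalg.svd(Xc, full_matrices=False)

# Project onto the first two principal components
X_2d = Xc @ Vt[:2].T
print(X_2d.shape)  # (5, 2)
```

Libraries such as scikit-learn wrap this same computation behind a `PCA` class, but the linear algebra underneath is exactly this factor-and-project step.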
Singular-Value Decomposition
Another popular dimensionality reduction method is the singular-value decomposition,
or SVD for short. As mentioned, and as its name suggests, it is a matrix
factorization method from the field of linear algebra. It has wide use in linear algebra and can
be applied directly in tasks such as feature selection, visualization, noise reduction and
more. We will see two more cases of using the SVD in machine learning below.
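The factorization itself is a one-line call in NumPy. The sketch below, using an arbitrary small matrix, shows the defining property: multiplying the three factors back together recovers the original matrix.

```python
import numpy as np

# An arbitrary 3x2 matrix to factor
A = np.array([[1., 2.],
              [3., 4.],
              [5., 6.]])

# Factor A into U, the singular values s, and V transposed
U, s, Vt = np.linalg.svd(A, full_matrices=False)

# Reconstructing from the factors recovers the original matrix
A_rebuilt = U @ np.diag(s) @ Vt
print(np.allclose(A, A_rebuilt))  # True
```

Truncating `s` to its largest values before reconstructing is what gives the SVD its use in noise reduction and dimensionality reduction.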
Recommender Systems
Predictive modeling problems that involve the recommendation of products are
called recommender systems, a sub-field of machine learning. Examples
include the recommendation of books based on previous purchases and
purchases by customers like you on Amazon, and the recommendation of
movies and TV shows to watch based on your viewing history and the viewing
history of subscribers like you on Netflix. The development of recommender
systems is primarily concerned with linear algebra methods. A simple example
is the calculation of the similarity between sparse customer behavior vectors
using distance measures such as Euclidean distance or dot products. Matrix
factorization methods like the singular-value decomposition are used widely in
recommender systems to distill item and user data to their essence for querying,
searching and comparison.
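The similarity calculation mentioned above can be sketched as follows. The customer names and purchase counts are invented for illustration; cosine similarity, a normalized dot product, is one common choice of measure alongside Euclidean distance.

```python
import numpy as np

# Hypothetical purchase-count vectors for three customers over five items
alice = np.array([1., 0., 3., 0., 2.])
bob   = np.array([0., 0., 2., 0., 1.])
carol = np.array([4., 1., 0., 2., 0.])

def cosine_similarity(a, b):
    # Dot product normalized by the vector lengths: 1.0 means identical direction
    return a @ b / (np.linalg.norm(a) * np.linalg.norm(b))

print(cosine_similarity(alice, bob))    # high: overlapping purchases
print(cosine_similarity(alice, carol))  # lower: little overlap
```

In a real system these vectors are extremely sparse (most customers touch few items), so sparse matrix representations and factorizations like the SVD are used rather than dense arrays.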
Deep Learning
Artificial neural networks are nonlinear machine learning algorithms that are
inspired by elements of the information processing in the brain and have proven
effective at a range of problems, not least predictive modeling. Deep learning is
the recent resurgence in the use of artificial neural networks with newer methods and
faster hardware that allow for the development and training of larger and
deeper (more layers) networks on very large datasets. Deep learning methods
routinely achieve state-of-the-art results on a range of challenging problems
such as machine translation, photo captioning, speech recognition and much
more.
At their core, the execution of neural networks involves linear algebra data
structures multiplied and added together. Scaled up to multiple dimensions,
deep learning methods work with vectors, matrices and even tensors of inputs
and coefficients, where a tensor generalizes vectors and matrices to more than
two dimensions. Linear algebra is central to deep learning, from the description
of methods via matrix notation to their implementation in libraries such as Google's
TensorFlow Python library, which has the word "tensor" in its name.
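The multiply-and-add structure described above can be made concrete with a single fully connected layer, sketched here in plain NumPy rather than a deep learning framework. The shapes and random weights are arbitrary; the point is that the layer is just a matrix product plus a bias vector, passed through a nonlinearity (here ReLU).

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(4, 3))   # a batch of 4 inputs with 3 features each
W = rng.normal(size=(3, 5))   # weight matrix mapping 3 features to 5 units
b = np.zeros(5)               # bias vector, one entry per unit

def dense_relu(X, W, b):
    # Matrix multiply, add bias, then clip negatives to zero (ReLU)
    return np.maximum(0.0, X @ W + b)

out = dense_relu(X, W, b)
print(out.shape)  # (4, 5)
```

Stacking many such layers, with different nonlinearities and higher-dimensional tensors in place of the matrices, is essentially what frameworks like TensorFlow execute, on hardware optimized for exactly these linear algebra operations.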
Summary
In this chapter, you discovered 10 common examples of machine learning that
you may be familiar with that use and require linear algebra. Specifically, you
learned:
- The use of linear algebra structures when working with data such as tabular datasets and images.
- Linear algebra concepts when working with data preparation, such as one hot encoding and dimensionality reduction.
- The ingrained use of linear algebra notation and methods in sub-fields such as deep learning, natural language processing and recommender systems.
3.12.1 Next
This is the end of the first part. In the next part, you will discover how to
manipulate arrays of data in Python using NumPy.