
International Journal of Statistics and Mathematics

Vol. 6(3), pp. 137-142, December, 2019. © www.premierpublishers.org. ISSN: 2375-0499

Research Article

Linear Algebra – A Powerful Tool for Data Science


Hasheema Ishchi
Lecturer in Mathematics Department, Education Faculty, Jawzjan University, Sheberghan, Afghanistan
Email Address: [email protected]

Analysis of data is an important task in data management systems, and many mathematical tools are used in data analysis. A new division of data management has appeared with machine learning, in which linear algebra is an optimal tool to analyse and manipulate data. Data science is a multi-disciplinary subject that uses scientific methods to process structured and unstructured data and extract knowledge by applying suitable algorithms and systems. The strength of linear algebra is often overlooked by researchers due to poor understanding, yet it powers major areas of data science, including the hot fields of Natural Language Processing and Computer Vision. Data science enthusiasts often find programming languages easier for analysing big data than mathematical tools like linear algebra. Nevertheless, linear algebra is a must-know subject in data science: it opens up possibilities for working with and manipulating data. In this paper, some applications of linear algebra in data science are explained.

Keywords: Data, Information, Data Science, Linear Algebra

INTRODUCTION

Data science is the field of study that combines domain expertise, programming skills, and knowledge of mathematics and statistics to extract meaningful insights from data. Data science practitioners apply machine learning algorithms to numbers, text, images, video, audio, and more to produce artificial intelligence systems that perform tasks ordinarily requiring human intelligence. In turn, these systems generate insights that analysts and business users can translate into tangible business value (Armbrust et al., 2010). Machine learning is the branch of data science used to design algorithms that automatically extract valuable information from data. The focus here is on "automatic": machine learning offers general-purpose methodologies that can be applied to datasets while producing something meaningful (Kakhani et al., 2015; Philip et al., 2014).

Linear algebra is the branch of mathematics concerning linear equations, linear functions, and their representations through matrices and vector spaces. It helps us understand geometric notions in higher dimensions and perform mathematical operations on them. By definition, algebra deals primarily with scalars (one-dimensional entities), but linear algebra works with vectors and matrices (entities with two or more dimensional components) to handle linear equations and functions (Will, 2014).

Linear algebra is at the heart of almost all areas of mathematics, such as geometry and functional analysis (Hilbert and Lopez, 2011). Its concepts are a crucial prerequisite for understanding the theory behind data science. A data scientist does not need to master linear algebra before getting started in data science, but at some point it becomes necessary to understand how the different algorithms really work. Linear algebra is used in data science as follows.

Scalars, Vectors, Matrices and Tensors

• A scalar is a single number
• A vector is a 1-D array of numbers
• A matrix is a 2-D array
• A tensor is an n-dimensional array with n > 2

Fig. 1: Representation of data in data science using linear algebra
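These four objects map directly onto arrays in numerical libraries. A minimal sketch using NumPy (the library choice and the example values are illustrative, not prescribed by the paper):

```python
import numpy as np

scalar = 7.0                           # a single number
vector = np.array([3.0, 4.0])          # a 1-D array of numbers
matrix = np.array([[1.0, 2.0],
                   [3.0, 4.0]])        # a 2-D array
tensor = np.zeros((3, 8, 8))           # an n-D array with n > 2,
                                       # e.g. 3 channels of an 8x8 image

# The number of dimensions is what distinguishes the four objects:
print(np.ndim(scalar), vector.ndim, matrix.ndim, tensor.ndim)  # 0 1 2 3
```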


APPLICATIONS OF LINEAR ALGEBRA IN DATA SCIENCES

Fig. 2: Applications of linear algebra in data sciences

Linear Algebra in Machine Learning

The following are some application areas of linear algebra in machine learning:
1. Loss functions
2. Regularization
3. Covariance Matrix
4. Support Vector Machine Classification

Loss functions

Consider how well a model, say a Linear Regression model, fits the given data:
• Take some arbitrary prediction function (a linear function for a Linear Regression model)
• Use it on the independent features of the data to predict the output
• Calculate how far off the predicted output is from the actual output
• Use these calculated values to optimize the prediction function with some strategy like Gradient Descent

It is difficult to quantify how different a prediction is from the expected output. This issue is resolved using a loss function. A loss function is an application of the vector norm in linear algebra. The norm of a vector is simply its magnitude, and there are many types of vector norms; two are discussed here.

L1 Norm: Also known as the Manhattan Distance or Taxicab Norm. The L1 norm is the distance travelled from the origin to the vector if the only permitted directions are parallel to the axes of the space. In a 2D space, consider the vector (3, 4): travel 3 units along the x-axis and then 4 units parallel to the y-axis, or travel 4 units along the y-axis first and then 3 units parallel to the x-axis. In either case, a total of 7 units is travelled, so the L1 norm of (3, 4) is 7.

L2 Norm: Also known as the Euclidean Distance. The L2 norm is the shortest distance of the vector from the origin, shown as the red path in Fig. 3.

Fig. 3: Euclidean Distance

This distance is calculated using the Pythagoras Theorem: it is the square root of (3^2 + 4^2), which equals 5. If the predicted values are stored in a vector P and the expected values in a vector E, then P - E is the difference vector, and the norm of P - E is the total loss for the prediction.

Regularization

Regularization is a very important concept in data science: it is a technique used to prevent models from overfitting, and it is actually another application of the norm. A model is said to overfit when it fits the training data too well. Such a model does not perform well with new data because it has learned even the noise in the training data, so it cannot generalize to data it has not seen before.

Regularization penalizes overly complex models by adding the norm of the weight vector to the cost function. Since we want to minimize the cost function, we also need to minimize this norm. This causes unrequired components of the weight vector to shrink towards zero and prevents the prediction function from being overly complex.
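The two norms and the loss computation described above can be sketched in a few lines of NumPy (the library choice and the sample values are illustrative assumptions):

```python
import numpy as np

v = np.array([3.0, 4.0])

l1 = np.linalg.norm(v, ord=1)   # |3| + |4| = 7  (Manhattan / Taxicab distance)
l2 = np.linalg.norm(v)          # sqrt(3^2 + 4^2) = 5  (Euclidean distance)

# Loss as the norm of the difference between predicted and expected values:
P = np.array([2.5, 0.0, 2.0])   # predicted values
E = np.array([3.0, -0.5, 2.0])  # expected values
loss = np.linalg.norm(P - E)    # norm of the difference vector P - E

print(l1, l2, loss)
```

An L2 regularization penalty is this same norm applied to the weight vector and added to the cost function.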


Fig 4: Regularization

The L1 and L2 norms discussed above are used in two types of regularization:
• L1 regularization, used with Lasso Regression
• L2 regularization, used with Ridge Regression

Covariance Matrix

Bivariate analysis is an important step in data exploration for studying the relationship between pairs of variables. Covariance and correlation are measures used to study relationships between two continuous variables.

Covariance indicates the direction of the linear relationship between the variables. A positive covariance indicates that an increase or decrease in one variable is accompanied by the same in the other; a negative covariance indicates that an increase or decrease in one is accompanied by the opposite in the other.

Fig 5: Co-variance

On the other hand, correlation is the standardized value of covariance. A correlation value tells us both the strength and direction of the linear relationship and ranges from -1 to 1. Using the concepts of transpose and matrix multiplication in linear algebra, there is another expression for the covariance matrix:

cov = X^T X

Here, X is the standardized data matrix containing all numerical features.

Support Vector Machine Classification

The support vector machine is one of the most common classification algorithms and regularly produces remarkable results. It is an application of the concept of vector spaces in linear algebra. The Support Vector Machine, or SVM, is a discriminative, supervised machine learning classifier that works by finding a decision surface. In this algorithm, each data item is plotted as a point in an n-dimensional space (where n is the number of features), with the value of each feature being the value of a particular coordinate. Classification is then performed by finding the hyperplane that separates the two classes best, i.e. with the maximum margin, which is C in this case.

Fig 6: Support Vector Machine

A hyperplane is a subspace whose dimension is one less than that of its corresponding vector space, so it would be a straight line for a 2D vector space, a 2D plane for a 3D vector space, and so on. Again, the vector norm is used to calculate the margin.

Linear Algebra in Dimensionality Reduction

1. Principal Component Analysis
2. Singular Value Decomposition

1. Principal Component Analysis

Principal Component Analysis, or PCA, is an unsupervised dimensionality reduction technique. PCA finds the directions of maximum variance and projects the data along them to reduce the dimensions. Without going into the math, these directions are the eigenvectors of the covariance matrix of the data (Gupta et al., 2010; Slavkovic and Jevtic, 2012).
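The covariance expression and the PCA recipe just described can be combined into one short NumPy sketch. The toy data and the conventional 1/(n-1) scaling factor are assumptions added here; the paper gives only cov = X^T X:

```python
import numpy as np

rng = np.random.default_rng(0)
data = rng.normal(size=(100, 3))        # 100 samples, 3 numerical features

# Standardize the data, then cov = X^T X (scaled by 1/(n-1) by convention)
X = (data - data.mean(axis=0)) / data.std(axis=0)
cov = X.T @ X / (len(X) - 1)

# PCA: the eigenvectors of the covariance matrix are the directions of
# maximum variance; project the data onto the top k of them.
eigvals, eigvecs = np.linalg.eigh(cov)  # eigh returns ascending eigenvalues
order = np.argsort(eigvals)[::-1]       # reorder: largest variance first
k = 2
components = eigvecs[:, order[:k]]      # top-k principal directions
reduced = X @ components                # projected data, shape (100, 2)

print(cov.shape, reduced.shape)
```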


Eigenvectors of a square matrix are special non-zero vectors whose direction does not change even after the linear transformation (i.e., multiplication) by the matrix is applied. They are shown as the red-colored vectors in Fig. 7.

Fig 7: Eigen Vectors

2. Singular Value Decomposition

Singular Value Decomposition (SVD) is underrated and not discussed enough. It is a powerful technique of matrix decomposition with diverse applications; the focus here is on SVD in dimensionality reduction, specifically the variant known as Truncated SVD:
• Start with the large m x n numerical data matrix A, where m is the number of rows and n is the number of features
• Decompose it into 3 matrices
• Choose k singular values based on the diagonal matrix and truncate (trim) the 3 matrices accordingly
• Finally, multiply the truncated matrices to obtain the transformed matrix A_k. It has dimensions m x k, so it has k features with k < n. Applying truncated SVD to the Digits data yields a low-dimensional plot of the digits.

Linear Algebra in Natural Language Processing

1. Word Embeddings
2. Latent Semantic Analysis

Word Embeddings

Machine learning algorithms cannot work with raw textual data; the raw data needs to be converted into numerical and statistical features to create model inputs. There are many ways of extracting features from text data, such as:
• Meta attributes of a text, like word count, special character count, etc.
• NLP attributes of text, using Parts-of-Speech tags and grammar relations, like the number of proper nouns
• Word Vector Notations or Word Embeddings

Word Embeddings is a way of representing words as low-dimensional vectors of numbers while preserving their context in the document. These representations are obtained by training different neural networks on a large amount of text, called a corpus. They also help in analyzing syntactic similarity among words.
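The decompose-truncate-multiply steps of truncated SVD listed above, which also underpin Latent Semantic Analysis, can be sketched with NumPy; the matrix sizes and random data are illustrative stand-ins:

```python
import numpy as np

rng = np.random.default_rng(42)
A = rng.normal(size=(20, 8))        # m x n data matrix: m=20 rows, n=8 features

# Decompose A into three matrices: A = U @ diag(s) @ Vt
U, s, Vt = np.linalg.svd(A, full_matrices=False)

# Choose the k largest singular values and truncate the three matrices
k = 3
U_k, s_k, Vt_k = U[:, :k], s[:k], Vt[:k, :]

# Multiply the truncated matrices to obtain the transformed matrix A_k:
# same m rows, but only k features (k < n)
A_k = U_k @ np.diag(s_k)            # shape (20, 3)

print(A.shape, A_k.shape)
```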

Word2Vec and GloVe are two popular models to create Word Embeddings.
Latent Semantic Analysis

Latent Semantic Analysis (LSA), or Latent Semantic Indexing, is one of the techniques of Topic Modeling and another application of Singular Value Decomposition. Latent means 'hidden'. True to its name, LSA attempts to capture the hidden themes or topics in documents by leveraging the context around the words:
• First, generate the Document-Term matrix for the data
• Use SVD to decompose the matrix into 3 matrices: a Document-Topic matrix, a Topic Importance diagonal matrix, and a Topic-Term matrix
• Truncate the matrices based on the importance of topics

Linear Algebra in Computer Vision

Deep learning methods can achieve state-of-the-art results on challenging computer vision problems such as image classification, object detection and face recognition. Two building blocks are:
• Image representation as Tensors
• Convolution and Image Processing

Image representation as Tensors

A computer does not process images as humans do; machine learning algorithms need numerical features to work with. A digital image is made up of small indivisible units called pixels. Consider a grayscale image of the digit zero made of 8 x 8 = 64 pixels. Each pixel has a value in the range 0 to 255: a value of 0 represents a black pixel and 255 represents a white pixel. Conveniently, an m x n grayscale image can be represented as a 2D matrix with m rows and n columns, with the cells containing the respective pixel values.

A colored image is generally stored in the RGB system. Each image can be thought of as being represented by three 2D matrices, one for each of the R, G and B channels. A pixel value of 0 in the R channel represents zero intensity of the Red color, and 255 represents full intensity; each pixel is then a combination of the corresponding values in the three channels. In practice, instead of using 3 matrices to represent an image, a tensor is used. A tensor is a generalized n-dimensional matrix; for an RGB image, a 3rd-order tensor is used. Imagine it as three 2D matrices stacked one behind another.
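Both representations are a few lines of NumPy; the pixel values below are made up, and only the shapes matter:

```python
import numpy as np

# Grayscale: an m x n image is a 2-D matrix of pixel intensities (0-255).
gray = np.zeros((8, 8), dtype=np.uint8)
gray[2:6, 2:6] = 255              # a white square on a black background

# Color: an RGB image is a 3rd-order tensor -- three m x n channel
# matrices (R, G, B) stacked one behind another.
rgb = np.stack([gray, gray, gray])    # shape (3, 8, 8)

print(gray.shape, rgb.shape, int(gray.max()))
```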

Convolution and Image Processing

2D Convolution is a very important operation in image processing. It consists of the below steps:
• Start with a small matrix of weights, called a kernel or a filter

• Slide this kernel over the 2D input data, performing element-wise multiplication
• Add the obtained values and put the sum in a single output pixel

The operation can seem a bit complex, but it is widely used for performing various image processing tasks such as sharpening and blurring images and edge detection.

CONCLUSION

Linear algebra has vast uses in the real world. Linear algebra methods are applied in data science to improve the efficiency of algorithms and attain more accurate results. This paper compiled the applications of linear algebra in data science and gave an insight into each method. Data scientists can use linear algebra as a tool to analyze data sets. Machine learning approaches are of particular interest, considering that steadily increasing outputs and the accessibility of existing evidence are particular challenges for quality improvement in the research field.

REFERENCES

Armbrust M, Fox A, Griffith R, Joseph AD, Konwinski A, Lee G, Patterson D, Rabkin A, Stoica I, Zaharia M (2010). A view of cloud computing. Commun. ACM. 53: 50-58.
Bollen J, Van de Sompel H, Hagberg A, Chute R, Rodriguez MA, Balakireva L (2009). Clickstream data yields high-resolution maps of science. PLoS ONE. 4: 1-11.
Chapman P, Clinton J, Kerber R, Shearer C, Wirth R (2000). CRISP-DM 1.0: Step-by-step data mining guide. The CRISP-DM Consortium.
Davenport TH, Harris JG (2007). Competing on Analytics: The New Science of Winning. 1st Ed. Harvard Business School Press. ISBN: 9781422103326.
Dean J, Ghemawat S (2008). MapReduce: Simplified data processing on large clusters. Commun. ACM. 51: 107-113.
Gupta S, Sahoo OP, Goel A, Gupta R (2010). A new optimized approach to face recognition using eigenfaces. Global Journal of Computer Science and Technology. 10: 15-17.
Hilbert M, Lopez P (2011). The world's technological capacity to store, communicate and compute information. Science. 332: 60-65.
Jones BF, Wuchty S, Uzzi B (2008). Multi-university research teams: Shifting impact, geography and stratification in science. Science. 322: 1259-1262.
Kakhani MK, Kakhani S, Biradar SR (2015). Research issues in big data analytics. International Journal of Application or Innovation in Engineering and Management. 2: 228-232.
Philip CL, Chen Q, Zhang CY (2014). Data-intensive applications, challenges, techniques and technologies: A survey on big data. Information Sciences. 275: 314-347.
Slavkovic M, Jevtic D (2012). Face recognition using eigenface approach. Serbian Journal of Electrical Engineering. 9: 121-130.
Sturm P, Ramalingam S, Tardif JP, Gasparini S, Barreto J (2010). Camera models and fundamental concepts used in geometric computer vision. Foundations and Trends in Computer Graphics and Vision. 6: 1-183.
Szeliski R (2010). Computer Vision: Algorithms and Applications. London: Springer.
Waldrop MM (1992). Complexity: The Emerging Science at the Edge of Order and Chaos. Simon & Schuster. ISBN: 978-0671872342.
Will H (2014). Linear Algebra for Computer Vision. Attribution-ShareAlike 4.0 International (CC BY-SA 4.0). 1-14.
Wuchty S, Jones BF, Uzzi B (2007). The increasing dominance of teams in production of knowledge. Science. 316: 1038-1039.

Accepted 18 November 2019

Citation: Ishchi H. (2019). Linear Algebra – A Powerful Tool for Data Science. International Journal of Statistics and Mathematics, 6(3): 137-142.

Copyright: © 2019 Ishchi H. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are cited.
