The Role of Mathematics in Machine Learning: March 2023
1 author:
Sopan Talekar
Nashik District Maratha Vidya Prasarak Samaj's K.B.T. College of Engineering
All content following this page was uploaded by Sopan Talekar on 23 March 2023.
Dr Vipul V Bag
Professor, N K Orchid College of Engineering, Solapur, Maharashtra, India
1 INTRODUCTION
In computer science, many tasks and problems are difficult to solve with traditional programs,
methods, and explicit instructions. Creating a dynamic game or a desktop application is an
enormous and complex undertaking, and building the best machine for a person, or a self-driving
car in which the computer recognizes objects, is not easy either. These are things that
computers cannot easily do. The world is full of data produced by sources such as computers,
phones, and other devices in the form of images, music, words, spreadsheets, and videos, and
this growth does not seem to be slowing down anytime soon. People analyze data and build
systems to find patterns in it, but the redundant and ever-growing nature of data makes this
difficult to do by hand. One way around this is to make the computer learn and figure out how
to improve itself through many examples. This is machine learning (ML). Effective use of
high-performance computing architectures, unmatched execution power, and productive use of
statistical tools for data extraction are factors that make ML a trending technology in the
scientific computing community.
ML algorithms use assumptions based on mathematical models, including calculus, statistics,
and probability. ML promises to derive meaning from data. It is a collection of tools and
technologies that can be used to answer questions about your data. For example, ML helps
e-commerce companies offer the right products to their customers by analyzing customer reviews,
and helps them improve those products. To be able to do this, ML has to combine programming
with a good deal of mathematics.
3.1 Statistics
Statistics is the core of everything [8]. It has been used extensively to estimate the value of
a population parameter. Calculus expresses how a model learns and how a model can be
optimized. Linear algebra makes running these algorithms feasible on enormous data sets [13].
TABLE 1. Data set used to create a model of apartment cost estimator concerning the
area of the apartment
Price Per Square Foot    Price of Plot
85                       534760
65                       535717
70                       833333
15                       728377
95                       899945
80                       914339
60                       403601
55                       437328
Probability assists in predicting the occurrence of events. Let us consider the problem of
predicting the price of a plot. The data set has two columns: one is the price per square foot
of a given plot and the other is the total price of the plot. The price per square foot is the
only feature available to predict the price of the plot as a whole. There is some correlation
in the data set. A predictive model captures the correlation between the dependent and
independent variables, which is what allows it to predict the price of the plot as a whole
given the price per square foot. The graph shown in Fig. 3 has an x-axis measuring the price
per square foot and a y-axis measuring the price of the plot. It is a scatter plot. Ideally, a
line can be found that passes as close as possible to as many data points as possible. This
line can be used for prediction.
FIGURE 3. Scatter plot showing the correlation between Price per square foot (x-axis, 0 to 100)
and Price of the Plot (y-axis, 0 to 1,000,000)
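The strength of the correlation in the data set can be quantified. The following is a minimal
sketch, assuming the eight rows of Table 1 as input and using the Pearson correlation
coefficient as the measure; the paper does not name a specific coefficient, so that choice and
the function name pearson_r are assumptions made for illustration.

```python
from math import sqrt

# Rows of Table 1: (price per square foot, price of plot)
DATA = [
    (85, 534760), (65, 535717), (70, 833333), (15, 728377),
    (95, 899945), (80, 914339), (60, 403601), (55, 437328),
]


def pearson_r(pairs):
    """Return the Pearson correlation coefficient of a list of (x, y) pairs."""
    n = len(pairs)
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    # Sum of products of deviations, and sums of squared deviations
    cov = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    var_x = sum((x - mean_x) ** 2 for x, _ in pairs)
    var_y = sum((y - mean_y) ** 2 for _, y in pairs)
    return cov / sqrt(var_x * var_y)


r = pearson_r(DATA)
print(f"Pearson correlation for Table 1: {r:.3f}")
```

A value close to 1 or -1 indicates a strong linear relationship, while a value close to 0
indicates a weak one.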
In mathematics, the field of statistics is a collection of techniques that extract useful
information from data. It is a tool for making sense of a set of numbers. Statistical
inference is the process of drawing conclusions about a larger population of data based on a
smaller sample. To fit a line to the data we use a statistical inference technique called
linear regression. This allows us to summarize and study the relationship between dependent
and independent variables. Linear regression is represented by equation 1
y=mx + c (1)
Here y is the prediction (dependent variable) based on the input variable x (independent
variable). The point of intersection of the line with the y-axis is represented by 'c', and
the slope 'm' defines how strongly y changes with x. Varying each of the parameters, m and c,
produces different linear models that define different input-output mappings. So the value of
y can be predicted from the value of x, provided the values of m and c are already known. The
naive way to find the values of m and c is to try out a number of different values. To check
the correctness of these values, error functions are used. An error function tells how far the
predicted value is from the actual value. There are many different types of statistical error
functions; least squares is one of them.
FIGURE 4. Line plot of different line models produced by varying the slope and intercept.
Taken from Deep Learning
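The idea of scoring candidate values of m and c with an error function can be sketched as
follows. This is a minimal illustration, assuming the Table 1 rows as data and the mean squared
error as the least-squares error function. The hand-picked guess (guess_m, guess_c) is
hypothetical, and the closed-form least-squares solution is shown in place of trying many
values by hand.

```python
# Rows of Table 1: (price per square foot, price of plot)
DATA = [
    (85, 534760), (65, 535717), (70, 833333), (15, 728377),
    (95, 899945), (80, 914339), (60, 403601), (55, 437328),
]


def mean_squared_error(m, c, pairs):
    """Average squared distance between actual y and predicted m*x + c."""
    return sum((y - (m * x + c)) ** 2 for x, y in pairs) / len(pairs)


def least_squares_fit(pairs):
    """Return the (m, c) that minimizes the mean squared error."""
    n = len(pairs)
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    s_xy = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    s_xx = sum((x - mean_x) ** 2 for x, _ in pairs)
    m = s_xy / s_xx
    c = mean_y - m * mean_x
    return m, c


# Hypothetical hand-picked guess, used only for comparison
guess_m, guess_c = 5000.0, 300000.0
fit_m, fit_c = least_squares_fit(DATA)

print("error of guess:", mean_squared_error(guess_m, guess_c, DATA))
print("error of fit:  ", mean_squared_error(fit_m, fit_c, DATA))
```

Because the fitted pair minimizes the squared error, its error is never larger than that of
any guessed pair.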
Probability theory is one of the major tools for building predictive models, and words such as
"probably" and "unlikely" are very common in daily life. To predict how likely an event is to
occur we use probability theory. If A is a particular event, then P(A) represents the
probability that the event occurs. The outcome of a trial may be a success or a failure.
Examples include picking a marble from a sack that contains black and white marbles, or
tossing a coin, where the probability of a head or of a tail is 0.5.
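These probabilities can be estimated by repeating a trial many times and counting successes. A
minimal sketch follows, assuming a fair coin and a hypothetical sack of 3 black and 7 white
marbles; the sack contents, the fixed seed, and the function names are assumptions made for
illustration.

```python
import random


def estimate_probability(event, trials, seed=0):
    """Estimate P(event) as the fraction of trials in which event(rng) is True."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    hits = sum(1 for _ in range(trials) if event(rng))
    return hits / trials


def coin_shows_head(rng):
    # A fair coin: head with probability 0.5
    return rng.random() < 0.5


def picks_black_marble(rng):
    # Hypothetical sack: 3 black and 7 white marbles
    sack = ["black"] * 3 + ["white"] * 7
    return rng.choice(sack) == "black"


p_head = estimate_probability(coin_shows_head, 10_000)
p_black = estimate_probability(picks_black_marble, 10_000)
print(f"estimated P(head)  = {p_head:.3f}")
print(f"estimated P(black) = {p_black:.3f}")
```

With more trials the estimates move closer to the true values of 0.5 and 0.3.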
Bayes' theorem gives the relationship between the occurrences of any two events, such as
selling bread together with milk in a store. Data mining is a field of computer science that
establishes relations between such events in a large data set and gives prior knowledge about
how likely a particular event is to occur.
$$P(A \mid B) = \frac{P(B \mid A)\, P(A)}{P(B)} \qquad (2)$$
where A and B are events and P(A|B) is the conditional probability that event A occurs given
that event B has already occurred.
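As a worked example of Bayes' theorem, P(A|B) = P(B|A) P(A) / P(B), take A as "the customer
buys bread" and B as "the customer buys milk". The store statistics below are hypothetical
values chosen only for illustration; they do not come from any data set in this paper.

```python
def bayes(p_b_given_a, p_a, p_b):
    """Return P(A|B) = P(B|A) * P(A) / P(B)."""
    if p_b <= 0:
        raise ValueError("P(B) must be positive")
    return p_b_given_a * p_a / p_b


# Hypothetical store statistics (assumed for illustration)
p_bread = 0.30             # P(A): a customer buys bread
p_milk = 0.40              # P(B): a customer buys milk
p_milk_given_bread = 0.60  # P(B|A): a bread buyer also buys milk

p_bread_given_milk = bayes(p_milk_given_bread, p_bread, p_milk)
print(f"P(bread | milk) = {p_bread_given_milk:.2f}")
```

Under these assumed numbers, 0.60 x 0.30 / 0.40 = 0.45, so knowing that a customer bought milk
raises the probability that the customer buys bread from 0.30 to 0.45.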
4 CALCULUS
Calculus is the study of change. It helps to find the direction of change: in which direction
should an unknown variable change so that the prediction improves and the error becomes
smaller? Calculus provides an optimization technique called gradient descent that finds the
minimum value iteratively. It uses the error for a given data point to compute the gradient
with respect to each unknown variable, and the gradient is used to update the two variables.
It then moves on to the next point and repeats the process over and over again until the
minimum value is found.

If multiple variables are considered, this is called multivariate regression. The branch of
mathematics that deals with multivariate spaces and the linear transformations between them is
called linear algebra. It provides a set of operations that can be performed on groups of
numbers known as matrices. The data set now consists of an M by I matrix, where each of the M
samples has I features. Each variable has a weight.

Calculus provides the tools for analyzing the relationship between functions and their inputs.
Often, in ML, we are trying to find the inputs that enable a function to best match the data.
Neural networks are one of the most popular and successful conceptual structures in ML. They
are built from a connected web of neurons and inspired by the structure of biological brains.
The behavior of every neuron is influenced by a set of control parameters, each of which needs
to be optimized to best fit the data. The multivariate chain rule can be used to calculate the
influence of each parameter of the network, allowing the parameters to be updated during
training. Multivariate calculus is required to build many common ML techniques.

A wide variety of data sets is derived from devices such as biochip transponders on farm
animals, automobiles with built-in sensors, smart homes, smart cities, airplanes with sensors,
heart monitoring implants, and electric clams in coastal waters. These devices and sensors,
used inside physical, biological, or environmental systems, collect large volumes of data that
follow mathematical models based on both calculus and statistics, so efficient ML algorithms
for such data sets can use hypotheses based on those models. Calculus can be used to implement
learning from patterns. Various combinations of states and controls are used by the ML model
for analysis [15], [16].
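The point-by-point gradient descent procedure can be sketched for the line y = mx + c. This is
a minimal illustration rather than a definitive implementation: the toy points, the learning
rate, and the number of passes are all assumed values, and the squared error of a single point
is used as the error function.

```python
# Hypothetical toy data lying exactly on the line y = 2x + 1
POINTS = [(0.0, 1.0), (1.0, 3.0), (2.0, 5.0), (3.0, 7.0), (4.0, 9.0)]


def gradient_descent(points, learning_rate=0.01, epochs=2000):
    """Fit y = m*x + c by updating m and c after every data point."""
    m, c = 0.0, 0.0
    for _ in range(epochs):
        for x, y in points:
            error = (m * x + c) - y
            # Gradient of error**2 with respect to m and to c
            grad_m = 2.0 * error * x
            grad_c = 2.0 * error
            # Step against the gradient to reduce the error
            m -= learning_rate * grad_m
            c -= learning_rate * grad_c
    return m, c


m, c = gradient_descent(POINTS)
print(f"m = {m:.4f}, c = {c:.4f}")
```

On this noise-free data the estimates approach the slope 2 and intercept 1 used to generate the
points. A learning rate that is too large would make the updates overshoot and diverge.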
Calculus plays an integral role in understanding the internal workings of machine learning
algorithms, such as the gradient descent algorithm for minimizing an error function. Calculus
provides us with the necessary tools to optimize complex objective functions as well as
functions with multidimensional inputs, which are representative of different machine learning
applications. A neural network model, whether shallow or deep, implements a function that
maps a set of inputs to expected outputs. The function implemented by the neural network is
learned through a training process, which iteratively searches for a set of weights that best
enable the neural network to model the variations in the training data.
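The training process described above can be sketched with the smallest possible network: a
single neuron with a sigmoid activation, whose weights are updated using the chain rule. The
training data (the logical AND of two inputs), the squared-error loss, the learning rate, and
the number of passes are assumptions made for illustration.

```python
import math

# Hypothetical training data: ((input 1, input 2), expected output) for logical AND
SAMPLES = [
    ((0.0, 0.0), 0.0), ((0.0, 1.0), 0.0),
    ((1.0, 0.0), 0.0), ((1.0, 1.0), 1.0),
]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def predict(weights, inputs):
    """Output of the neuron: sigmoid of the weighted sum plus bias."""
    w1, w2, b = weights
    x1, x2 = inputs
    return sigmoid(w1 * x1 + w2 * x2 + b)


def train_neuron(samples, learning_rate=0.5, epochs=5000):
    """Search iteratively for weights that fit the samples."""
    w1, w2, b = 0.0, 0.0, 0.0
    for _ in range(epochs):
        for (x1, x2), target in samples:
            out = predict((w1, w2, b), (x1, x2))
            # Chain rule: dLoss/dw = dLoss/dout * dout/dz * dz/dw
            d_loss_d_out = 2.0 * (out - target)  # loss = (out - target)**2
            d_out_d_z = out * (1.0 - out)        # derivative of the sigmoid
            delta = d_loss_d_out * d_out_d_z
            w1 -= learning_rate * delta * x1     # dz/dw1 = x1
            w2 -= learning_rate * delta * x2     # dz/dw2 = x2
            b -= learning_rate * delta           # dz/db = 1
    return w1, w2, b


weights = train_neuron(SAMPLES)
for inputs, target in SAMPLES:
    print(inputs, "->", round(predict(weights, inputs), 3), "expected", target)
```

Deeper networks apply the same chain rule layer by layer, which is the idea behind
backpropagation.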
5 MULTIVARIATE CALCULUS
We frequently attempt to predict a variable that depends on multiple variables. For instance,
we may want to predict the weather at a certain location, which depends on multiple factors.
If the result of a function, a, depends on a single input variable b, then it is written as
given in equation 3.
𝑎 = 𝑓(𝑏) (3)
Consequently, if the result a depends on two variables, b and c for instance, then the function
is given by equation (4)
𝑎 = 𝑓(𝑏, 𝑐) (4)
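For a function of two variables b and c, multivariate calculus asks how the result changes when
each input changes on its own, which is the partial derivative. A minimal sketch follows; the
function f(b, c) = b^2 + 3c and the step size h are hypothetical choices made for illustration,
and the derivatives are approximated numerically with central differences.

```python
def f(b, c):
    # Hypothetical two-variable function, used only for illustration
    return b ** 2 + 3.0 * c


def numerical_gradient(func, b, c, h=1e-6):
    """Approximate the partial derivatives of func with respect to b and c."""
    # Vary one input at a time while holding the other fixed
    df_db = (func(b + h, c) - func(b - h, c)) / (2.0 * h)
    df_dc = (func(b, c + h) - func(b, c - h)) / (2.0 * h)
    return df_db, df_dc


grad_b, grad_c = numerical_gradient(f, 2.0, 1.0)
print(f"df/db = {grad_b:.4f}, df/dc = {grad_c:.4f}")
```

For this function the exact partial derivatives are 2b and 3, so at b = 2 and c = 1 the
approximations should be close to 4 and 3.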
In support vector algorithms, multivariate calculus is used to find the maximal margin.
CONCLUSION
Machine learning is a trending technology that is booming in the field of computer science,
with different models being built for prediction, classification, audio or video recognition,
and so on. Building a machine learning model depends on mathematical concepts such as
probability, calculus, and statistics. The role of mathematics is very important for building
appropriate models and getting accurate results with minimum error. Probability is used for
prediction and classification models. Calculus is used where results depend on multiple
variables, whereas statistics is used for classification and sampling in machine learning.
REFERENCES
[1]. Khaliq, Abdul & Olumoyin, Kayode. (2022). Mathematics of Machine Learning.
10.13140/RG.2.2.25118.41284.
[2]. Linear Algebra. (2020). Mathematics for Machine Learning, 8-56.
doi:10.1017/9781108679930.004.
[3]. Probability and distributions. (2020). Mathematics for Machine Learning, 152-200.
doi:10.1017/9781108679930.008.
[4]. Vector calculus. (2020). Mathematics for Machine Learning, 120-151.
doi:10.1017/9781108679930.007.
[5]. Karamitsos, Ioannis and Albarhami, Saeed and Apostolopoulos, Charalampos,
“Applying DevOps Practices of Continuous Automation for Machine Learning”,
Multidisciplinary Digital Publishing Institute, vol 11, 2020, pp 363.
[6]. Srinivas Pyda, Srinivas Kareenhalli, “Mathematics and Machine Learning”,
International Conference on Mathematics and Computing, 2018, pp. 135-153.
Non-Linear Optical Material: Calcium Borolactate, Asian Journal of Chemistry; Vol. 28, No.
12 (2016).
[36]. Padmaja, D. L., Nagaprasad, S., Pant, K., & Kumar, Y. P. (2022). Role of Artificial
Intelligence and Deep Learning in Easier Skin Cancer Detection through Antioxidants Present
in Food. Journal of Food Quality, 2022.
[37]. Padmaja, D. L. (2021). Performance Analysis of Different Architectures on Face Mask
Detection. Turkish Journal of Computer and Mathematics Education (TURCOMAT), 12(13),
377-381.
[38]. Gundu, K. S., Dhyaram, L. P., Ramana Rao, G. N. V., & Surya Deepak, G. (2023,
January). Comparative Analysis of Energy Consumption in Text Processing Models. In
Advancements in Smart Computing and Information Security: First International Conference,
ASCIS 2022, Rajkot, India, November 24–26, 2022, Revised Selected Papers, Part I (pp. 107-
116). Cham: Springer Nature Switzerland.
[39]. Ramirez-Asis, E., Guzman-Avalos, M., Mazumdar, B. D., Padmaja, D. L., Mishra, M.,
Hirolikar, D. S., & Kaliyaperumal, K. (2022). Metaheuristic Methods for Efficiently Predicting
and Classifying Real Life Heart Disease Data Using Machine Learning. Mathematical
Problems in Engineering, 2022.
[40]. Padmaja, D. L., Tammali, S., Gajavelly, N., & Reddy, K. S. (2022, May). A
comparative study on natural disasters. In 2022 International Conference on Applied Artificial
Intelligence and Computing (ICAAIC) (pp. 1704-1709). IEEE.
[41]. Padmaja, D. L., Sruthi, B. S., Deepak, G. S., & Harsha, G. S. (2022, April). Analysis
to Predict Coronary Thrombosis Using Machine Learning Techniques. In 2022 International
Conference on Sustainable Computing and Data Communication Systems (ICSCDS) (pp. 21-
27). IEEE.
[42]. Padmaja, D. L., & Sriharsha, G. K. (2022, December). Challenges in Crop Selection
Using Machine Learning. In Artificial Intelligence and Data Science: First International
Conference, ICAIDS 2021, Hyderabad, India, December 17–18, 2021, Revised Selected
Papers (pp. 66-76). Cham: Springer Nature Switzerland.
[43]. Padmaja, D. L., Nagaprasad, S., Pant, K., & Kumar, Y. P. (2022). Role of Artificial
Intelligence and Deep Learning in Easier Skin Cancer Detection through Antioxidants Present
in Food. Journal of Food Quality, 2022.
[44]. Baker, M. R., Padmaja, D. L., Puviarasi, R., Mann, S., Panduro-Ramirez, J., Tiwari,
M., & Samori, I. A. (2022). Implementing Critical Machine Learning (ML) Approaches for
Generating Robust Discriminative Neuroimaging Representations Using Structural Equation
Model (SEM). Computational and Mathematical Methods in Medicine, 2022.
[45]. Lakshmipadmaja, D., & Vishnuvardhan, B. (2018). Classification performance
improvement using random subset feature selection algorithm for data mining. Big Data
Research, 12, 1-12.
[46]. Padmaja, D. L., & Vishnuvardhan, B. (2018). Evaluating the influence of parameter
values on the performance of random subset feature selection algorithm on scientific data. Data
& Knowledge Engineering, 117, 174-182.
[47]. Padmaja, D. L., & Vishnuvardhan, B. (2016, February). Comparative study of feature
subset selection methods for dimensionality reduction on scientific data. In 2016 IEEE 6th
International Conference on Advanced Computing (IACC) (pp. 31-34). IEEE.