A Detailed Analysis of The Supervised Machine Learning Algorithms
ABSTRACT: In the field of computer science known as "machine learning," a computer makes predictions about
the tasks it will perform next by examining the data it has been given. The computer can obtain data by
interacting with the environment or from digitized training sets. In contrast to static programming
algorithms, which require explicit human guidance, machine learning algorithms can learn from data and
generate predictions on their own. Various supervised and unsupervised strategies, including rule-based,
logic-based, instance-based, and stochastic techniques, have been proposed to solve such problems. The main
goal of this paper is to present a comprehensive comparison of several state-of-the-art supervised machine
learning techniques.
1. INTRODUCTION
Machine learning gives systems the capacity to learn automatically and improve over time without explicit
coding. Machine learning algorithms are helpful in situations where it is infeasible to write explicit
algorithms that perform well. A task such as sorting integers is straightforward: some numbers are given as
input and an ordered list is returned as output. Here, we know what to provide as input and the steps
to take to get the result we want. Other tasks are harder to specify, such as email filtering to separate
spam from valid communications. In this case, we know the required input and that the output will take the
form of true or false, but the instructions that must be given to the programme in order for it to carry out
these operations are unclear. In such scenarios, where there is no set algorithm that achieves success, we use
data to our advantage and instruct the machine to evaluate the data and interpret it intelligently [1]. Concrete
technology is constantly being revised and improved, since concrete is relatively inexpensive compared to other
building materials and is frequently employed in engineering constructions around the world [1]. Concrete is in
high demand due to rapid and technologically advanced urbanization [2], as it has numerous desirable
features such as compressive strength, shape-ability, and environmental resistance [3]. The benefits of concrete
are also listed as including porosity, damage tolerance, fire resistance, durability, and acoustic insulation [4].
2. LEARNING STRATEGIES
Machine learning employs the following strategies:
34 | Page
Publisher: Noida Institute of Engineering & Technology,
19, Knowledge Park-II, Institutional Area, Greater Noida (UP), India.
NIET Journal of Engineering & Technology (NIETJET)
Volume 10, Issue summer 2022 ISSN: 2229-5828 (Print)
Suppose a training set $\{(x_1, y_1), \ldots, (x_n, y_n)\} \subset X \times \mathbb{R}$ is given, where $X$ is the
space of input patterns. In $\varepsilon$-SV regression, the objective is to find a fitting function $f(x)$ whose
deviation from the target $y_i$ is at most $\varepsilon$ for every point in the training data set. The function
should also be as flat as possible. In other words, any error smaller than $\varepsilon$ is acceptable [12]. In the
linear case, $f(x) = \langle w, x \rangle + b$, where $\langle \cdot, \cdot \rangle$ is the dot product in $X$, and
flatness means a small $w$. To ensure this, we minimize the norm $\|w\|^2$.
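The idea of tolerating small errors while keeping the function flat can be illustrated with a short sketch in Python. It is not the authors' implementation. The tolerance is passed as `eps`, and the trade-off constant `C` and all function names are assumptions introduced here for illustration; the sketch only evaluates the quantity being minimized for a given linear function and does not solve the optimization problem.

```python
def predict(w, b, x):
    """Linear function f(x) = <w, x> + b."""
    return sum(wi * xi for wi, xi in zip(w, x)) + b


def eps_insensitive_loss(y_true, y_pred, eps):
    """Zero when the deviation is within the tolerance eps, linear beyond it."""
    return max(0.0, abs(y_true - y_pred) - eps)


def svr_objective(w, b, X, y, eps, C=1.0):
    """Flatness term 0.5 * ||w||^2 plus C times the total tolerated-error excess.

    C is a hypothetical trade-off constant, not taken from the text above.
    """
    flatness = 0.5 * sum(wi * wi for wi in w)
    penalty = sum(
        eps_insensitive_loss(yi, predict(w, b, xi), eps) for xi, yi in zip(X, y)
    )
    return flatness + C * penalty


if __name__ == "__main__":
    # Toy one-dimensional data, invented for this example.
    X = [[1.0], [2.0], [3.0]]
    y = [1.05, 2.0, 3.5]
    print(svr_objective(w=[1.0], b=0.0, X=X, y=y, eps=0.1))
```

In this toy data, the first two points fall inside the tolerance band and contribute nothing; only the third point, which deviates by more than `eps`, adds to the penalty.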
derived for nearly all other parameters. The rows represent the actual values, i.e. the class labels to which each
data object truly belongs, while the columns represent the values predicted by the classifier, as shown in Fig. 1.
A confusion matrix is a table that counts the number of data instances that are correctly and incorrectly
categorised. The matrix has size n × n, where n is the number of classes defined for the data set.
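A minimal sketch of how such a matrix can be built is given below, with rows indexed by the actual class and columns by the predicted class, as described above. The function names and the toy spam/ham labels are assumptions made for illustration. Accuracy is included as one example of a measure that can be read off the matrix.

```python
def confusion_matrix(actual, predicted, labels):
    """Return an n x n table: rows are actual classes, columns are predicted classes."""
    index = {label: i for i, label in enumerate(labels)}
    n = len(labels)
    matrix = [[0] * n for _ in range(n)]
    for a, p in zip(actual, predicted):
        matrix[index[a]][index[p]] += 1
    return matrix


def accuracy(matrix):
    """Fraction of instances on the diagonal, i.e. correctly classified."""
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    return correct / total if total else 0.0


if __name__ == "__main__":
    # Toy labels, invented for this example.
    actual = ["spam", "spam", "ham", "ham", "ham"]
    predicted = ["spam", "ham", "ham", "ham", "spam"]
    m = confusion_matrix(actual, predicted, labels=["spam", "ham"])
    for row in m:
        print(row)
    print(accuracy(m))
```

The diagonal entries count correct classifications and the off-diagonal entries count the errors, so each entry shows which class was confused with which.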
References:
[1] Domingos, P. “A few useful things to know about machine learning”, Communications of the ACM,
55(10), 2012, pp.1.
[2] Mohri, M., Rostamizadeh, A. and Talwalkar, A. “Foundations of machine learning”, Cambridge, MA: MIT
Press, 2012.
[3] Nguyen, T. and Shirai, K. “Text Classification of Technical Papers Based on Text Segmentation”, Natural
Language Processing and Information Systems, 2013, pp.278-284.
[4] Deng, L. and Li, X. “Machine Learning Paradigms for Speech Recognition: An Overview”, IEEE
Transactions on Audio, Speech, and Language Processing, 21(5), 2013, pp.1060-1089.
[5] Siswanto, A., Nugroho, A. and Galinium, M. “Implementation of face recognition algorithm for biometrics
based time attendance system”, 2014 International Conference on ICT For Smart Society (ICISS).
[6] Chen, Z. and Huang, X. “End-to-end learning for lane keeping of self-driving cars”, 2017 IEEE Intelligent
Vehicles Symposium (IV).
[7] Yong, S., Hagenbuchner, M. and Tsoi, A. “Ranking Web Pages Using Machine Learning Approaches”, 2008
IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.
[8] Wei, Z., Qu, L., Jia, D., Zhou, W. and Kang, M. “Research on the collaborative filtering recommendation
algorithm in ubiquitous computing”, 2010 8th World Congress on Intelligent Control and Automation.
[9] Kononenko, I. “Machine learning for medical diagnosis: history, state of the art and perspective”, Artificial
Intelligence in Medicine, 23(1), 2011, pp.89-109.
[10] Jordan, M. “Statistical Machine Learning and Computational Biology”, IEEE International Conference on
Bioinformatics and Biomedicine (BIBM 2007).
[11] Thangavel, S., Bkaratki, P. and Sankar, A. “Student placement analyzer: A recommendation system using
machine learning”, 4th International Conference on Advanced Computing and Communication Systems
(ICACCS-2017).
[12] Byun, H. and Lee, S. “Applications of Support Vector Machines for Pattern Recognition: A Survey”, Pattern
Recognition with Support Vector Machines, 2002, pp.214-215.
[13] Support vector machine regression algorithm [Online],
http://chemeng.utoronto.ca/~datamining/dmc/support_vector_machine_reg.htm, last access 22.08.2017.
[14] Kotsiantis, S. “Decision trees: a recent overview”, Artificial Intelligence Review, 39(4), 2011, pp.262-267.
[15] Andy Liaw and Matthew Wiener “Classification and Regression by randomForest”, R News, ISSN 1609-
363, vol. 2/3, December 2002, pp. 18-22.
[16] Tibshirani, R. “Regression shrinkage and selection via the lasso: a retrospective”, Journal of the Royal
Statistical Society: Series B (Statistical Methodology), 73(3), 2011, pp.273-282.
[17] Brownlee, J. “Logistic Regression for Machine Learning - Machine Learning Mastery”, [online] Machine
Learning Mastery. Available at: http://machinelearningmastery.com/logistic-regression-for-machine-learning/
[Accessed 12 Aug. 2017].
[18] The steepness of the curve of logistic regression [Online],
http://chemeng.utoronto.ca/~datamining/dmc/logistic_regression.htm, last access 22.08.2017.
[19] Bicego, M. and Loog, M. “Weighted K-Nearest Neighbor revisited”, 23rd International Conference on
Pattern Recognition (ICPR), 2016, pp. 1642-1647.
[20] Ting, K. and Zheng, Z. “Improving the Performance of Boosting for Naive Bayesian Classification”,
Methodologies for Knowledge Discovery and Data Mining, 1999, pp.296-298.
[21] Peng Ye, “The decision tree classification and its application research in personnel management”,
Proceedings of 2011 International Conference on Electronics and Optoelectronics, 2011, pp. 1-4.
[22] Entropy of a decision tree classification algorithm [Online],
http://chemeng.utoronto.ca/~datamining/dmc/decision_tree.htm, last access 22.08.2017.
[23] Muda, Z., Yassin, W., Sulaiman, M. and Udzir, N. “Intrusion detection based on k-means clustering and
OneR classification”, 2011 7th International Conference on Information Assurance and Security (IAS).
[24] Kerdegari, H., Samsudin, K., Ramli, A. and Mokaram, S. “Evaluation of fall detection classification
approaches”, 2012 4th International Conference on Intelligent and Advanced Systems (ICIAS2012).
[25] Bigdeli, Y.; Barbato, M. Use of a low-cost concrete-like fluorogypsum-based blend for applications in
underwater and coastal protection structures. In Proceedings of the OCEANS 2017—Anchorage Conference,
Anchorage, AK, USA, 18–21 September 2017; pp. 1–5. Available online:
https://ieeexplore.ieee.org/abstract/document/8232181 (accessed on 5 May 2021).
[26] Reiter, L.; Wangler, T.; Anton, A.; Flatt, R.J. Setting on demand for digital concrete—Principles,
measurements, chemistry, validation. Cem. Concr. Res. 2020, 132, 106047.
[27] Amran, Y.M.; Alyousef, R.; Alabduljabbar, H.; El-Zeadani, M. Clean production and properties of
geopolymer concrete; A review. J. Clean. Prod. 2020, 251, 119679.