ML Notes
For the journal, see Machine Learning (journal).
Machine learning (ML) is a field of study in artificial intelligence concerned with the
development and study of statistical algorithms that can learn from data and generalize to unseen
data, and thus perform tasks without explicit instructions.[1] Within machine learning, advances in
the subfield of deep learning have allowed neural networks, a class of statistical algorithms, to
surpass many previous machine learning approaches in performance.[2]
ML finds application in many fields, including natural language processing, computer
vision, speech recognition, email filtering, agriculture, and medicine.[3][4] The application of ML to
business problems is known as predictive analytics.
Statistics and mathematical optimization (mathematical programming) methods comprise the
foundations of machine learning. Data mining is a related field of study, focusing on exploratory
data analysis (EDA) via unsupervised learning.[6][7]
From a theoretical viewpoint, probably approximately correct (PAC) learning provides a framework for
describing machine learning.
History
See also: Timeline of machine learning
The term machine learning was coined in 1959 by Arthur Samuel, an IBM employee and pioneer
in the field of computer gaming and artificial intelligence.[8][9] The synonym self-teaching
computers was also used in this time period.[10][11]
Although the earliest machine learning model was introduced in the 1950s, when Arthur
Samuel invented a program that calculated the winning chance in checkers for each side, the
history of machine learning has roots in decades of effort to study human
cognitive processes.[12] In 1949, Canadian psychologist Donald Hebb published the book The
Organization of Behavior, in which he introduced a theoretical neural structure formed by certain
interactions among nerve cells.[13] Hebb's model of neurons interacting with one another laid the
groundwork for how machine learning algorithms operate on nodes, or artificial
neurons, which computers use to communicate data.[12] Other researchers who studied
human cognitive systems also contributed to modern machine learning,
including logician Walter Pitts and Warren McCulloch, who proposed early mathematical
models of neural networks as algorithms that mirror human thought processes.[12]
By the early 1960s, an experimental "learning machine" with punched tape memory, called
Cybertron, had been developed by Raytheon Company to
analyse sonar signals, electrocardiograms, and speech patterns using rudimentary reinforcement
learning. It was repetitively "trained" by a human operator/teacher to recognize patterns and
equipped with a "goof" button to cause it to reevaluate incorrect decisions.[14] A representative
book on research into machine learning during the 1960s was Nilsson's book on Learning
Machines, dealing mostly with machine learning for pattern classification.[15] Interest related to
pattern recognition continued into the 1970s, as described by Duda and Hart in 1973.[16] In 1981 a
report was given on using teaching strategies so that an artificial neural network learns to
recognize 40 characters (26 letters, 10 digits, and 4 special symbols) from a computer terminal.[17]
Tom M. Mitchell provided a widely quoted, more formal definition of the algorithms studied in
the machine learning field: "A computer program is said to learn from experience E with respect
to some class of tasks T and performance measure P if its performance at tasks in T, as measured
by P, improves with experience E."[18] This definition of the tasks in which machine learning is
concerned offers a fundamentally operational definition rather than defining the field in cognitive
terms. This follows Alan Turing's proposal in his paper "Computing Machinery and
Intelligence", in which the question "Can machines think?" is replaced with the question "Can
machines do what we (as thinking entities) can do?".[19]
Modern-day machine learning has two objectives. One is to classify data based on models that
have been developed; the other is to make predictions about future outcomes based on
these models. A hypothetical algorithm for classifying data might use computer vision of
moles, coupled with supervised learning, to train a model to recognize cancerous moles. A
machine learning algorithm for stock trading might supply the trader with predictions about
future market movements.[20]
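As an illustration of the classification objective, the following is a minimal Python sketch of supervised classification. The synthetic data, the choice of scikit-learn, and the random-forest model are illustrative assumptions rather than anything specified above.

from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Synthetic stand-in for labelled feature vectors (e.g. size, asymmetry, colour of a mole).
X, y = make_classification(n_samples=500, n_features=8, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

# Fit a model on labelled examples, then measure how well it generalizes to unseen data.
model = RandomForestClassifier(random_state=0)
model.fit(X_train, y_train)
print("held-out accuracy:", model.score(X_test, y_test))

The same fit-then-predict pattern applies whether the labelled examples are mole images, emails, or stock-market features.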
Relationships to other fields
Artificial intelligence
Figure: Machine learning as a subfield of AI.[21]
As a scientific endeavor, machine learning grew out of the quest for artificial intelligence (AI).
In the early days of AI as an academic discipline, some researchers were interested in having
machines learn from data. They attempted to approach the problem with various symbolic
methods, as well as what were then termed "neural networks"; these were
mostly perceptrons and other models that were later found to be reinventions of the generalized
linear models of statistics.[22] Probabilistic reasoning was also employed, especially in automated
medical diagnosis.[23]: 488
However, an increasing emphasis on the logical, knowledge-based approach caused a rift
between AI and machine learning. Probabilistic systems were plagued by theoretical and
practical problems of data acquisition and representation.[23]: 488 By 1980, expert systems had come
to dominate AI, and statistics was out of favor.[24] Work on symbolic/knowledge-based learning
did continue within AI, leading to inductive logic programming (ILP), but the more statistical line
of research was now outside the field of AI proper, in pattern recognition and information
retrieval.[23]: 708–710, 755 Neural networks research had been abandoned by AI and computer
science around the same time. This line, too, was continued outside the AI/CS field, as
"connectionism", by researchers from other disciplines including John Hopfield, David
Rumelhart, and Geoffrey Hinton. Their main success came in the mid-1980s with the reinvention
of backpropagation.[23]: 25
Machine learning (ML), reorganized and recognized as its own field, started to flourish in the
1990s. The field changed its goal from achieving artificial intelligence to tackling solvable
problems of a practical nature. It shifted focus away from the symbolic approaches it had
inherited from AI, and toward methods and models borrowed from statistics, fuzzy logic,
and probability theory.[24]
Data compression
There is a close connection between machine learning and compression. A system that predicts
the posterior probabilities of a sequence given its entire history can be used for optimal data
compression (by using arithmetic coding on the output distribution). Conversely, an optimal
compressor can be used for prediction (by finding the symbol that compresses best, given the
previous history). This equivalence has been used as a justification for using data compression as
a benchmark for "general intelligence".[25][26][27]
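The prediction-compression link can be made concrete: under arithmetic coding, a sequence's achievable code length is essentially the sum of -log2 of the probabilities the predictor assigns to each symbol. The Python sketch below uses a simple adaptive unigram model as the predictor; the model and example string are illustrative assumptions.

import math
from collections import Counter

def ideal_code_length_bits(text):
    """Bits needed to encode `text` under an adaptive, Laplace-smoothed unigram model,
    i.e. the sum of -log2 p(symbol | history) that arithmetic coding can approach."""
    alphabet = sorted(set(text))
    counts = Counter()
    total_bits = 0.0
    for ch in text:
        # Probability estimated from the history seen so far.
        p = (counts[ch] + 1) / (sum(counts.values()) + len(alphabet))
        total_bits += -math.log2(p)
        counts[ch] += 1
    return total_bits

sample = "abracadabra " * 10
print(round(ideal_code_length_bits(sample), 1), "bits vs", 8 * len(sample), "bits uncompressed")

A better predictor assigns higher probabilities to the symbols that actually occur, and therefore yields a shorter code.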
An alternative view is that compression algorithms implicitly map strings into implicit feature
space vectors, and that compression-based similarity measures compute similarity within these
feature spaces. For each compressor C(.) an associated vector space ℵ is defined, such that C(.)
maps an input string x to the vector norm ||~x||. An exhaustive examination of the
feature spaces underlying all compression algorithms is precluded by space; instead, this view
examines three representative lossless compression methods: LZW, LZ77, and
PPM.[28]
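One widely used compression-based similarity measure of this kind is the normalized compression distance (NCD). NCD is not named in the passage above, so the Python sketch below should be read as a representative example, with zlib's DEFLATE (an LZ77-family compressor) standing in for C(.).

import zlib

def compressed_size(data):
    """C(.): length of `data` after DEFLATE compression."""
    return len(zlib.compress(data, 9))

def ncd(x, y):
    """Normalized compression distance: smaller values mean more similar strings."""
    cx, cy, cxy = compressed_size(x), compressed_size(y), compressed_size(x + y)
    return (cxy - min(cx, cy)) / max(cx, cy)

a = b"the quick brown fox jumps over the lazy dog " * 20
b = b"the quick brown fox leaps over the lazy cat " * 20
c = bytes(range(256)) * 4   # unrelated byte sequence
print("similar pair:   ", round(ncd(a, b), 3))
print("dissimilar pair:", round(ncd(a, c), 3))

The intuition is that concatenating two similar strings compresses almost as well as either string alone, while concatenating unrelated strings does not.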
According to AIXI theory, a connection explained more directly in the Hutter Prize, the best possible
compression of x is the smallest possible software that generates x. For example, in that model, a
zip file's compressed size includes both the zip file and the unzipping software, since one cannot
unzip it without both, but there may be an even smaller combined form.
Examples of AI-powered audio/video compression software include NVIDIA Maxine and AIVC.[29]
Examples of software that can perform AI-powered image compression
include OpenCV, TensorFlow, MATLAB's Image Processing Toolbox (IPT) and High-Fidelity
Generative Image Compression.[30]
In unsupervised machine learning, k-means clustering can be utilized to compress data by
grouping similar data points into clusters. This technique simplifies handling extensive datasets
that lack predefined labels and finds widespread use in fields such as image compression.[31]
Data compression aims to reduce the size of data files, enhancing storage efficiency and
speeding up data transmission. K-means clustering, an unsupervised machine learning algorithm,
is employed to partition a dataset into a specified number of clusters, k, each represented by
the centroid of its points. This process condenses extensive datasets into a more compact set of
representative points. Particularly beneficial in image and signal processing, k-means clustering
aids in data reduction by replacing groups of data points with their centroids, thereby preserving
the core information of the original data while significantly decreasing the required storage
space.[32]
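A minimal Python sketch of k-means as a compressor follows; it performs colour quantization, replacing each pixel's colour with its cluster centroid so that only k colours plus one small index per pixel need to be stored. The random stand-in "image" and the use of scikit-learn are assumptions for illustration.

import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
pixels = rng.integers(0, 256, size=(10000, 3)).astype(float)   # stand-in for RGB pixel values

k = 16                                                         # number of representative colours
km = KMeans(n_clusters=k, random_state=0).fit(pixels)

# Replace each pixel by its cluster centroid: the image is now described by
# k colours plus a per-pixel cluster index, rather than a full colour per pixel.
quantized = km.cluster_centers_[km.labels_]
print("distinct colours before:", len(np.unique(pixels, axis=0)))
print("distinct colours after: ", len(np.unique(np.round(quantized), axis=0)))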
Large language models (LLMs) are also efficient lossless data compressors on
some data sets, as demonstrated by DeepMind's research with the Chinchilla
70B model. Developed by DeepMind, Chinchilla 70B effectively compressed
data, outperforming conventional methods such as Portable Network
Graphics (PNG) for images and Free Lossless Audio Codec (FLAC) for audio. It
achieved compression of image and audio data to 43.4% and 16.4% of their
original sizes, respectively. There is, however, some reason to be concerned
that the data set used for testing overlaps the LLM training data set, making
it possible that the Chinchilla 70B model is only an efficient compression tool
on data it has already been trained on.[33][34]
Data mining
Machine learning and data mining often employ the same methods and overlap significantly, but
while machine learning focuses on prediction, based on known properties learned from the
training data, data mining focuses on the discovery of (previously) unknown properties in the
data (this is the analysis step of knowledge discovery in databases). Data mining uses many
machine learning methods, but with different goals; on the other hand, machine learning also
employs data mining methods as "unsupervised learning" or as a preprocessing step to improve
learner accuracy. Much of the confusion between these two research communities (which do
often have separate conferences and separate journals, ECML PKDD being a major exception)
comes from the basic assumptions they work with: in machine learning, performance is usually
evaluated with respect to the ability to reproduce known knowledge, while in knowledge
discovery and data mining (KDD) the key task is the discovery of
previously unknown knowledge. Evaluated with respect to known knowledge, an uninformed
(unsupervised) method will easily be outperformed by other supervised methods, while in a
typical KDD task, supervised methods cannot be used due to the unavailability of training data.
Machine learning also has intimate ties to optimization: Many learning problems are formulated
as minimization of some loss function on a training set of examples. Loss functions express the
discrepancy between the predictions of the model being trained and the actual problem instances
(for example, in classification, one wants to assign a label to instances, and models are trained to
correctly predict the preassigned labels of a set of examples).[35]
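To make the optimization view concrete, the following Python sketch minimizes a mean-squared-error loss over a training set by gradient descent for a linear model; the synthetic data and hyperparameters are illustrative assumptions.

import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
true_w = np.array([2.0, -1.0, 0.5])
y = X @ true_w + rng.normal(scale=0.1, size=200)   # noisy training targets

w = np.zeros(3)                                    # model parameters to be learned
learning_rate = 0.1
for _ in range(500):
    residual = X @ w - y                           # prediction error on the training set
    grad = 2 * X.T @ residual / len(y)             # gradient of the mean-squared-error loss
    w -= learning_rate * grad                      # step that reduces the loss

print("learned weights:", np.round(w, 2))          # should be close to true_w

Here the loss function expresses the discrepancy between the model's predictions and the training targets, and learning consists of repeatedly adjusting the parameters to reduce it.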
Generalization
Characterizing the generalization of various learning algorithms is an active topic of current
research, especially for deep learning algorithms.
Statistics
Machine learning and statistics are closely related fields in terms of methods, but distinct in their
principal goal: statistics draws population inferences from a sample, while machine learning
finds generalizable predictive patterns.[36] According to Michael I. Jordan, the ideas of machine
learning, from methodological principles to theoretical tools, have had a long pre-history in
statistics.[37] He also suggested the term data science as a placeholder to call the overall field.[37]
Conventional statistical analyses require the a priori selection of a model most suitable for the
study data set. In addition, only significant or theoretically relevant variables based on previous
experience are included for analysis. In contrast, machine learning is not built on a pre-structured
model; rather, the data shape the model by detecting underlying patterns. The more variables
(input) used to train the model, the more accurate the ultimate model will be.[38]
Leo Breiman distinguished two statistical modeling paradigms: the data model and the algorithmic
model,[39] where "algorithmic model" refers, more or less, to machine learning algorithms
such as Random Forest.
Some statisticians have adopted methods from machine learning, leading to a combined field that
they call statistical learning.[40]
Statistical physics
Analytical and computational techniques derived from deep-rooted physics of disordered
systems can be extended to large-scale problems, including machine learning, e.g., to analyse the
weight space of deep neural networks.[41] Statistical physics is thus finding applications in the area
of medical diagnostics.[42]