ML Learning
ML Learning
In basic terms, ML is the process of training a piece of software, called a model, to make useful predictions or generate content from data.
ML systems fall into one or more of the following categories based on how they learn to make predictions or generate conten
Supervised learning
Unsupervised learning
Reinforcement learning
Generative AI
Supervised learning models can make predictions after seeing lots of data with the correct answers and then discovering the c
Unsupervised learning models make predictions by being given data that does not contain any correct answers.
Datasets are made up of individual examples that contain features and a label
Features are the values that a supervised model uses to predict the label/input variable
The label is the "answer," or the value we want the model to predict.
A dataset is characterized by its size and diversity. Size indicates the number of examples. Diversity indicates the range those e
Training
Before a supervised model can make predictions, it must be trained. To train a model, we give
the model a dataset with labeled examples. The model's goal is to work out the best solution for
predicting the labels from the features. The model finds the best solution by comparing its
predicted value to the label's actual value. Based on the difference between the predicted and
actual values—defined as the loss—the model gradually updates its solution. In other words,
the model learns the mathematical relationship between the features and the label so that it
can make the best predictions on unseen data.
ut being programmed explicitly
ers and then discovering the connections between the elements in the data that produce the correct answers.
orrect answers.
A labeled example consists of one or more
features and a label. Labeled examples
are used during training.
sity indicates the range those examples cover. Good datasets are both large and highly diverse
Loss
During the training of a supervised model,
a measure of how far a model's prediction
is from its label.
The two main data structures defined in the pandas package are DataFrame and Series
A DataFrame is a tabular data structure that is represented as a two-dimensional table. It is composed of rows, columns, index
observatio
n
posed of rows, columns, indexes, and cells. It is very similar to a sheet in Excel or a table in a database