Machine Learning

learn machine learning and data visualization using this .pdf book

Uploaded by

willaaa269
Copyright
© © All Rights Reserved
Objectives:

Upon completion of this module, the learner will be able to apply data pre-processing, develop a machine learning model, and perform model deployment.

Learning assumed to be in place:

Python programming, mathematical analysis, statistics, and probability
Learning outcome 1: Apply Data Pre-processing
I.C.1. Description of Machine Learning Concepts
Machine learning (ML) is a subset of artificial intelligence (AI) and computer science. It deals with the study and use of data and algorithms to imitate the way humans learn.
ML allows software applications to improve their prediction accuracy without being explicitly programmed to do so. It estimates new output values by using historical data as input.
Classification of Machine Learning

1. Supervised learning:
Supervised learning is a type of machine learning in which machines are trained using well-"labelled" training data and, on the basis of that data, predict the output. Labelled data means that some input data is already tagged with the correct output.
In supervised learning, the training data provided to the machine works as the supervisor that teaches the machine to predict the output correctly.
How Does Supervised Learning Work?

In supervised learning, models are trained using a labelled dataset, from which the model learns about each type of data. Once the training process is completed, the model is tested on test data (a held-out subset of the labelled dataset that was not used for training), and it then predicts the output.
Suppose we have a dataset of different types of shapes, which includes squares, rectangles, triangles, and other polygons such as hexagons. The first step is to train the model on each shape.
If the given shape has four sides, and all the sides are equal, then it will be labelled as a square.
If the given shape has three sides, then it will be labelled as a triangle.
If the given shape has six equal sides, then it will be labelled as a hexagon.
After training, we test our model using the test set, and the task of the model is to identify the shape.
The machine has already been trained on all types of shapes, so when it finds a new shape, it classifies the shape on the basis of its number of sides and predicts the output.
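The shape example above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the feature encoding (number of sides, whether all sides are equal), the tiny dataset, and the helper names are hypothetical. A 1-nearest-neighbour rule stands in for "the model", since the text does not name a specific algorithm.

```python
# Hypothetical labelled dataset: (number_of_sides, all_sides_equal) -> label.
# The feature encoding is an assumption made for this sketch.
BASE_EXAMPLES = [
    ((3, 1), "triangle"),
    ((3, 0), "triangle"),
    ((4, 1), "square"),
    ((4, 0), "rectangle"),
    ((6, 1), "hexagon"),
]
LABELLED_SHAPES = BASE_EXAMPLES * 4  # 20 labelled rows


def split(rows):
    """Hold out every 4th row as test data; the rest is training data."""
    test = rows[::4]
    train = [row for i, row in enumerate(rows) if i % 4 != 0]
    return train, test


def predict(train, features):
    """1-nearest-neighbour: return the label of the closest training example."""
    def squared_distance(example):
        return sum((a - b) ** 2 for a, b in zip(example[0], features))
    return min(train, key=squared_distance)[1]


def accuracy(train, test):
    """Fraction of test rows whose predicted label matches the true label."""
    correct = sum(predict(train, x) == y for x, y in test)
    return correct / len(test)


train_set, test_set = split(LABELLED_SHAPES)
print("4 equal sides ->", predict(train_set, (4, 1)))
print("test accuracy:", accuracy(train_set, test_set))
```

The test rows are never used for training, which is what makes the accuracy figure a measure of how well the model handles data it has not been fitted to.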
What is a Machine Learning Model?

A machine learning model is a program that can find patterns or make decisions from a previously unseen dataset.

In image recognition, for example, a machine learning model can be taught to recognize objects such as cats or dogs. A machine learning model can perform such tasks by being 'trained' with a large dataset. During training, the machine learning algorithm is optimized to find certain patterns or outputs from the dataset, depending on the task. The output of this process, often a computer program with specific rules and data structures, is called a machine learning model.
What is a Machine Learning Algorithm?
A machine learning algorithm is a mathematical method for finding patterns in a set of data. Machine learning algorithms are often drawn from statistics, calculus, and linear algebra. Some popular examples of machine learning algorithms include linear regression, decision trees, and random forests.
What is Model Training in Machine Learning?

The process of running a machine learning algorithm on a dataset (called training data) and optimizing the algorithm to find certain patterns or outputs is called model training. The resulting function with rules and data structures is called the trained machine learning model.
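The difference between an algorithm, training, and a trained model can be made concrete with linear regression, one of the algorithms mentioned above. The sketch below is a minimal illustration, not a reference implementation: the function name `train_linear_regression` and the data (hours studied versus score) are hypothetical. The algorithm is ordinary least squares, running it on the data is the training step, and the returned function is the trained model.

```python
def train_linear_regression(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance-like and variance-like sums used by the closed-form solution
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x

    def model(x):
        """The trained model: a function holding the learned parameters."""
        return slope * x + intercept

    return model, slope, intercept


# Hypothetical training data that follows y = 2x + 1 exactly
hours = [1.0, 2.0, 3.0, 4.0]
scores = [3.0, 5.0, 7.0, 9.0]

model, slope, intercept = train_linear_regression(hours, scores)
print("learned slope:", slope, "intercept:", intercept)
print("prediction for 5 hours:", model(5.0))
```

Once training is finished, only the learned parameters (here the slope and intercept) are needed to make predictions on new inputs.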
What is Unsupervised Machine Learning?

In unsupervised machine learning, the algorithm is provided with an input dataset but is not rewarded for or optimized toward specific outputs. Instead, it is trained to group objects by common characteristics. For example, recommendation engines on online stores can rely on unsupervised machine learning, specifically a technique called clustering.
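Clustering can be illustrated with k-means, one common clustering algorithm (the text does not say which one a given store uses, so this choice is an assumption). The sketch below groups hypothetical customers, described by visits per month and average spend, without any labels. The data, the starting centroids, and the function name `kmeans` are made up for illustration.

```python
def kmeans(points, centroids, iterations=10):
    """Group 2-D points around starting centroids (Lloyd's algorithm)."""
    for _ in range(iterations):
        # Assignment step: put each point in the cluster of its nearest centroid
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(
                range(len(centroids)),
                key=lambda i: (p[0] - centroids[i][0]) ** 2
                + (p[1] - centroids[i][1]) ** 2,
            )
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster;
        # an empty cluster keeps its previous centroid
        centroids = [
            (sum(p[0] for p in c) / len(c), sum(p[1] for p in c) / len(c))
            if c else old
            for c, old in zip(clusters, centroids)
        ]
    return centroids, clusters


# Hypothetical unlabelled data: (visits per month, average spend)
customers = [(1, 2), (2, 1), (1.5, 1.5), (8, 9), (9, 8), (8.5, 8.5)]

# Starting centroids are taken from the data; real code often picks them randomly
centroids, clusters = kmeans(customers, [customers[0], customers[-1]])
print("centroids:", centroids)
print("clusters:", clusters)
```

No correct output is ever supplied here: the two groups emerge only from how close the points are to one another.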
1.2. Data Collection and Acquisition
● Definition of data
● Source of data
● 6 V's of Big Data
● Types of data
1.3. Visualization results are properly interpreted based on their statistical analysis.

Data visualization is the graphical representation of information and data. By using visual elements like charts, graphs, and maps, data visualization tools provide an accessible way to see and understand trends, outliers, and patterns in data.
Common Types of Data Visualizations
● Bar Charts: Useful for comparing quantities across categories.
● Line Graphs: Ideal for showing trends over time.
● Pie Charts: Good for showing proportions of a whole.
● Scatter Plots: Useful for showing relationships between two
variables.
● Heat Maps: Great for showing data density or patterns across
two dimensions.
Data visualization plays a crucial role in the machine learning (ML) process, aiding
in various stages such as data exploration, model evaluation, and results
interpretation. Here’s how it integrates with machine learning:

Importance of Data Visualization in Machine Learning


1. Data Exploration:
● Visualizations help identify patterns, trends, and anomalies in the data.
● Tools like histograms and box plots can reveal the distribution of
features and potential outliers.
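The numbers behind a histogram and a box plot can be computed with the Python standard library alone. The sketch below is illustrative only: the `ages` data and the helper names are hypothetical, and the 1.5 × IQR whisker rule is the common box plot convention rather than something specified in this module. A plotting library such as Matplotlib would normally draw these, but the underlying counts and thresholds are the same idea.

```python
import statistics
from collections import Counter


def histogram_counts(values, bin_width):
    """Count how many values fall in each bin; returns {bin_start: count}."""
    counts = Counter((v // bin_width) * bin_width for v in values)
    return dict(sorted(counts.items()))


def boxplot_outliers(values):
    """Return values outside the 1.5 * IQR whiskers of a typical box plot."""
    q1, _median, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    return [v for v in values if v < low or v > high]


# Hypothetical feature values with one suspicious entry
ages = [22, 25, 27, 28, 30, 31, 33, 35, 36, 95]

# Text rendering of the histogram: one '#' per value in the bin
for bin_start, count in histogram_counts(ages, 10).items():
    print(f"{bin_start:>3}-{bin_start + 9:<3} {'#' * count}")

print("potential outliers:", boxplot_outliers(ages))
```

A value flagged this way is only a candidate outlier. Whether it is an error or a genuine extreme case still needs to be checked against the source of the data.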

1.3. Interpret Data Visualization
✓ Description of data visualization tools
● Matplotlib
● Seaborn
● Plotly
● Tableau
● Power BI
✓ Use of types of data visualization
● Scatter Plots
● Line Plots
● Bar Charts
● Histograms
● Box Plots
● Heat Maps
