machine learning ppt
machine learning ppt
Data
Crucial to machine learning, data is the raw information that is
processed and analyzed by algorithms.
It can be structured or unstructured and comes in various forms
such as text, images, numbers, and more.
Quality and quantity of data significantly influence the
performance of a machine learning model.
Data can come in various forms such as structured data (e.g.
databases), unstructured data (e.g. text, images), and semi
structured data (e.g.JSON files).
Model
In machine learning, a model represents what an algorithm has
learned from the training data.
It is the output that you get after training an algorithm with data,
and it is this model that is used to make predictions or decisions.
Models need to be validated and tested to ensure they generalize
well to unseen data.
By: Petros Abebe Mizan Tepi University Jun, 2023 8/21
Types of Machine Learning
Classification
Classification is the process of categorizing data or objects into
predefined classes or categories based on their features or
attributes.
Classification is the process of categorizing data or objects into
predefined classes or categories based on their features or
attributes.
Classification involves predicting a categorical or discrete
output variable based on a set of input features.
The goal of classification is to learn a model that can accurately
assign new input data to one of several predefined categories or
classes.
Regression
Regression involves predicting a continuous or numerical output
variable based on a set of input features.
The goal of regression is to learn a model that can accurately
predict a numerical value for new input data.
Regression is used to predict a continuous output based on
historical input data only.
Some common applications of regression include predicting
stock prices, estimating housing prices, and forecasting weather.
Dimensionality reduction
It is a technique used to reduce the number of features in a
dataset while retaining as much of the original information as
possible.
This is useful for visualizing high-dimensional data or for
reducing the computational complexity of a machine learning
model.
By: Petros Abebe Mizan Tepi University Jun, 2023 16/21
Reinforcement Learning
NumPy
NumPy is the fundamental package for scientific computing in
Python.
The name is an acronym for ”Numeric Python” or ”Numerical
Python”.
Python with powerful data structures, implementing
multi-dimensional arrays and matrices.
matplotlib
These libraries allow users to plot much cleaner and
sophisticated graphs.
Matplotlib is an visualization library in Python for 2D plots of
arrays.
pandas
pandas is a Python package that provides fast, flexible, and
expressive data structures designed to make working with
”relational” or ”labeled” data both easy and intuitive.
Pandas is a Python library that is mainly used for data analysis.
Scikit-learn
Scikit-learn is a Python library which is used for classical
machine learning algorithms.
Scikit-learn is popular in Machine learning developers as it
supports supervised and unsupervised learning algorithms.
This library can also be used for data-analysis, and data-mining
process.