Classifying in Machine Learning
Classifying in Machine Learning
CLASSIFYING IN
MACHINE LEARNING
PRESENTATION
PRESENTED BY:
Association
CLUSTERING:
Clustering is the process of arranging a group of objects in such a manner that the
objects in the same group (which is referred to as a cluster) are more similar to
each other than to the objects in any other group. Data professionals often use
clustering in the Exploratory Data Analysis phase to discover new information and
patterns in the data. As clustering is unsupervised machine learning, it doesn’t
require a labeled dataset.
ASSOCIATION:
Association learning, often referred to in the context of association rule learning, is
a rule-based machine learning method for discovering interesting relations
between variables in large databases. It is intended to identify strong rules
discovered in databases using some measures of interestingness.
In fact, many Machine Learning problems belong to this group because collecting
labeled data takes a lot of time and has high costs. Many types of data even require
experts to label (medical images, for example). In contrast, unlabeled data can be
collected at low cost from the internet.
REINFORCEMENT LEARNING:
Reinforcement learning (RL) is a machine learning (ML) technique that trains
software to make decisions to achieve the most optimal results. It mimics the
trial-and-error learning process that humans use to achieve their goals.
When there is a new data point to predict, Random Forest will make a prediction by
combining the predictions of all subtrees. Finally, the algorithm will choose the
result with the most votes to conclude the problem and situation.
ENTROPY AND GINI INDEX IN DECISION
TREE Both entropy and Gini index are
impurity measures used in decision
trees to guide the process of splitting
data points.
Example:
If we had total 10 data points in our dataset with 3 belonging to positive
class and 7 belonging to negative class , then we use the fomula:
Example:
THANK YOU VERY MUCH
FOR LISTENING.