0% found this document useful (0 votes)
445 views

3C's of Machine Learning PDF

This document discusses three machine learning techniques: classification, clustering, and collaboration. Classification involves assigning categories or labels to new observations based on a training set of pre-classified data. Clustering groups unlabeled data based on similarities without predefined labels. Collaboration identifies relationships between data, like products users may like based on similarities to other users' preferences.

Uploaded by

Mebratu Abuye
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
445 views

3C's of Machine Learning PDF

This document discusses three machine learning techniques: classification, clustering, and collaboration. Classification involves assigning categories or labels to new observations based on a training set of pre-classified data. Clustering groups unlabeled data based on similarities without predefined labels. Collaboration identifies relationships between data, like products users may like based on similarities to other users' preferences.

Uploaded by

Mebratu Abuye
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

JIMMA UNIVERSITY

JIMMA INSTITUTE OF TECHNOLOGY


FACULTY OF ELECTRICAL AND COMPUTER
ENGINEERING
DEGREE OF MASTER OF SCIENCE IN COMPUTER
ENGINEERING

Machine Learning Assignment 1

3C’s of Machine Learning

Mebratu Abuye RM0303/14

March 4, 2023
Jimma, Ethiopia
3C’s of Machine Learning

Classification, Clustering and Collaboration

Both Classification and Clustering is used for the categorization of objects into one or more
classes based on the features. They appear to be a similar process as the basic difference is
minute. In the case of Classification, there are predefined labels assigned to each input instance
according to their properties whereas in clustering those labels are missing.

Classification

As the name suggests, Classification is the task of “classifying things” into sub-categories. In
Machine Learning, Classification is the problem of identifying to which of a set of categories, a
new observation belongs, on the basis of a training set of data containing observations and whose
categories membership is known. Classification is a machine learning task that involves
assigning a class label to a given input based on a set of training data. The goal of classification
is to build a model that can accurately predict the class label for new, unseen data. Classification
is used for supervised learning. The process of classifying the input instances based on their
corresponding class labels is known as classification. As Classification have labels so there is
need of training and testing dataset for verifying the model created but there is no need for
training and testing dataset in clustering. Classification is more complex as compared to
clustering as there are many levels in the classification phase whereas only grouping is done in
clustering.

Examples are Logistic regression, Naive Bayes classifier, Support vector machines, etc.

Clustering
Clustering is one type of unsupervised learning where the goal is to partition the set of objects into
groups called clusters. Clustering is the collection of objects based on resemblance and distinction
between them. In simple words, we can say that it is an approach of collection of objects, so that
objects with similar functionalities come together and objects with different attributes move apart.
It is used to set the instances on the basis of their resemblance without class labels. It does not
prefer a training dataset and less complex as compared to the classification. Clustering is structure-
finding, typically among dense data of low or moderate dimension in a continuous space. It's really
defined by a distance function among data points. It typically employs some form of expectation-
maximization-style algorithm.
Examples are k-means clustering algorithm, Fuzzy c-means clustering algorithm, Gaussian (EM)
clustering algorithm, etc.

Collaboration

Collaborative is a Machine Learning technique used to identify relationships between pieces of


data. This technique is frequently used in recommender systems to identify similarities between
user data and items. Collaborative is a generic approach that can be summarized as "using
information from similar users or items to predict affinity to a given item". This means that
if Users A and B both like Product A, and User B also likes Product B, then Product B could be
recommended to User A by the system. The model keeps track of what products users like and
their characteristics to see what users, who like products with similar characteristics, enjoyed. The
model then makes its recommendations accordingly. Product features should be given numerical
values whenever possible as it makes decisions by the model more accurate. Once features are
identified and assigned values, data collection needs to begin.

There are two ways the model can identify whether or not a user enjoyed a product. The user can
be asked to give a numerical rating or the system can assume that the user likes whatever product
they use. Once user interests have been established, recommendations can be made. Collaborative
filtering is in general a ranking problem. Depending on how you look at it, the data are sparse,
high-dimensional and in a continuous space. It amounts to inferring which missing dimension has
the highest value. It typically proceeds via a matrix completion algorithm like low-rank
factorization.

You might also like