C10 AI Projectcycle Unit2
CLASS X
ARTIFICIAL INTELLIGENCE
INTRODUCTION TO AI PROJECT CYCLE
OBJECTIVES
• BASICS OF AI PROJECT CYCLE
• PROBLEM SCOPING
• DATA ACQUISITION
• DATA EXPLORATION
• DATA MODELLING
• EVALUATION AND DEVELOPMENT
AI PROJECT LIFE CYCLE
Eg:
To build an artificially intelligent system that can predict the
salary of an employee based on his or her previous salaries:
The AI machine is trained with the data of the employee's previous
salaries, on the basis of which it predicts the next salary.
The previous salary data here is known as the Training Data,
while the next salary prediction data is known as the Testing
Data.
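The split between Training Data and Testing Data can be sketched in a few lines of Python. This is only an illustration: the salary figures are made up, and the "average raise" predictor is an assumed stand-in for a real AI model, not something the example above prescribes.

```python
# Hypothetical yearly salaries of one employee (illustrative numbers only).
salaries = [30000, 32000, 34500, 36500, 39000]

# Training data: everything except the most recent salary.
training_data = salaries[:-1]
# Testing data: the most recent salary, held back to check the prediction.
actual_next = salaries[-1]

# "Training": learn the average yearly raise from the training data.
raises = [later - earlier for earlier, later in zip(training_data, training_data[1:])]
average_raise = sum(raises) / len(raises)

# Prediction: last known training salary plus the learned average raise.
predicted_next = training_data[-1] + average_raise
error = abs(predicted_next - actual_next)

print(f"Predicted: {predicted_next:.0f}, actual: {actual_next}, error: {error:.0f}")
```

The smaller the error on the held-back salary, the more we can trust the predictor on salaries it has not yet seen.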
DATA FEATURES
For better efficiency of an AI project, the Training Data needs to be
relevant and authentic. In the previous example, if the training data
consisted of the employee's expenses rather than previous salaries, the
machine would not predict the next salary correctly, because it was
trained on the wrong data. Similarly, if the previous salary data was not
authentic, that is, not correct, the prediction could also go
wrong.
Hence, for any AI project to be efficient, the training data should
be authentic and relevant to the problem statement
scoped.
DATA SOURCES
1. Bar graphs
2. Histograms
3. Scatter plots
4. Pie plots, etc.
Modelling
Graphical representation makes data
understandable for humans, as we can discover
trends and patterns in it. Machines, however,
need data in its most basic numeric form
(ultimately binary, 0s and 1s), and to discover
patterns and trends in data they rely on
mathematical representations of it.
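To make the "numbers, ultimately binary" point concrete, here is a small Python sketch. The salary value and the word are arbitrary examples chosen for illustration, and UTF-8 is assumed as the text encoding.

```python
salary = 52000          # a number as humans write it
word = "AI"             # a short piece of text

# The same number written in binary (base 2).
salary_bits = format(salary, "b")

# Text is first mapped to numbers (here: UTF-8 byte values), then to bits.
word_bytes = list(word.encode("utf-8"))
word_bits = [format(b, "08b") for b in word_bytes]

print(salary_bits)
print(word_bytes)
print(word_bits)
```

Whatever the data looks like to us (numbers, words, images), the machine ends up working with sequences of 0s and 1s like these.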
The ability to mathematically describe the
relationship between parameters is the heart of
every AI model. Thus, whenever we talk about
developing AI models, we are referring to the
mathematical approach to analysing data.
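One simple way to describe a relationship between two parameters mathematically is the Pearson correlation coefficient. The sketch below is an assumed illustration, not a method named in these slides, and the experience and salary figures are hypothetical.

```python
import math

def pearson_correlation(xs, ys):
    """Return the Pearson correlation coefficient between two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return covariance / (spread_x * spread_y)

# Hypothetical data: years of experience and salary (illustrative only).
experience = [1, 2, 3, 4, 5]
salary = [30000, 32000, 34500, 36500, 39000]

r = pearson_correlation(experience, salary)
print(f"correlation = {r:.3f}")  # a value close to +1 means the two rise together
```

A value near +1 or -1 signals a strong straight-line relationship, while a value near 0 signals little or none.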
Generally, AI models can be classified as follows:
AI Models
• Rule Based
• Learning Based
  • Machine Learning
  • Deep Learning
Rule Based Approach
Supervised learning
Unsupervised learning
Reinforcement learning
1. Supervised Learning
In a supervised learning model, the dataset which is fed
to the machine is labelled. In other words, the dataset
is known to the person who is training the machine;
only then is he/she able to label the data. A
label is some information which can be used as a tag for
data. For example, students get grades according to the
marks they secure in examinations. These grades are
labels which categorise the students according to their
marks.
There are two types of Supervised
Learning models:
Classification: Where the data is
classified according to the labels. For
example, in the grading system,
students are classified on the basis
of the grades they obtain with
respect to their marks in the
examination. This model works on
a discrete dataset, which means the
labels take one of a fixed set of
values rather than a continuous range.
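The grading example can be turned into a tiny classifier. The sketch below uses a nearest-neighbour rule, which is one assumed choice among many classification methods, and the marks and grades in the labelled data are invented for illustration.

```python
# Labelled training data: (marks, grade). Both values are illustrative.
labelled_marks = [
    (95, "A"), (88, "A"), (76, "B"), (71, "B"),
    (62, "C"), (55, "C"), (41, "D"), (33, "D"),
]

def classify(marks):
    """Predict a grade by copying the label of the closest training example."""
    nearest_marks, nearest_grade = min(
        labelled_marks, key=lambda example: abs(example[0] - marks)
    )
    return nearest_grade

for new_marks in (91, 68, 38):
    print(new_marks, "->", classify(new_marks))
```

The output is always one of the known labels (A, B, C or D), which is what makes this a classification task.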
Regression: Such models
work on continuous data.
For example, if you wish to
predict your next salary, then
you would put in the data of
your previous salary, any
increments, etc., and would
train the model. Here, the
data which has been fed to
the machine is continuous.
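A minimal regression sketch for the salary example follows. It fits a straight line by least squares, which is an assumed choice of model, and the yearly salary figures are hypothetical.

```python
def fit_line(xs, ys):
    """Least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sum(
        (x - mean_x) ** 2 for x in xs
    )
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical training data: year number and salary in that year.
years = [1, 2, 3, 4, 5]
salaries = [30000, 32000, 34500, 36500, 39000]

slope, intercept = fit_line(years, salaries)

# Predict the salary for the next year (year 6) from the fitted line.
predicted_year_6 = slope * 6 + intercept
print(f"raise per year = {slope:.0f}, predicted year 6 salary = {predicted_year_6:.0f}")
```

Unlike the grade classifier, the prediction here can be any number on a continuous scale, not just one of a few labels.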
2. Unsupervised Learning
An unsupervised learning model works on an unlabelled
dataset. This means that the data which is fed to the
machine is random, and the person who is training the
model may not have any information about it.
Unsupervised learning models are used to identify
relationships, patterns and trends in the data which is
fed into them. They help the user understand what the
data is about and which major features the machine has
identified in it.
For example, suppose you have a random set of 1000 dog images
and you wish to find some pattern in it. You would feed
this data into the unsupervised learning model and train
the machine on it. After training, the machine would come
up with the patterns which it was able to identify. The
machine might come up with patterns which are already
known to the user, like colour, or it might even come up
with something unusual, like the size of the dogs.
Unsupervised learning models can be further
divided into two categories:
Clustering: Refers to the unsupervised
learning algorithm which can cluster the
unknown data according to the patterns or
trends identified in it. The patterns
observed might be ones which are already
known to the developer, or the algorithm
might come up with some unique patterns
of its own.
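Clustering can be sketched with k-means, a common clustering algorithm; the slides do not name a specific one, so this choice is an assumption. The (height, weight) pairs for the dogs and the starting centroids are also made up for illustration.

```python
def k_means(points, centroids, iterations=10):
    """Group 2-D points around the given starting centroids (plain k-means)."""
    clusters = [[] for _ in centroids]
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in centroids]
        for x, y in points:
            distances = [(x - cx) ** 2 + (y - cy) ** 2 for cx, cy in centroids]
            clusters[distances.index(min(distances))].append((x, y))
        # Update step: move each centroid to the mean of its cluster.
        # An empty cluster keeps its previous centroid.
        centroids = [
            (sum(p[0] for p in c) / len(c), sum(p[1] for p in c) / len(c)) if c else old
            for c, old in zip(clusters, centroids)
        ]
    return centroids, clusters

# Hypothetical unlabelled data: (height in cm, weight in kg) of six dogs.
dogs = [(25, 4), (28, 5), (30, 6), (60, 30), (65, 34), (70, 38)]

# Start from two of the data points as initial guesses for the cluster centres.
centroids, clusters = k_means(dogs, centroids=[dogs[0], dogs[-1]])
for centre, members in zip(centroids, clusters):
    print(centre, members)
```

No labels were given, yet the algorithm separates the small dogs from the large ones purely from the pattern in the numbers.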
Dimensionality Reduction: We humans are able to visualise up to
3 dimensions only, but according to many theories and
algorithms, there are various entities which exist beyond 3
dimensions. For example, in Natural Language Processing,
words are treated as N-dimensional entities, which means
that we cannot visualise them, as they lie beyond our
visualisation ability. Hence, to make sense of them, we need to
reduce their dimensions. This is where a dimensionality reduction
algorithm is used.
As we reduce the dimensions of an entity, the information which
it contains starts getting distorted.
For example, if we have a ball in our hand, it is 3-dimensional
right now. But if we click its picture, the data becomes 2-D, as
an image is a 2-dimensional entity. As soon as we drop
one dimension, a large part of the information is lost, because we
no longer know anything about the back of the ball. Was the ball the
same colour at the back? Or was it just a hemisphere? If
we reduce the dimensions further, more and more information
will get lost.
Hence, to reduce the dimensions and still be able to make sense
of the data, we use Dimensionality Reduction.
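The idea of reducing dimensions while keeping most of the information can be sketched with principal component analysis (PCA), here in its simplest 2-D to 1-D form. PCA is an assumed choice, since the slides do not name an algorithm, and the data points are invented. The function also reports what fraction of the original spread (variance) survives the reduction.

```python
import math

def pca_2d_to_1d(points):
    """Project 2-D points onto their first principal component.

    Returns the 1-D coordinates and the fraction of variance that survives.
    Assumes the points are not all identical.
    """
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    centred = [(x - mean_x, y - mean_y) for x, y in points]

    # Entries of the 2x2 covariance matrix [[a, b], [b, c]].
    a = sum(x * x for x, _ in centred) / n
    b = sum(x * y for x, y in centred) / n
    c = sum(y * y for _, y in centred) / n

    # Direction of greatest variance (closed form for a symmetric 2x2 matrix).
    angle = 0.5 * math.atan2(2 * b, a - c)
    ux, uy = math.cos(angle), math.sin(angle)

    # Each point is replaced by a single number: its position along that direction.
    projected = [x * ux + y * uy for x, y in centred]
    kept_variance = sum(p * p for p in projected) / n
    return projected, kept_variance / (a + c)

# Hypothetical 2-D data that lies close to a straight line.
data = [(1, 2.1), (2, 3.9), (3, 6.2), (4, 7.8), (5, 10.1)]
reduced, kept = pca_2d_to_1d(data)
print(f"{len(reduced)} points reduced to 1-D, variance kept: {kept:.1%}")
```

Because these points lie almost on a line, one dimension captures nearly all of the spread. For data scattered evenly in every direction, much more information would be lost, as with the photograph of the ball.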
3. Reinforcement Learning
Reinforcement learning is a machine learning training
method based on rewarding desired behaviors and/or
punishing undesired ones.
In general, a reinforcement learning agent is able to
perceive and interpret its environment, take actions and
learn through trial and error.
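Trial-and-error learning with rewards and punishments can be sketched with tabular Q-learning, one common reinforcement learning method (an assumed choice, as no algorithm is named above). The environment is a hypothetical corridor of five cells: the agent starts at the left end, is rewarded for reaching the right end, and is lightly punished for every other step. The learning rate, discount and exploration values are illustrative.

```python
import random

def train_corridor_agent(episodes=300, length=5, seed=0):
    """Tabular Q-learning on a corridor: start at cell 0, reward at the last cell."""
    rng = random.Random(seed)
    actions = (-1, +1)                       # step left or step right
    q = [[0.0, 0.0] for _ in range(length)]  # q[state][action index]
    alpha, gamma, epsilon = 0.5, 0.9, 0.2    # learning rate, discount, exploration

    for _ in range(episodes):
        state = 0
        while state != length - 1:
            # Trial and error: sometimes explore, otherwise use what was learned.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                choice = rng.randrange(2)
            else:
                choice = 0 if q[state][0] > q[state][1] else 1
            # Moving past either end of the corridor leaves the agent in place.
            next_state = min(max(state + actions[choice], 0), length - 1)
            # Reward reaching the goal, lightly punish every other step.
            reward = 1.0 if next_state == length - 1 else -0.01
            q[state][choice] += alpha * (
                reward + gamma * max(q[next_state]) - q[state][choice]
            )
            state = next_state
    return q

q_table = train_corridor_agent()
# Preferred action in every non-goal cell after training.
policy = ["left" if left > right else "right" for left, right in q_table[:-1]]
print(policy)
```

Nobody tells the agent which way to go. It discovers the preferred direction only from the rewards and penalties it collects.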
Evaluation