
C10 AI Projectcycle Unit2

The document outlines the AI project cycle, detailing key components such as problem scoping, data acquisition, data exploration, and modeling. It emphasizes the importance of understanding problems through the 4Ws framework and discusses various AI modeling approaches, including rule-based and learning-based methods. Additionally, it highlights the significance of evaluating AI models using metrics like accuracy and precision to ensure their effectiveness.

Uploaded by

abbad2160
Copyright
© © All Rights Reserved

INDIAN SCHOOL MUSCAT

CLASS X

ARTIFICIAL INTELLIGENCE
INTRODUCTION TO AI PROJECT CYCLE
OBJECTIVES
• BASICS OF AI PROJECT CYCLE
• PROBLEM SCOPING
• DATA ACQUISITION
• DATA EXPLORATION
• DATA MODELLING
• EVALUATION AND DEVELOPMENT
AI PROJECT LIFE CYCLE

The AI project cycle is the life cycle of an AI project.

A project cycle is a step-by-step process for solving
problems using proven scientific methods and drawing
inferences from the results.
COMPONENTS OF PROJECT CYCLE
The components of the project cycle are the steps
that contribute to completing the project: problem
scoping, data acquisition, data exploration, modelling
and evaluation.
What is Problem Scoping?

Problem scoping is the process by which we figure out
the problem that we need to solve.
Sustainable Development Goals

It is a fact that we are surrounded by problems.

The Sustainable Development Goals (SDGs) are a
collection of seventeen (17) interlinked objectives
announced by the United Nations to transform our
world with peace and prosperity.
The aim is to achieve these goals by the end of 2030.
Sustainable Development Goals

Many of these goals correspond to problems that we
can observe around us too. One should look for such
problems and try to solve them, as this would make
many lives better and help our country achieve these
goals.
4Ws Of Problem Scoping

The 4Ws of Problem Scoping are Who, What, Where and
Why. These 4Ws help in identifying and understanding
the problem in a better and more efficient manner.
4Ws Of Problem Scoping

1. Who - The "Who" part helps us in comprehending and
categorizing who all are affected, directly and
indirectly, by the problem. They are called the
stakeholders. Stakeholders are the people who face
this problem and would benefit from the solution.
2. What - The "What" part helps us in understanding
and identifying the nature of the problem.
3. Where - The "Where" part looks at where the problem
arises: the situation and the location.
4. Why - The "Why" part asks why the given problem is
worth solving. It speaks about the benefits which the
stakeholders would get from the solution and how it
will benefit them as well as society.
Problem Statement Template

The Problem Statement Template is the summary of all
the key points of the 4Ws in one single template.

If the same problem occurs again in the future, this
template makes it easier to fix.
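As a rough illustration of how the 4Ws can be collected into one template, here is a minimal Python sketch. The class name `ProblemStatement`, the wording of the summary sentence and the canteen example are assumptions made for illustration; the slides do not prescribe a particular wording.

```python
from dataclasses import dataclass


@dataclass
class ProblemStatement:
    """Holds the 4Ws for one problem (all values below are illustrative)."""
    who: str    # stakeholders affected by the problem
    what: str   # nature of the problem
    where: str  # situation or location in which it arises
    why: str    # benefit of solving it

    def summary(self) -> str:
        # Join the four answers into a single readable statement.
        # The sentence pattern is an assumption, not taken from the slides.
        return (
            f"Our {self.who} face the problem of {self.what} "
            f"when {self.where}. An ideal solution would {self.why}."
        )


statement = ProblemStatement(
    who="students",
    what="long queues at the canteen",
    where="the lunch break begins",
    why="let them collect food without missing class time",
)
print(statement.summary())
```

Keeping the four answers as separate fields means the same record can be looked up and reused if the problem comes back later.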
DATA ACQUISITION
Data Acquisition is the process of collecting
accurate and reliable data for the project.
Data refers to raw facts or figures. It can be of any
form, such as text, video, images or audio, and it can
be collected from various sources like the internet,
journals, newspapers and so on.
DATA FEATURES
Data features refer to the type of data collected.

Data is further classified as either training data or testing data:

1. Training Data: the input data fed to the system, with which the
machine is trained.
2. Testing Data: data that is kept apart from training and used to
check the trained machine's predictions against known values.
DATA FEATURES

E.g.:
Suppose we want to make an artificially intelligent system that can
predict the salary of an employee based on his or her previous salaries.
The AI machine is trained with the data of the employee's previous
salaries, on the basis of which it predicts the next salary.
The previous salary data used for training is known as the Training
Data, while the salary data against which the next-salary prediction
is checked is known as the Testing Data.
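The separation of training and testing data can be sketched in a few lines of Python. This is only a minimal sketch: the salary figures, the 80/20 split and the helper name `split_data` are assumptions chosen for illustration.

```python
import random

# Hypothetical monthly salaries of one employee (illustrative values only).
salaries = [30000, 31000, 32500, 33000, 34500, 36000, 37000, 38500, 40000, 41000]


def split_data(records, test_fraction=0.2, seed=0):
    """Shuffle a copy of the records and hold back a fraction for testing."""
    shuffled = list(records)
    random.Random(seed).shuffle(shuffled)
    test_size = max(1, int(len(shuffled) * test_fraction))
    # Everything after the held-back slice is used for training.
    return shuffled[test_size:], shuffled[:test_size]


training_data, testing_data = split_data(salaries)
print("Training records:", len(training_data))
print("Testing records:", len(testing_data))
```

The machine only ever sees `training_data` while learning; `testing_data` is used afterwards to check how good its predictions are.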
DATA FEATURES
For better efficiency of an AI project, the training data needs to be
relevant and authentic. In the previous example, if the training data
were not the employee's previous salaries but his or her expenses, the
machine would not predict the next salary correctly, since the whole
training would have gone wrong. Similarly, if the previous salary data
were not authentic, that is, not correct, the prediction could also go
wrong.

Hence, for any AI project to be efficient, the training data should
be authentic and relevant to the problem statement scoped.
DATA SOURCES

There are various ways in which you can collect data.

Sometimes, you use the internet and try to acquire data for
your project from random websites. Such data might not be
authentic, as its accuracy cannot be verified. It is therefore
necessary to find a reliable source of data from which
authentic information can be taken. At the same time, we
should make sure that the data we collect is open-sourced
and not someone's property. Extracting private data can be
an offence.
DATA SOURCES
Some of the most reliable and authentic sources of
information are the open-sourced websites hosted by
the government. These government portals have
general information collected in a suitable format,
which can be downloaded and used wisely.
Some of the open-sourced Govt. portals are:
data.gov.in, india.gov.in
Data Exploration
We have now set the goal of our project and have also
found ways to acquire data. Data is a complex entity:
it is full of numbers, and anyone who wants to make
sense of it has to work out the patterns in it. For
example, if you go to the library and pick up a random
book, you first go through its content quickly by
turning the pages and reading the description before
borrowing it, because this helps you understand
whether the book suits your needs and interests.
Data Exploration
Thus, to analyse the data, you need to visualise
it in some user-friendly format so that you can:
● Quickly get a sense of the trends, relationships
and patterns contained within the data.
● Define strategy for which model to use at a
later stage.
● Communicate the same to others effectively.
To visualise data, we can use various types of
visual representations.
Data Exploration
Some visual representations of data:

1. Bar graphs
2. Histograms
3. Scatter plots
4. Pie charts, etc.
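To give a feel for the first representation, here is a minimal Python sketch that draws a bar graph as plain text. The survey answers and the helper name `text_bar_graph` are hypothetical; a plotting library would normally be used for real charts.

```python
from collections import Counter

# Hypothetical survey answers: how students travel to school.
responses = ["bus", "car", "bus", "walk", "bus", "car", "cycle", "walk", "bus"]

counts = Counter(responses)


def text_bar_graph(counts):
    """Return one line per category, with one '#' for every observation."""
    lines = []
    for category, count in counts.most_common():
        lines.append(f"{category:<6} {'#' * count} ({count})")
    return lines


for line in text_bar_graph(counts):
    print(line)
```

Even in this simple form, the longest bar shows at a glance which category is the most common.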
Modelling
Graphical representation makes the data
understandable for humans, as we can discover
trends and patterns in it. A machine, however,
needs the data in the most basic form of numbers
(binary, that is, 0s and 1s) in order to access
and analyse it, and it discovers patterns and
trends in the data through mathematical
representations of the same.
Modelling
The ability to mathematically describe the
relationship between parameters is the heart of
every AI model. Thus, whenever we talk about
developing AI models, it is the mathematical
approach towards analysing data which we refer
to.
Generally, AI models can be classified as follows:
AI Models
• Rule Based
• Learning Based
   • Machine Learning
   • Deep Learning
Rule Based Approach

Refers to AI modelling where the rules are defined
by the developer.
The machine follows the rules or instructions mentioned
by the developer and performs its task accordingly.
For example, we have a dataset which tells us about the
conditions on the basis of which we can decide whether an
elephant may be spotted or not while on safari. The
parameters are: Outlook, Temperature, Humidity and
Wind.
Rule Based Approach
Now, let’s take the various possible values of these
parameters and see in which cases the elephant may be
spotted and in which cases it may not. After looking
through all the cases, we feed this data into the machine
along with the rules which tell the machine all the
possibilities. The machine trains on this data and is now
ready to be tested.
Rule Based Approach
While testing the machine, we tell the machine that
Outlook = Overcast; Temperature = Normal; Humidity =
Normal and Wind = Weak. On the basis of this testing
dataset, the machine will be able to tell whether an
elephant may be spotted or not and will display the
prediction to us. This is known as a rule-based approach
because we fed the data along with the rules to the
machine, and the machine, after getting trained on them,
is able to predict answers for the same.
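A rule-based model of this kind can be sketched as a plain Python function. The slides do not list the actual safari dataset, so the rules below are made up purely for illustration, and the function name `elephant_may_be_spotted` is hypothetical.

```python
# Hypothetical rules written by the developer. The slides do not list the
# actual safari dataset, so these conditions are made up for illustration.
def elephant_may_be_spotted(outlook, temperature, humidity, wind):
    """Apply fixed, hand-written rules and return True, False or None."""
    # Note: in these made-up rules the temperature happens not to matter.
    if outlook == "Overcast":
        return True
    if outlook == "Sunny":
        return humidity == "Normal"
    if outlook == "Rain":
        return wind == "Weak"
    # No rule covers this case, and the machine cannot learn a new one.
    return None


prediction = elephant_may_be_spotted(
    outlook="Overcast", temperature="Normal", humidity="Normal", wind="Weak"
)
print("Elephant may be spotted:", prediction)

# An input outside the rules gets no answer: the rules are static.
print("Unknown outlook:", elephant_may_be_spotted("Foggy", "Normal", "High", "Weak"))
```

Every answer comes from a condition that the developer typed in; nothing in the function changes when new data arrives.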
Rule Based Approach
A drawback (or feature) of this approach is that the
learning is static. Once trained, the machine does not
take into consideration any changes made to the original
training dataset. That is, if you test the machine on a
dataset which is different from the rules and data you
fed it at the training stage, the machine will fail and
will not learn from its mistake. Once trained, the model
cannot improve itself on the basis of feedback.
Rule Based Approach
Thus, machine learning is introduced as an extension of
this approach. In machine learning, the machine adapts to
changes in the data and rules and follows the updated
path, while a rule-based model only does what it was
taught once.
Learning Based Approach
Refers to AI modelling where the machine learns by
itself. Under the learning-based approach, the AI model
gets trained on the data fed to it and is then able to
design a model which adapts to changes in the data.
That is, if the model is trained with X type of data and
the machine designs the algorithm around it, the model
modifies itself according to the changes which occur in
the data, so that exceptions are also handled.
For example, suppose you have a dataset comprising 100
images each of apples and bananas.
Learning Based Approach
These images depict apples and bananas in various
shapes and sizes. These images are then labelled as
either apple or banana so that all apple images are
labelled ‘apple’ and all the banana images have ‘banana’
as their label. Now, the AI model is trained with this
dataset and the model is programmed in such a way
that it can distinguish between an apple image and a
banana image according to their features and can
predict the label of any image which is fed to it as an
apple or a banana.
Learning Based Approach
After training, the machine is fed with testing data.
The testing data might not contain images similar to
the ones on which the model has been trained. So, the
model relies on the features it learned during training
and accordingly predicts whether the image is of an
apple or a banana. In this way, the machine learns by
itself and adapts to the new data which is flowing in.
This is the machine learning approach, which makes the
model dynamic.
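The apple and banana example can be sketched with a very small learning-based model. This is a minimal sketch under strong assumptions: each image is assumed to be already reduced to two made-up features (roundness and yellowness), and the model is a simple nearest-centroid classifier, which is only one of many possible learning algorithms and is not named in the slides.

```python
# Each image is reduced to two hypothetical features:
# (roundness from 0 to 1, yellowness from 0 to 1). Values are made up.
training_data = [
    ((0.90, 0.20), "apple"),
    ((0.85, 0.10), "apple"),
    ((0.95, 0.30), "apple"),
    ((0.30, 0.90), "banana"),
    ((0.20, 0.80), "banana"),
    ((0.25, 0.95), "banana"),
]


def train(examples):
    """Learn one average feature vector (centroid) per label."""
    sums, counts = {}, {}
    for features, label in examples:
        total = sums.setdefault(label, [0.0] * len(features))
        for i, value in enumerate(features):
            total[i] += value
        counts[label] = counts.get(label, 0) + 1
    return {label: [s / counts[label] for s in total] for label, total in sums.items()}


def predict(model, features):
    """Return the label whose centroid is closest to the given features."""
    def squared_distance(centroid):
        return sum((a - b) ** 2 for a, b in zip(centroid, features))
    return min(model, key=lambda label: squared_distance(model[label]))


model = train(training_data)
# Test images that were not part of the training data.
print(predict(model, (0.80, 0.25)))
print(predict(model, (0.35, 0.85)))
```

Unlike a rule-based model, nobody writes the apple and banana rules here: the centroids come from the labelled examples, so retraining on new data changes the model.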
Learning Based Approach
The learning-based approach can further be divided into
three parts:

Supervised learning
Unsupervised learning
Reinforcement learning
1. Supervised Learning
In a supervised learning model, the dataset which is fed
to the machine is labelled. In other words, the dataset is
known to the person who is training the machine; only
then is he/she able to label the data. A label is some
information which can be used as a tag for data. For
example, students get grades according to the marks they
secure in examinations. These grades are labels which
categorise the students according to their marks.
1. Supervised Learning
There are two types of Supervised
Learning models:
Classification: Where the data is
classified according to the labels. For
example, in the grading system,
students are classified on the basis
of the grades they obtain with
respect to their marks in the
examination. These models work on
discrete data: the output is one of a
fixed set of labels rather than a
continuous value.
1. Supervised Learning
Regression: Such models work on continuous data.
For example, if you wish to predict your next salary,
you would put in the data of your previous salary,
any increments, etc., and train the model. Here, the
data which has been fed to the machine is continuous.
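The salary prediction can be sketched as a regression problem in Python. This is a minimal sketch assuming a straight-line relationship fitted by ordinary least squares; the salary values and the function name `fit_line` are made up for illustration.

```python
# Hypothetical yearly salaries (in thousands); the values are illustrative.
years = [1, 2, 3, 4, 5]
salaries = [30.0, 32.0, 34.0, 36.0, 38.0]


def fit_line(xs, ys):
    """Ordinary least squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    variance = sum((x - mean_x) ** 2 for x in xs)
    slope = covariance / variance
    intercept = mean_y - slope * mean_x
    return slope, intercept


slope, intercept = fit_line(years, salaries)
next_salary = slope * 6 + intercept
print(f"Predicted salary for year 6: {next_salary:.1f}")
```

The output is a number on a continuous scale, which is what separates regression from classification.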
2. Unsupervised Learning
An unsupervised learning model works on an unlabelled
dataset. This means that the data which is fed to the
machine carries no labels, and the person who is training
the model may not have any information regarding it.
Unsupervised learning models are used to identify
relationships, patterns and trends in the data which is fed
into them. They help the user understand what the data is
about and what major features the machine has identified
in it.
2. Unsupervised Learning
For example, suppose you have a random set of 1000 dog
images and you wish to find some pattern in it. You would
feed this data into the unsupervised learning model and
train the machine on it. After training, the machine would
come up with the patterns which it was able to identify.
The machine might come up with patterns which are already
known to the user, like colour, or it might even come up
with something unusual, like the size of the dogs.
2. Unsupervised Learning
Unsupervised learning models can be further
divided into two categories:
Clustering: Refers to the unsupervised
learning algorithm which can cluster the
unknown data according to the patterns or
trends identified in it. The patterns
observed might be ones which are already
known to the developer, or the algorithm
might even come up with some unique
patterns.
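Clustering can be sketched with a small k-means routine working on plain numbers. This is a minimal sketch: k-means is only one clustering algorithm and is not named in the slides, the heights are made-up values, and the helper `k_means_1d` assumes at least two clusters.

```python
# Hypothetical heights (in cm) with no labels attached.
heights = [150, 152, 155, 158, 180, 182, 185, 188]


def k_means_1d(values, k=2, iterations=10):
    """Group numbers into k clusters (k >= 2) by updating cluster centres."""
    ordered = sorted(values)
    # Spread the starting centres evenly over the sorted values.
    centres = [float(ordered[i * (len(ordered) - 1) // (k - 1)]) for i in range(k)]
    clusters = []
    for _ in range(iterations):
        clusters = [[] for _ in range(k)]
        for value in values:
            # Put each value into the cluster with the nearest centre.
            nearest = min(range(k), key=lambda i: abs(value - centres[i]))
            clusters[nearest].append(value)
        # Move each centre to the mean of its cluster (keep it if empty).
        centres = [
            sum(cluster) / len(cluster) if cluster else centres[i]
            for i, cluster in enumerate(clusters)
        ]
    return clusters, centres


clusters, centres = k_means_1d(heights)
print("Clusters:", clusters)
print("Centres:", centres)
```

No label such as "short" or "tall" was supplied; the two groups emerge from the data alone.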
2. Unsupervised Learning
Dimensionality Reduction: We humans are able to visualise up to
3 dimensions only, but according to many theories and
algorithms, there are various entities which exist beyond 3
dimensions. For example, in Natural Language Processing, words
are considered to be N-dimensional entities, which means that we
cannot visualise them as they exist beyond our visualisation
ability. Hence, to make sense of them, we need to reduce their
dimensions. Here, a dimensionality reduction algorithm is used.
As we reduce the dimensions of an entity, the information which
it contains starts getting distorted.
2. Unsupervised Learning
For example, if we have a ball in our hand, it is 3-dimensional
right now. But if we click its picture, the data transforms to 2-D, as
an image is a 2-dimensional entity. As soon as we reduce one
dimension, a large part of the information is lost, as we will no
longer know about the back of the ball. Was the ball of the same
colour at the back or not? Or was it just a hemisphere? If
we reduce the dimensions further, more and more information
will get lost.
Hence, to reduce the dimensions and still be able to make sense
of the data, we use Dimensionality Reduction.
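The loss of information in the ball example can be shown with a tiny Python sketch. It only drops one coordinate, which is the simplest possible projection and not a full dimensionality reduction algorithm; the coordinates and the function name `take_picture` are hypothetical.

```python
# Two hypothetical points on a ball: one on the front, one on the back.
# Coordinates are (x, y, z), where z is the depth away from the camera.
front_point = (0.5, 0.5, 1.0)
back_point = (0.5, 0.5, -1.0)


def take_picture(point):
    """Project a 3-D point to 2-D by dropping the depth coordinate."""
    x, y, _depth = point
    return (x, y)


print(take_picture(front_point))
print(take_picture(back_point))
# Both points land on the same 2-D position, so the image alone
# cannot tell the front of the ball from the back.
print(take_picture(front_point) == take_picture(back_point))
```

Dimensionality reduction algorithms try to choose what to drop more carefully, so that as little useful information as possible is lost.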
3. Reinforcement Learning
Reinforcement learning is a machine learning training
method based on rewarding desired behaviors and/or
punishing undesired ones.
In general, a reinforcement learning agent is able to
perceive and interpret its environment, take actions and
learn through trial and error.
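Learning through reward and punishment can be sketched with a very small agent. This is a minimal sketch under assumptions: the two-action environment, the reward values and the epsilon-greedy strategy are chosen for illustration and are not taken from the slides.

```python
import random

# Hypothetical environment: moving "right" is rewarded, "left" is punished.
REWARDS = {"left": -1.0, "right": 1.0}
ACTIONS = list(REWARDS)


def run_agent(steps=50, epsilon=0.1, seed=0):
    """Learn action values by trial and error with an epsilon-greedy policy."""
    rng = random.Random(seed)
    values = {action: 0.0 for action in ACTIONS}  # estimated reward per action
    tries = {action: 0 for action in ACTIONS}
    for _ in range(steps):
        if rng.random() < epsilon:
            action = rng.choice(ACTIONS)           # explore: try something random
        else:
            action = max(ACTIONS, key=values.get)  # exploit: best known action
        reward = REWARDS[action]
        tries[action] += 1
        # Move the estimate towards the running average of observed rewards.
        values[action] += (reward - values[action]) / tries[action]
    return values


values = run_agent()
best_action = max(values, key=values.get)
print("Learned values:", values)
print("Preferred action:", best_action)
```

The agent is never told which action is correct; it works this out from the rewards and punishments it receives.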
Evaluation

Once a model has been made and trained, it needs to go
through proper testing so that one can calculate the
efficiency and performance of the model. Hence, the
model is tested with the help of Testing Data (which was
separated out of the acquired dataset at the Data
Acquisition stage) and the efficiency of the model is
calculated on the basis of the parameters mentioned below:
Evaluation

• Accuracy
• Precision
• Recall
• F1 Score
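The four parameters can be computed from the actual labels of the testing data and the labels the model predicted. The sketch below uses the standard definitions based on true and false positives and negatives; the label lists and the function name `evaluate` are hypothetical.

```python
def evaluate(actual, predicted, positive="yes"):
    """Compute accuracy, precision, recall and F1 score for two label lists."""
    pairs = list(zip(actual, predicted))
    tp = sum(1 for a, p in pairs if a == positive and p == positive)
    tn = sum(1 for a, p in pairs if a != positive and p != positive)
    fp = sum(1 for a, p in pairs if a != positive and p == positive)
    fn = sum(1 for a, p in pairs if a == positive and p != positive)

    accuracy = (tp + tn) / len(pairs)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


# Hypothetical testing data: actual labels and the model's predictions.
actual = ["yes", "yes", "yes", "no", "no", "no", "yes", "no"]
predicted = ["yes", "no", "yes", "no", "yes", "no", "yes", "no"]

scores = evaluate(actual, predicted)
for name, value in scores.items():
    print(f"{name}: {value:.2f}")
```

Accuracy counts all correct predictions, precision and recall look only at the positive class, and the F1 score combines precision and recall into one number.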
