AI PROJECT CYCLE-1 Class 9
AI PROJECT CYCLE-1 Class 9
1. Problem Scoping-
Identifying a problem and having a vision to solve it is called Problem Scoping.
Scoping a problem is not that easy as we need to have a deep or understanding so
that the picture becomes clear while wear working to solve it. So, we use 4w
problem Canvas to understand the problem in a better way.
4-W Problem Canvas: It helps in identifying the key elements related to the
problem. The 4 W’s- Who, What , Where and Why.
i. Who? : This block helps in analysing the people who are getting affected directly or
indirectly due to a problem. Under this, we find out who are the 'Stakeholders' (those
people who face this problem and would be benefitted with the solution) to this
problem? Below are the questions that we need to discuss under this block.
1. Who are the stakeholders?
2. What do you know about them?
ii. What? : This block helps to determine the nature of the problem. What is the
problem and how do we know that it is a problem? Under this block, we also gather
evidence to prove that the problem you have selected actually exists. Below are the
questions that we need to discuss under this block.
1. What is the problem?
2. How do you know that it is a problem?
iii. Where? : This block will help us to look into the situation in which the problem
arises, the context of it, and the locations where it is prominent. Here is the Where
Canvas:
1. What is the context/situation in which the stakeholders experience the
problem?
iv. Why? : In the "Why" canvas, we think about the benefits which the stakeholders
would get from the solution and how it will benefit them as well as the society. Below
are the questions that we need to discuss under this block.
1. What would be of key value to the stakeholders?
2. How would it improve their situation?
2. Data Acquisition- This is the second stage of Al Project cycle. According to the term,
this stage is about acquiring data for the project. Whenever we want an Al project to be
able to predict an output, we need to train it first using data.
Types of Data- Two types of data is used while preparing Artificially Intelligent system.
(a) Trained Data
(b) Testing Data
(a) Training Data-It’s a portion of our actual dataset that is fed into the machine learning
model to discover and learn patterns. For better efficiency of an AI project, the
Training data needs to be relevant and authentic.
(b) Testing Data-The data which is used to check the model’s performance is known as
Testing Data.
For example, If you want to make an Artificially Intelligent system which can predict the
salary of any employee based on his previous salaries, you would feed the data of his
previous salaries into the machine. This is the data with which the machine can be
trained. Now, once it is ready, it will predict his next salary efficiently. The previous
salary data here is known as Training Data while the next salary prediction data set is
known as the Testing Data.
Data features refer to the type of data you want to collect. In above example, data
features would be salary amount, increment percentage, increment period, bonus, etc.
There can be various ways to collect the data. Some of them are:
1. Surveys
2. Web Scraping
3. Sensors
4. Cameras
5. Observations
6. API (Application Program Interface)
One of the most reliable and authentic sources of information, are the open-sourced
websites hosted by the government. Some of the open-sourced Govt. portals are:
data.gov.in, india.gov.in.
3. Data Exploration- While acquiring data, we must have noticed that the data is a
complex entity — it is full of numbers and if anyone wants to make some sense out
of it, they have to work some patterns out of it. Thus, to analyse the data, you need to
visualize it in some user-friendly format so that you can:
i. Quickly get a sense of the trends, relationships and patterns contained within
the data.
ii. Define strategy for which model to use at a later stage.
iii. Communicate the same to others effectively.
To visualize data, we can use various types of visual representations like Bar graph,
Histogram, Line Chart, Pie Chart.
4. Data Modelling- The graphical representation makes the data understandable for
humans as we can discover trends and patterns out of it, but machine can analyse the
data only when the data is in the most basic form of numbers(0&1). The ability to
mathematically describe the relationship between parameters is the heart of every AI
model.
Generally, AI models can be classified as follows:
An advantage for this approach is that the learning is dynamic. The learning-based
approach can further be divided into three parts:
a) Supervised Learning : In a supervised learning model, the dataset which is fed to
the machine is labelled. A label is some information which can be used as a tag for data.
For example, students get grades according to the marks they secure in examinations.
These grades are labels which categorize the students according to their marks. There
are two types of Supervised Learning models:
i. Classification: Where the data is classified according to the labels. This model works
on discrete dataset which means the data need not be continuous. For example, in the
grading system, students are classified on the basis of the grades they obtain with
respect to their marks in the examination.
ii. Regression: Such models work on continuous data. For example, if we wish to
redirect our next salary, then we would put in the data of our previous salary, any
increments, etc., and would train the model. Here, the data which has been fed to the
machine is continuous.
b) Unsupervised Learning- An unsupervised learning model works on unlabelled
dataset. This means that the data which is fed to the machine is random. This model is
used to identify relationships, patterns and trends out of the data which is fed into it. It
helps the user in understanding what the data is about and what are the major features
identified by the machine in it.
Unsupervised learning models can be done by using technique:
i. Clustering: It refers to the unsupervised learning algorithm which can cluster the
unknown data according to the patterns or trends identified out of it.
c) Reinforcement Learning- Reinforcement learning (RL) refers to a sub-field of
machine learning that enables AI-based systems to take actions in a dynamic
environment through trial and error to maximize the collective rewards based on the
feedback generated for individual activities. In the RL context, feedback refers to a
positive or negative notion reflected through rewards or punishments.
Differentiate between Rule Based and Learning Based Approach-
Rule Based Learning Based
It refers to the Al modelling where the It refers to the Al modelling where the
rules are defined by the developer. machine learns by itself
The machine once trained, does not take The machine once trained, does take into
into consideration any changes made in consideration any changes made in the
the original training dataset. original training dataset.
5. Evaluation- Once a model has been made and trained, it needs to go through proper
testing so that one can calculate the efficiency and performance of the model. Hence,
the model is tested with the help of Testing Data (which was separated out of the
acquired dataset at Data Acquisition stage) and the efficiency of the model is
calculated on the basis of the parameters mentioned below: