02 Ai Project Cycle Important Questions Answers 1
2. What are the various parameters which affect the problem ____________.
a. You need to acquire data which will become the base of your project
b. You go for data acquisition by collecting data from various reliable and authentic sources.
c. After exploring the patterns, you can decide upon the type of model you would build to achieve the
goal.
d. All of the above
Ans: d. All of the above
3. You need to __________ which will become the base of your project as it will help you in
understanding what the parameters that are related to problem scoping are.
a. Acquire Data
b. Database
c. Data Mining
d. None of the above
Ans: a. Acquire Data
5. Once the __________ is complete, you now need to test your model on some newly fetched data.
a. Data Acquisition
b. Modelling
c. Data Mining
d. None of the above
Ans: b. Modelling
7. The ________ Problem canvas helps in identifying the key elements related to the problem.
a. 4Ws
b. 6Ws
c. 2Ws
d. 3Ws
Ans: a. 4Ws
Prepared by: M. S. KumarSwamy, TGT(Maths)
8. _________ are the people who face this problem and would be benefited with the solution.
a. Key Persons
b. Stakeholders
c. End user
d. None of the above
Ans: b. Stakeholders
9. ________ helps in analyzing the people getting affected directly or indirectly due to it.
a. Who
b. What
c. Where
d. Why
Ans: a. Who
11. “What” block helps to gather evidence from __________ to prove that the problem you have
selected actually exists.
a. Media
b. Announcements
c. Newspaper & Articles
d. All of the above
Ans: d. All of the above
12. The “Where” block will help you look into the situation in which the ___________ where it is
prominent.
a. Problem arises
b. The context of it
c. The locations
d. All of the above
Ans: d. All of the above
13. In the “Why” block of the canvas, which of the following is the base of problem solving?
a. Who the people are that would benefit from the solution
b. What is to be solved
c. Where will the solution be deployed
d. All of the above
Ans: d. All of the above
14. After filling the 4Ws Problem canvas, you now need to summarize all the cards into one
_________.
a. Template
b. Situation
c. Both a) and b)
d. None of the above
Ans: a. Template
17. ___________ can be a piece of information or facts and statistics collected together for reference
or analysis.
a. Database
b. Data
c. Data Type
d. None of the above
Ans: b. Data
18. Whenever we want an AI project to be able to predict an output, we need to _______ it first using
data.
a. Analyze
b. Train
c. Explore
d. All of the above
Ans: b. Train
19. You would feed the data into the machine. This is the data with which the machine can be trained.
Now, once it is ready, it will predict the next data efficiently. This previous data is known as
___________.
a. Testing Data
b. Training Data
c. Exploring Data
d. All of the above
Ans: b. Training Data
20. You would feed the data into the machine. This is the data with which the machine can be trained.
Now, once it is ready, it will predict the next data efficiently. This next data is known as
___________.
a. Testing Data
b. Training Data
c. Exploring Data
d. All of the above
Ans: a. Testing Data
21. For better efficiency of an AI project, the _________ needs to be relevant and authentic.
a. Testing Data
b. Training Data
c. Exploring Data
24. Sometimes, you use the internet and try to acquire data for your project from some random
websites. Such data might not be authentic as its accuracy cannot be proved. Due to this, it becomes
necessary to find a ____________.
a. Reliable source
b. Random source
c. Unauthorized source
d. All of the above
Ans: a. Reliable source
25. One of the most reliable and authentic sources of information from which we can download
authentic data for our project is ____________.
a. Private websites
b. Government websites
c. Personal websites
d. None of the above
Ans: b. Government websites
26. Data is a complex entity – it is full of numbers and if anyone wants to make some sense out of it,
they have to work some patterns out of it.
a. Data acquiring
b. Data mining
c. Data analysis
d. None of the above
Ans: a. Data acquiring
27. The __________ makes the data understandable for humans as we can discover trends and
patterns out of it.
a. Random Data
b. Graphical Representation
c. Unstructured Data
d. None of the above
Ans: b. Graphical Representation
30. AI modelling where the rules are defined by the developer is known as __________.
a. Rule Based Approach
b. Learning based Approach
c. Both a) and b)
d. None of the above
Ans: a. Rule Based Approach
31. _______ which tells us about the conditions on the basis of which we can decide
a. Dataset
b. Rule Based
c. Learning based
d. None of the above
Ans: a. Dataset
33. Which one of the following is correct for Learning based approach?
a. Supervised Learning
b. Unsupervised Learning
c. Reinforcement Learning
d. All of the above
Ans: d. All of the above
34. In a _________ model, the dataset which is fed to the machine is labeled.
a. Supervised Learning
b. Unsupervised Learning
c. Reinforcement Learning
d. All of the above
Ans: a. Supervised Learning
36. Where the data is classified according to the labels is known as ________.
a. Classification
39. The ______________ models are used to identify relationships, patterns and trends out of the data
which is fed into it. It helps the user in understanding what the data is about and what are the major
features identified by the machine in it.
a. Unsupervised learning
b. Supervised learning
c. Reinforcement Learning
d. All of the above
Ans: a. Unsupervised learning
42. Unsupervised learning algorithm which can cluster the unknown data according to the patterns or
trends identified out of it is known as ___________.
a. Clustering
b. Dimensionality Reduction
c. Both a) and b)
d. None of the above
Ans: a. Clustering
43. We humans are able to visualize up to 3-Dimensions only but according to a lot of theories and
algorithms, there are various entities which exist beyond 3-Dimensions.
a. Clustering
b. Dimensionality Reduction
44. In Unsupervised learning models, if we need to reduce their dimension, which algorithm do we
have to use?
a. Supervised algorithm
b. Dimensionality reduction algorithm
c. Clustering algorithm
d. None of the above
Ans: b. Dimensionality reduction algorithm
45. _________ helps to test data so that one can calculate the efficiency and performance of the
model.
a. Accuracy
b. Evaluation
c. Precision
d. None of the above
Ans: b. Evaluation
47. ___________ are loosely modelled after how neurons in the human brain behave.
a. Neural networks
b. Neural science
c. Neural Analysis
d. None of the above
Ans: a. Neural networks
48. The key advantage of ___________ is that they are able to extract data features automatically
without needing the input of the programmer.
a. Data Science
b. Deep Learning
c. Neural Network
d. All of the above
Ans: c. Neural Network
50. A Neural Network is divided into multiple layers and each layer is further divided into several
blocks called __________.
a. Nodes
b. Connector
52. The job of an ________ is to acquire data and feed it to the Neural Network.
a. Output Layer
b. Input Layer
c. Neural Layer
d. All of the above
Ans: b. Input Layer
53. In a Neural Network, the ________ are the layers in which the whole processing occurs.
a. Output Layer
b. Input Layer
c. Hidden Layer
d. All of the above
Ans: c. Hidden Layer
55. In a Neural Network, each node of these hidden layers has its own _______ which it executes on
the data received from the input layer.
a. Machine Learning Methods
b. Machine Learning Approach
c. Machine Learning Algorithm
d. All of the above
Ans: c. Machine Learning Algorithm
56. The last hidden layer passes the final processed data to the _________ which then gives it to the
user as the final output.
a. Output Layer
b. Input Layer
c. Hidden Layer
d. All of the above
Ans: a. Output Layer
57. A secret AI hiring tool was being developed by Amazon. The machine learning experts
discovered a significant issue: their new recruiting engine disliked women. The system has already
learned that male candidates were preferred. The resumes with “women” on them were punished. As
a result, the tool failed. This is an example of___________.
a. Data Privacy
b. AI Access
58. Which of the following datasets can have a particular structure or pattern?
a. Semi Structure
b. Structured
c. Unstructured
d. Fully Structured
Ans: b. Structured
59. Which of the following models the connections or data patterns that the developer defined?
a. Rule Based Approach
b. Learning Based Approach
c. Knowledge Based Approach
d. Decision Based Approach
Ans: a. Rule Based Approach
60. It gives us a suitable framework that can help us get closer to the objective of our AI project.
a. 4Ws Canvas
b. AI Project Cycle
c. Project Model
d. AI Models
Ans: b. AI Project Cycle
61. The computer is trained using an enormous quantity of data in _________, which helps it train
itself based on the data.
a. Supervised learning
b. Deep learning
c. Classification
d. Unsupervised learning
Ans: b. Deep learning
62. Choose one of the following statements regarding the unsupervised learning-based model that is
false.
a. We can provide a very large data set.
b. The algorithm itself analyzes the data set and determines relationships within that data.
c. The labeled data is fed with some rules by the developers.
d. It lets us make predictions and improves the algorithm on its own.
Ans: c. The labeled data is fed with some rules by the developers.
64. Select the Decision Tree nodes from the list below.
a. Decision nodes
b. End nodes
c. Chance nodes
d. All of the above
Ans: d. All of the above
66. Which one of the following approaches is not taken into account while modelling:
a. Rule-based approach
b. Learning-based approach
c. Knowledge-based approach
d. All of these
Ans: c. Knowledge-based approach
75. The data that is provided as input during data collecting is referred to as ___________.
a. Testing Data
b. Training Data
c. Input data
d. None of the above
Ans: b. Training Data
76. The kind of data that is being gathered during data collection is referred to as __________.
a. System Mapping
b. Web Scraping
c. 4Ws Canvas
d. Data Features
Ans: d. Data Features
77. Initial problem definition is the first step in the AI process, which is afterwards
a. Designing >> Brainstorming >> Building
b. Designing >> Deploying >> Brainstorming
c. Brainstorming >> Designing >> Building
d. Designing >> Brainstorming >> Building
Ans: c. Brainstorming >> Designing >> Building
78. Sumit is studying the phases of an AI project. She was aware of the problem statement template,
but she is now attempting to recall it. For the problem statement template, which of the following
statements is true?
a. Help people in creating a single overview that includes all the important details
b. Help in looking back and analyzing the issue in the future
c. Contains fundamental information regarding the issue’s general dimensions.
d. All of the above
Ans: d. All of the above
79. Data was entered into the system by Vikash, who is presently receiving the results, which she will
examine. The name of this result set is _______.
a. Result Set
b. Database
c. Training Data
d. Testing Data
80. Manish works at Nirma Constructions. He has a list of questions and inquiries that his clients can
respond to with a yes or no. This approach is called ____________.
a. Surveys
b. Web scraping
c. Sensors
d. None of the above
Ans: a. Surveys
81. The team leader Rakesh wants to use the observational method to gather data. Which tool from
the list below may be utilized for the same?
a. Website Article
b. Google Forms
c. Checklist
d. All of these
Ans: c. Checklist
83. Data can be directly downloaded from any website. What kinds of data are available for free
download and use?
a. Someone’s property
b. Data generated by a specific group
c. Open Source Data
d. Closed Source Data
Ans: c. Open Source Data
84. Amit has gathered information, but he discovered that the material he gathered is exceedingly
challenging to comprehend. This is because data is always ____________.
a. Complex entity
b. Filtered and bifurcated
c. Sophisticated structured
d. None of the above
Ans: a. Complex entity
85. When Rajesh explores data, she wants to compare the data and demonstrate certain cyclical
variations. Which graph from the following can be used?
a. Bar Graph
b. Line Graph
c. Pie Graph
d. Histogram Graph
Ans: a. Bar Graph
86. Select the five phases of the AI project cycle in the proper order.
a. Data Acquisition -> Problem Scoping -> Data Exploration -> Modelling -> Evaluation
b. Evaluation -> Problem Scoping -> Data Exploration -> Data Acquisition -> Modelling
c. Problem Scoping -> Data Acquisition -> Data Exploration -> Modelling -> Evaluation
d. Problem Scoping -> Data Exploration -> Data Acquisition -> Evaluation -> Modelling
87. In order to have a clearer view, we examine several parameters that have an impact on the
problem we’re trying to address under the _______ stage of the AI Project Cycle.
a. Data Exploration
b. Evaluation
c. Modelling
d. Problem Scoping
Ans: d. Problem Scoping
90. How many SDGs have been officially announced by the UN?
a. 18
b. 17
c. 16
d. 15
Ans: b. 17
91. People that experience the mentioned issue and would gain from the solution are referred to as
___________.
a. Key Persons
b. Stakeholders
c. End user
d. None of the above
Ans: b. Stakeholders
92. The person starting a project should be absolutely clear with ___________.
a. Problem Reasons
b. Problem Statement
c. Problem Solutions
d. None of the above
Ans: b. Problem Statement
95. Which block of the 4Ws problem canvas helps in analysing the people who are directly or
indirectly affected?
a. What
b. Who
c. Why
d. Where
Ans: b. Who
96. Which of the 4Ws of problem scoping is used to determine the nature of the problem?
a. What
b. Who
c. Why
d. Where
Ans: a. What
97. The ________ helps in collecting all the important information into a single template for problem
scoping.
a. Problem Taking Template
b. 4ws of problem scoping
c. Information Template
d. Problem Statement Template
Ans: b. 4ws of problem scoping
98. Which block of the 4ws problem canvas focuses on the problem’s context, circumstance, or
location?
a. What
b. Who
c. Why
d. Where
Ans: d. Where
102. Which of the following is not a reliable source for acquiring data?
a. System Hacking
b. Surveys
c. Website
d. None of the above
Ans: a. System Hacking
Q4. Under __________ we look at various parameters which affect the problem we wish to solve so
that the picture becomes clearer.
a. Evaluation
b. Modelling
c. Data Exploration
d. problem scoping
Ans: d. problem scoping
Q8. Which block of 4W helps in analysing the people getting affected directly or indirectly due to
problem?
a. Where
b. What
c. Who
d. Why
Ans: c. Who
Q9. Under the _______ block, you need to determine the nature of the problem.
a. Where
b. Why
c. What
d. Who
Ans: c. What
Q10. ___________ are the people who face this problem (identified by you) and would be benefited
with the solution.
a. Stakeholders
b. Staker
c. Problem holder
d. None of the above
Ans: a. Stakeholders
Q15. Ananya is talking about the various stages of AI Project cycle. She is telling that, in this stage
we acquire data for the project. She is talking about ______________ stage of AI Project Cycle.
a. Problem Scoping
b. Data Exploration
c. Data Acquisition
d. Modelling
Ans: c. Data Acquisition
Q16. Aman wants to make an Artificially Intelligent system which can predict the salary of any
employee based on his previous salaries. He has to feed the data of his previous salaries. This is the
data with which the machine can be trained. The previous salary data here is known as ____________
while the next salary prediction data set is known as the ___________
a. Testing Data, Training Data
b. Training Data, Testing Data
c. Training Data, Next Data
d. First Data, Testing Data
Ans: b. Training Data, Testing Data
Q17. For better efficiency of an AI project, the training data needs to be ________.
a. relevant and useless
b. relevant and authentic
c. irrelevant and useful
d. relevant and not required
Ans: b. relevant and authentic
Q19. Ways by which you can collect data for your AI project are _____
a. Surveys
b. Cameras
c. Sensors
d. All of the above
Ans: d. All of the above
Q22. ________________ approach refers to the AI modelling where the rules are defined by the
developer.
a. Rule based
b. Learning based
c. Machine learning
d. Deep learning
Ans: a. Rule based
Q23. In the ___________ approach, we feed the data along with rules to the machine, and the machine, after
getting trained on them, is able to predict answers for the same.
a. Rule based
b. Learning based
c. Machine learning
d. Deep learning
Ans: a. Rule based
Q25. ________ refers to the AI modelling where the machine learns by itself
a. Rules based approach
b. Learning based approach
c. Deep learning
d. Machine learning
Ans: b. Learning based approach
Q28. In a ___________ learning model, the dataset which is fed to the machine is labelled.
a. Supervised
b. Unsupervised
c. Reinforcement
Q30. ________________ type of supervised learning model works on discrete dataset which means
the data need not be continuous.
a. Classification
b. Regression
c. Both of the above
d. None of the above
Ans: a. Classification
Q31. ______________ type of Supervised Learning models work only on continuous data.
a. Classification
b. Regression
c. Both of the above
d. None of the above
Ans: b. Regression
Q33. ________ means that the data which is fed to the machine is random and there is a possibility
that the person who is training the model does not have any information regarding it.
a. Labelled dataset
b. Partial dataset
c. Unlabelled dataset
d. Complete dataset
Ans: c. Unlabelled dataset
Q34. Aman has random data consisting of 1000 dog images. He wishes to find some pattern in it, so
he would feed this data into the ______ and would train the machine on it.
a. supervised learning model
b. unsupervised learning model
c. reinforcement learning model
d. None of the above
Ans: b. unsupervised learning model
Q35. Sonal wants to identify relationships, patterns and trends out of the random data. Which of the
following learning model is suitable for her?
a. Supervised
b. Unsupervised
c. Reinforcement
d. None of the above
Q37. _________ refers to the unsupervised learning algorithm which can cluster the unknown data
according to the patterns or trends identified out of it.
a. Clustering
b. Dimensionality Reduction
c. Non Clustering
d. None of the above
Ans: a. Clustering
Q41. Which of the following parameters is used to calculate the efficiency of the model?
a. Accuracy
b. Precision
c. F1 Score
d. All of the above
Ans: d. All of the above
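The three parameters named in Q41 can be computed by hand from a list of actual and predicted labels. The following is a minimal sketch in Python, assuming a two-class problem where the label 1 is the "positive" class. The function names and the sample labels are made up for illustration and do not come from any particular library.

```python
def confusion_counts(actual, predicted, positive=1):
    """Count true/false positives and negatives for a two-class problem."""
    tp = fp = fn = tn = 0
    for a, p in zip(actual, predicted):
        if p == positive and a == positive:
            tp += 1  # predicted positive, really positive
        elif p == positive:
            fp += 1  # predicted positive, really negative
        elif a == positive:
            fn += 1  # predicted negative, really positive
        else:
            tn += 1  # predicted negative, really negative
    return tp, fp, fn, tn


def evaluate(actual, predicted, positive=1):
    """Return accuracy, precision, recall and F1 score as a dictionary."""
    tp, fp, fn, tn = confusion_counts(actual, predicted, positive)
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F1 is the harmonic mean of precision and recall.
    if precision + recall:
        f1 = 2 * precision * recall / (precision + recall)
    else:
        f1 = 0.0
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Illustrative labels only (1 = positive, 0 = negative).
actual = [1, 0, 1, 1, 0, 0, 1, 0]
predicted = [1, 0, 0, 1, 0, 1, 1, 0]
scores = evaluate(actual, predicted)
print(scores)
```

With these sample labels there are 3 true positives, 3 true negatives, 1 false positive and 1 false negative, so all four scores come out to 0.75.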
Q43. A Neural Network is divided into multiple layers and each layer is further divided into several
blocks called _________
Q49. There can be multiple hidden layers in a neural network system. (T/F)
a. True
b. False
Ans: a. True
Q50. The _________ hidden layer passes the final processed data to the output layer which then gives
it to the user as the final output.
a. Second
b. First
c. Last
d. Third
Ans: c. Last
3. Name the 4Ws of problem canvases under the problem scoping stage of the AI Project Cycle.
a. Who, b. What, c. Where, d. Why
9. Which type of graphical representation suits best for continuous type of data like monthly
exam scores of a student?
Line graph
1. What are the two different approaches for AI modelling? Define them.
There are two approaches for AI Modelling; Rule Based and Learning Based.
The rule-based approach generates pre-defined outputs based on rules programmed by
humans, whereas the learning-based approach derives its own rules from the data and outputs used to
train the model.
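The difference between the two approaches can be sketched in a few lines of Python. This is only an illustration under assumed data: the temperature example, the 30-degree rule and the function names are hypothetical. In the first function the developer writes the rule; in the second the threshold is worked out from labelled examples.

```python
def rule_based_is_hot(temp_c):
    """Rule-based: the developer fixes the rule (30 degrees) in advance."""
    return temp_c >= 30


def learn_threshold(examples):
    """Learning-based: derive the cut-off from labelled (temperature, is_hot) data.

    The threshold is placed midway between the average 'hot' temperature
    and the average 'not hot' temperature.
    """
    hot = [t for t, label in examples if label]
    cold = [t for t, label in examples if not label]
    return (sum(hot) / len(hot) + sum(cold) / len(cold)) / 2


# Hypothetical labelled training data: (temperature in Celsius, is it hot?)
training = [(18, False), (22, False), (24, False),
            (32, True), (35, True), (38, True)]
threshold = learn_threshold(training)


def learned_is_hot(temp_c):
    """Use the threshold learned from the data instead of a hand-written rule."""
    return temp_c >= threshold


print(rule_based_is_hot(29), learned_is_hot(29), round(threshold, 2))
```

If the training data changes, the learned threshold changes with it, while the rule-based function stays the same until the developer edits it.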
OR
Rule Based Approach Refers to the AI modelling where the relationship or patterns in data are
defined by the developer. The machine follows the rules or instructions mentioned by the developer,
and performs its task accordingly. Whereas in Learning based approach, the relationship or patterns in
4. Mention the precautions to be taken while acquiring data for developing an AI Project.
It should come from an authentic source and be accurate. Look for redundant and irrelevant data
parameters that do not take part in prediction.
6. Write the names for missing stages in the given AI project cycle:
Ans:
OR
It can work with incomplete knowledge and may produce output even with incomplete information.
It has fault tolerance which means that corruption of one or more cells of ANN does not
prevent it from generating output.
It has the ability to learn events and make decisions by commenting on similar events.
It has Parallel processing capability i.e. ANN have numerical strength that can perform more
than one job at the same time.
OR
Neural Networks have the ability to learn by themselves and produce the output that is not
limited to the input provided to them.
The input is stored in its own networks instead of a database; hence the loss of data does not
affect its working.
These networks can learn from examples and apply them when a similar event arises, making
them able to work through real-time events.
Even if a neuron is not responding or a piece of information is missing, the network can detect
the fault and still produce the output.
They can perform multiple tasks in parallel without affecting the system performance
14. What are the different types of sources of data from where we can collect reliable and
authentic datasets? Explain in brief.
Data can be a piece of information or facts and statistics collected together for reference or analysis.
Whenever we want an AI project to be able to predict an output, we need to train it first using data.
There are many ways and sources from which we can collect reliable and authentic datasets,
namely surveys, web scraping, sensors, cameras, observations, research, investigation, APIs, etc.
Sometimes the Internet is also used to acquire data, but the most important point to keep in mind is that
the data should be taken from reliable and authentic websites only. Some reliable data sources are
the UN, Google Scholar, Finance, CIA, Data.gov, etc.
2. Explain the relation between data size and model performance of an Artificial Neural
Network.
The basis for any kind of AI development is a big dataset. The performance of any AI-based
application depends on the data supplied.
ANN models are also known as learning models and are used for prediction purposes. They are
mostly developed without paying much attention to the size of dataset needed to produce models of
high accuracy and better generalization, although the general belief is that a large dataset is needed to
construct a predictive learning model. Whether a dataset counts as large is, perhaps, circumstance
dependent, so what makes a dataset big or small is somewhat vague.
3. Draw the 4Ws problem canvas and explain each one of them briefly.
The 4Ws problem canvas is the basic template while scoping a problem and using this canvas, the
picture becomes clearer while we are working to solve it.
a) Who: The “Who” block helps you in analyzing the people getting affected directly or indirectly
by the problem. Under this, you find out who the ‘stakeholders’ to this problem are and what you know
about them. Stakeholders are the people who face this problem and would be benefitted with the
solution.
b) What: Under the “What” block, you need to look into what you have on hand. At this stage, you
need to determine the nature of the problem. What is the problem and how do you know that it is a
problem?
c) Where: In this block, you need to focus on the context/situation/location of the problem. It will
help you look into the situation in which the problem arises, the context of it, and the locations where
it is prominent.
d) Why: in the “Why” canvas, think about the benefits which the stakeholders would get from the
solution and how would it benefit them as well as the society.
5. What is an Artificial Neural Network? Explain the layers in an artificial neural network.
Artificial Neural Network: Modeled on the human brain, a Neural Network was built
to mimic the functionality of a human brain. Just as the human brain is a neural network made up of
multiple neurons, an Artificial Neural Network (ANN) is made up of multiple perceptrons.
A neural network consists of three important layers:
Input Layer: As the name suggests, this layer accepts all the inputs provided by the programmer.
Hidden Layer: Between the input and the output layer is a set of layers known as Hidden layers. In
this layer, computations are performed which result in the output. There can be any number of hidden
layers.
Output Layer: The inputs go through a series of transformations via the hidden layer which finally
results in the output that is delivered via this layer.
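The flow of data through the three kinds of layers can be sketched as a small forward pass in Python. This is a simplified illustration, not a trained network: the weights, biases and the choice of the sigmoid activation are assumptions made for the example.

```python
import math


def sigmoid(x):
    """Squash any number into the range 0 to 1."""
    return 1 / (1 + math.exp(-x))


def layer_forward(inputs, weights, biases):
    """Compute the output of one layer.

    Each node has its own list of weights (one per input) and one bias.
    """
    outputs = []
    for node_weights, bias in zip(weights, biases):
        total = sum(w * x for w, x in zip(node_weights, inputs)) + bias
        outputs.append(sigmoid(total))
    return outputs


def forward(inputs, layers):
    """Pass the inputs through every layer in order and return the final output."""
    values = inputs  # the input layer simply hands the data on
    for weights, biases in layers:
        values = layer_forward(values, weights, biases)
    return values


# Hypothetical network: 2 inputs -> hidden layer with 2 nodes -> 1 output node.
hidden = ([[0.5, -0.4], [0.3, 0.8]], [0.0, -0.1])
output = ([[1.0, -1.0]], [0.0])
result = forward([1.0, 2.0], [hidden, output])
print(result)
```

Adding more (weights, biases) pairs to the list passed to forward gives more hidden layers, which matches the point that there can be any number of them.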
OR
Supervised learning is a learning in which we teach or train the machine using data which is well
labelled that means some data is already tagged with the correct answer. After that, the machine is
provided with a new set of examples (data) so that supervised learning algorithm analyses the training
data (set of training examples) and produces a correct outcome from labelled data.
OR
In a supervised learning model, the dataset which is fed to the machine is labelled. It means some data
is already tagged with the correct answer. In other words, we can say that the dataset is known to the
person who is training the machine only then he/she is able to label the data.
Unsupervised Learning: An unsupervised learning model works on unlabeled dataset. This means
that the data which is fed to the machine is random and there is a possibility that the person who is
training the model does not have any information regarding it. The unsupervised learning models are
used to identify relationships, patterns and trends out of the data which is fed into it. It helps the user
in understanding what the data is about and what are the major features identified by the machine in
it.
OR
Unsupervised learning is the training of a machine using information that is neither classified nor
labelled and allowing the algorithm to act on that information without guidance. Here the task of the
machine is to group unsorted information according to similarities, patterns and differences without
any prior training of data.
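Grouping unlabelled data by similarity can be sketched with a very small clustering routine in Python. The version below is a simplified two-group variant of the k-means idea for single numbers; the function name and the sample heights are made up for illustration.

```python
def two_means(values, iterations=10):
    """Split unlabelled numbers into two groups by repeatedly:
    1. assigning each value to the nearer of two centres, and
    2. moving each centre to the average of its group.
    """
    low, high = min(values), max(values)  # starting centres
    near_low, near_high = list(values), []
    for _ in range(iterations):
        near_low = [v for v in values if abs(v - low) <= abs(v - high)]
        near_high = [v for v in values if abs(v - low) > abs(v - high)]
        if not near_high:  # all values identical: nothing to separate
            break
        low = sum(near_low) / len(near_low)
        high = sum(near_high) / len(near_high)
    return near_low, near_high


# Hypothetical unlabelled data: heights in cm with no tags attached.
heights = [150, 183, 152, 180, 155, 185]
group_a, group_b = two_means(heights)
print(group_a, group_b)
```

No labels are supplied at any point; the two groups emerge only from how close the values are to each other.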
8. Differentiate between classification and clustering algorithms with the help of suitable
examples.
Classification is a process of finding a function which helps in dividing the dataset into classes based
on different parameters. In Classification, a computer program is trained on the training dataset and
based on that training; it categorizes the data into different classes. The task of the classification
algorithm is to find the mapping function to map the input(x) to the discrete output(y).
Example: The best example to understand the Classification problem is Email Spam Detection. The
model is trained on the basis of millions of emails on different parameters, and whenever it receives a
new email, it identifies whether the email is spam or not. If the email is spam, then it is moved to the
Spam folder.
Regression is a process of finding the correlations between dependent and independent variables. It
helps in predicting the continuous variables such as prediction of Market Trends, prediction of House
prices, etc. The task of the Regression algorithm is to find the mapping function to map the input
variable(x) to the continuous output variable(y).
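The contrast between a discrete output and a continuous output can be sketched in Python. The example below is only an illustration under assumed data: the regression part fits a straight line by least squares, and the classification part uses a simple nearest-average rule. The function names, house prices and email word counts are all hypothetical.

```python
def fit_line(xs, ys):
    """Regression: find slope and intercept of the least-squares line y = slope*x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    intercept = mean_y - slope * mean_x
    return slope, intercept


def fit_centroids(samples):
    """Classification training: average feature value for each label."""
    groups = {}
    for value, label in samples:
        groups.setdefault(label, []).append(value)
    return {label: sum(vals) / len(vals) for label, vals in groups.items()}


def classify(value, centroids):
    """Classification: return the label whose average is closest to the value."""
    return min(centroids, key=lambda label: abs(value - centroids[label]))


# Regression: continuous output (hypothetical house area -> price).
areas = [50, 80, 100, 120]
prices = [100, 160, 200, 240]
slope, intercept = fit_line(areas, prices)
predicted_price = slope * 90 + intercept

# Classification: discrete output (hypothetical count of suspicious words -> label).
emails = [(0, "not spam"), (1, "not spam"), (6, "spam"), (8, "spam")]
centroids = fit_centroids(emails)
label = classify(5, centroids)

print(predicted_price, label)
```

The regression result is a number that can take any value, while the classification result is always one of the fixed labels seen in training.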
9. Five sustainable Development Goals are mentioned below. Write 2 problems under each goal
that you think should be addressed for achieving the goal.
a. Quality Education
b. Reduced Inequalities
c. Life on Land
d. No Poverty
e. Clean Water and Sanitation
a. Quality Education:
i. Providing education remotely, leveraging hi-tech, low-tech and no-tech approaches;
ii. Ensure coordinated responses and avoid overlapping efforts;
iii. Ensuring return of students to school when they reopen to avoid an upsurge in dropout rates.
b. Reduced inequalities:
i. Reduction of relative economic inequality in some countries having the poorest and most
vulnerable communities.
ii. Improving the situations in countries with weaker health systems.
c. Life on Land:
i. Prevention of Deforestation caused by humans and restoration of land
ii. Prevention and cure of diseases that are transmissible between animals and humans
d. No Poverty
i. Creation of Strong social protection systems to prevent people from falling into poverty
ii. Reduction of social exclusion, and high vulnerability of certain populations to disasters and
diseases.
iii. Responsible distribution of resources.
3. How can you figure out the data using problem scoping?
Answer –
a. You need to acquire data which will become the base of your project.
b. Collect data from various reliable and authentic sources
c. After exploring the patterns, you can decide upon the type of model you would build to achieve the
goal.
d. You can test the selected models and figure out which is the most efficient one.
e. The most efficient model is now the base of your AI project and you can develop your algorithm
around it
f. Once the modelling is complete, you now need to test your model on some newly fetched data. The
results will help you in evaluating your model and improving it.
11. What is the difference between Training Data & Testing Data?
Answer – In machine learning, a dataset is divided into two subsets. The first subset, referred to
as the training data, is the portion of the actual dataset that is used to train a machine learning model.
The second subset is the testing data: once your machine learning model is built, you need unseen
data to test your model, and this data is called testing data.
Note – A common split uses about 80% of the whole data for training and about 20% for testing.
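The split described above can be sketched in Python using only the standard library. The function name, the default 20% test fraction and the fixed seed are choices made for this illustration; other ratios are also used in practice.

```python
import random


def train_test_split(records, test_fraction=0.2, seed=42):
    """Shuffle the records and split them into training and testing data.

    The seed only makes the shuffle repeatable; any value works.
    """
    shuffled = list(records)  # copy so the original list is not changed
    random.Random(seed).shuffle(shuffled)
    test_size = int(round(len(shuffled) * test_fraction))
    testing = shuffled[:test_size]
    training = shuffled[test_size:]
    return training, testing


# Hypothetical dataset of 100 records, numbered 1 to 100.
data = list(range(1, 101))
training_data, testing_data = train_test_split(data)
print(len(training_data), len(testing_data))
```

Shuffling before splitting keeps both subsets representative of the whole dataset, and no record appears in both, so the testing data stays unseen by the model.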