Board QP Solution and Notes
Board QP Solution and Notes
False
13. Data samples are essential for understanding the efficiency of a model.
True
14. Accuracy is measured by comparing the chatbot’s answers with predefined
outputs.
True
15. Relevance of answers is not important for the evaluation of a chatbot.
False
16. Underfitting model’s accuracy is lower.
True
17. Overfitting model’s accuracy is high.
False
18. Script bots are deployed in the customer care section of various companies.
True
(1 Mark Questions)
Q3. Expand the term?
a) NLP
b) CBT
Ans. To extract specific, important information from large datasets and provide an
overview while avoiding redundancy and maximizing content diversity.
Ans. It helps companies understand customer opinions, assess their reputation, and gain
insights into purchasing decisions by analysing online sentiments.
Q8. Which technology allows virtual assistants to understand and respond to spoken
commands?
Ans. A perfect fit occurs when a model matches well with the true function.
Ans. Their job is to answer some basic queries that they are coded for and connect them
to human executives once they are unable to handle the conversation.
Ans. Whole textual data from all the documents altogether is known as corpus.
Q12. What does sentence segmentation mean?
Ans. Dividing the whole corpus into sentences is known as sentence segmentation.
Ans. Tokens is a term used for any word or number or special character occurring in a
sentence.
Ans. Stopwords are the words which occur very frequently in the corpus but do not add
any value to it.
(2 Marks Questions)
Q1. How does sentiment analysis help companies understand customer feedback and
brand reputation?
Ans. Identifies emotions and opinions in text, helping companies understand customer
feedback and brand reputation.
Ans. One example is spam filters in emails. Text classification helps separate unwanted
emails (spam) from important ones, making email management easier.
Q3. Discuss the role of NLP in understanding emotional meaning from social media data.
Ans. NLP helps analyse feelings and opinions in social media posts. This allows
companies to understand how people feel about their brand or products and make
better decisions.
Ans.
a) Happens when a model learns the training data too well, including its noise, and
performs poorly on new data.
b) Underfitting occurs when a model’s output does not match the true function at all.
Its accuracy is lower.
Q7. Write any two differences between Script bot and Smart bot.
Ans.
Script-bot Smart-bot
Script bots work around a script which is Smart bots work on bigger databases and
programmed in them. other resources directly
No or little language processing skills Coding is required to take this up on board
Limited functionality Wide functionality
Ans. Automatic summarization picks out the important information from large data to
extract key information, such as creating summaries of news articles or blogs.
Q9. How Stemming is different from Lemmatization? Explain with one example of each.
Ans. Stemming is a text normalization technique used in NLP to reduce words to their
root forms. It often results in incomplete or incorrect root forms
Example:
Word: "studies"
Stemmed form of word: "studi"
1. _______________________
2. ______________________
Ans. 1. Lemmatization
2. Stemming
Chapter: Evaluation
Q1. Fill in the blanks:
1. If Prediction and Reality both matches to Yes, then this condition is termed
as _____________.
True Positive
2. If Prediction and Reality both matches to No, then this condition is termed
as _____________.
True Negative
3. When Prediction is Yes and Reality is No. This condition is termed as
_____________.
False Positive
4. When Prediction is No and Reality is Yes. This condition is termed as
_____________.
5. False Negative
(1 Mark Questions)
Q3. Why confusion matrix is important?
Q3. What is the ideal value for the F1 Score, and what is the range of F1 Score values?
Ans. The ideal value for the F1 Score is 1 (100%), and the F1 Score ranges from 0 to 1.
(2 Marks Questions)
Q1. With reference to evaluation stage of AI project cycle, explain the term Precision.
Also give the formula to calculate it.
Ans. Precision is defined as the percentage of true positive cases out of all the cases
where the model predicted the positive class (i.e., true positives + false positives).
The formula for precision is:
Q2. With reference to evaluation stage of AI project cycle, explain the term Recall. Also
give the formula to calculate it.
Ans. Recall is the fraction of actual positive cases that are correctly identified by the
model.
Formula:
Q3. With reference to evaluation stage of AI, give any 2 real life scenarios when Precision
is More Important than Recall.
Example scenarios:
1. Spam detection: You don’t want to classify important emails as spam.
2. Medical diagnosis: Misdiagnosing a healthy person (false positive) as having a
disease can lead to unnecessary stress and expensive treatments.
Q4. With reference to evaluation stage of AI, give any 2 real life scenarios when Recall is
More Important than Precision.
Ans. Recall is important when false negatives (failing to detect positives) are more
harmful.
Example scenarios:
1. Cancer detection: Missing a cancer diagnosis (false negative) could lead to
severe consequences for the patient.
2. Fraud detection: Missing fraudulent activities can result in significant financial
losses.
Q5. With reference to evaluation stage of AI project cycle, explain the term F1 Score.
Also give the formula to calculate it.
Ans. F1 score is the measure of balance (harmonic mean) between precision and recall.
Formula:
(4 Marks Questions)
Q1. Suppose a model predicts the positive cases with the following outcomes:
• True Positives (TP): 30
• False Positives (FP): 10
• False Negatives (FN): 20
What will be the F1 score for the given problem?
Step 3: F1 Score
The F1 Score is 0.67 in this case, indicating a balanced performance of the model.
ARTIFICIAL INTELLIGENCE
BOARD PAPER 2022
Section-A
(1 Mark Questions)
Ans. Wage employment is a type of employment where a person works for someone else
in exchange for wages or salary.
Ans. Food insecurity, overproduction, unsustainable agricultural practices, and food waste
contribute to issues in sustainable development. Solutions include sustainable
farming, reducing waste, and equitable food distribution.
3. "To get success, every business idea needs to be unique or special." Is the
above statement a myth or fact?
Ans. Myth.
Explanation:
Success depends on execution, meeting customer needs, and market conditions,
not just uniqueness.
Ans. The major purpose of SDGs is to eliminate poverty, reduce inequality, promote
sustainability, and ensure global prosperity and peace by 2030.
(2 Marks Questions)
Ans. 1. Decision-making
An entrepreneur makes decisions what to produce or sell
2. Managing the Business
The entrepreneur plans for the future of his / her business
3. Taking Risk
Entrepreneurs take risks to protect the business from losses.
4. Innovation
Entrepreneurs innovate by trying new ideas or methods to increase income.
6. Sustainable development can actually happen only when each one of us works
towards it. Mention any four ways which we can do at our end to reduce
inequality.
Ans. 1. Promoting education for all
2. Ensuring gender equality
3. Encouraging fair wages
4. Supporting marginalized communities
Section – B
(1 Mark Questions)
7. What is NLP?
Ans. 1. Chatbots,
2. Virtual assistants
3. Sentiment analysis
4. Machine translation.
(any two)
Ans. True Positive (TP) refers to cases where the model correctly predicts a positive
outcome when it is actually positive.
Ans. The F1 score is the harmonic mean of precision and recall, providing a single metric
for evaluating model performance.
(2 Marks Questions)
Ans. The Evaluation stage assesses the performance and reliability of the AI model by
using metrics such as accuracy, precision, recall, and F1 score. It ensures the model
meets the desired objectives.
15. What is Tokenization? Count how many tokens are present in the following
statement:
I find that the harder I work, the more luck I seem to have.
Ans. Tokenization is the process of splitting text into smaller units, such as words or
sentences, called tokens.
Token Count:
Tokens: [I, find, that, the, harder, I, work, the, more, luck, I, seem, to, have]
Count: 14
16. Kaira, a beginner in the field of NLP is trying to understand the process of Stemming.
Help her in filling up the following table by suggesting appropriate affixes and stem
of the words mentioned there:
Ans.
S. No. Word Affixes Stem
i. Tries -es Try
ii. Learning -ing Learn
Explanation:
• Tries: The affix -es is removed to derive the root word Try.
• Learning: The affix -ing is removed to derive the root word Learn.
17. With reference to evaluation stage of AI project cycle, explain the term Accuracy.
Also give the formula to calculate it.
Ans. Accuracy:
Accuracy is a metric used in the evaluation stage of the AI project cycle that measure
the percentage of correct predictions out of all the observations.
Formula:
or
where:
• TP = True Positives
• TN = True Negatives
• FP = False Positives
• FN = False Negatives
18. Explain the following picture which depicts one of the processes on NLP.
Also mention the purpose which will be achieved by this process.
Ans. The image depicts the process of converting text to a common case, typically
lowercase, in NLP. This ensures that words with different capitalizations (e.g., "HELLO",
"HeLLo") are treated as the same, reducing case sensitivity.
Section - C
(COMPETENCY-BASED QUESTIONS)
(3 Marks Questions)
Apply all the four steps of Bag of words model of NLP on the above given documents
and generate the output.
Document 2:
Sahil likes to play cricket Sajal too also basketball
0 1 0 0 1 1 1 0 0
Document 3:
Sahil likes to play cricket Sajal too also basketball
0 1 1 1 0 1 0 1 1
20. With reference to NLP, explain the following terms in detail with the help of suitable
example:
• Term frequency
• Inverse Document Frequency
Formula:
21. Traffic Jams have become a common part of our lives now-a-days. Living in an urban
area means you have to face traffic each and every time you get out on the road.
Mostly, school students opt for buses to go to school. Many times, the bus gets late
due to such jams and students are not able to reach their school on time. Thus, an AI
model is created to predict explicitly if there would be a traffic jam on their way to
school or not. The confusion matrix for the same is
The Confusion Matrix Actual: 1 Actual: 0
Predicted: 1 50 50
Predicted: 0 0 0
Ans.
ARTIFICIAL INTELLIGENCE
BOARD PAPER 2023
Section-A
Objective Type Questions
(1 Mark Questions)
ii) This is a fact that all human beings have all nine types of intelligences, but at
different levels.
Name any two such intelligences.
(iv) The information extracted through data science can be used to make
decisions about it:
o This is correct. Data science involves extracting insights from data, which are
then used to make informed decisions.
(iv) During Data Acquisition, feeding previous data into the machine is called:
(a) Training Data
(b) Predicting Data
(c) Testing Data
(d) Evaluating Data
(v) Regression is one of the type of supervised learning model, where data is
classified according to labels and data need not to be continuous.
(True/False)
Ans. False
Reason:
In regression, the output is continuous rather than discrete, unlike
classification where output is discrete.
(iii) For _______ the whole corpus is divided into sentences. Each sentence is
taken as a different data so now the whole corpus gets reduced to
sentences.
(a) Text Regulation
(b) Sentence Segmentation
(c) Tokenisation
(d) Stemming
(iv) ___________ helps to find the best model that represents our data and
how well the chosen model will work in future.
Ans. Evaluation
Explanation:
Evaluation is the process that helps to determine the best model that fits the
data and assesses its performance for future predictions or tasks. It ensures
the model's reliability and efficiency.
Thus, recall considers True Positives (TP) and False Negatives (FN).
(vi) With reference to NLP, consider the following plot of occurrence of words
versus their value:
Explanation:
• Stop Words: High occurrence, low value (e.g., the, is, and).
• Frequent Words: Moderate occurrence, moderate value.
• Rare/Valuable Words (X): Low occurrence, high value, often carrying the
key meaning of the text.
(ii) Two conditions when prediction matches with the reality are true positive and
____________.
Ans. true negative.
(iii) Which of the following is the correct feature of Neural network?
(a) It can improve the efficiency of two models.
(b) It is useful with small dataset.
(c) They are modelled on human brains and nervous system.
(d) They need human intervention.
Ans. (c) They are modelled on human brains and nervous system.
(v) Under __________, one looks at various parameters which affect the problem
we wish to solve, as this would make many lives better.
Ans. ethics
(vi) In this learning model, the data set which is fed to the machine is labelled.
Name the model.
Ans. Token
(ii) When the prediction matches the reality, the condition is termed as
____________.
(iii) Smart Assistants such as Alexa, Siri are the examples of:
(a) Natural Language Processing
(b) Data Science
(c) Machine Learning
(d) Computer Vision
(vi) Which of the following talks about how true the predictions are by any model?
(a) Accuracy
(b) Reliability
(c) Recall
(d) F1 score
Section - B
Subjective Type Questions
(2 Marks Questions)
Ans. Meditation helps in proper blood circulation, reducing anxiety and relaxing the body
which collectively reduce stress levels.
Q9. Gurmeet has just bought a new computer for his office. Suggest him any two points
which he should keep in mind to prevent his computer from virus infection.
Q10. Define the term agricultural entrepreneurship. How are farmers benefitted from it?
Benefits to farmers:
1. Increased Income: Adoption of modern techniques and value-added products
increases profitability.
2. Market Access: Entrepreneurs help farmers connect directly with markets,
reducing dependency on intermediaries.
Ans. A confusion matrix is a table that records the result of comparison between the
prediction and reality.
Q15. Face lock feature of a smartphone is an example of computer vision. Briefly discuss
this feature.
Ans. The face lock feature uses computer vision to identify facial patterns via camera,
comparing them with stored biometric data to authenticate and unlock the
smartphone.
Q16. With reference to data processing, expand the term TFIDF. Also give any two
applications of TFIDF.
(4 Marks Questions)
Answer the given questions in 50-80 words each.
Q17. Ms. Sooji is a beginner in the field of Artificial Intelligence. She got confused among
the core terms like Artificial Intelligence (AI), Machine Learning (ML) and Deep
Learning (DL). Many a times, these terms are used interchangeably but are they the
same? Justify your answer. Help her in understanding these terms by drawing a well
labelled diagram to depict the interconnection of these three fields.
AI
ML
DL
Q18. What is the significance of AI project cycle? Also explain in detail about how Data
Acquisition is different from data exploration.
Ans. The AI project cycle provides a structured approach for building AI systems, ensuring
efficient and effective solutions.
Data Acquisition involves collecting raw data from various sources, while Data
Exploration focuses on understanding the data by cleaning, visualizing, and analysing
it for patterns or anomalies. Data exploration ensures the data is ready for modelling,
making it critical for AI project success.
Q19. Create a document vector table from the following documents by implementing all
the four steps of Bag of words model. Also depict the outcome of each step.
Document 1: Sameera and Sanya are classmates.
Document 2: Sameera likes dancing but Sanya loves to study mathematics.
Document 2:
Sameera and Sanya are classmates likes dancing but loves to study mathematics
1 0 1 0 0 1 1 1 1 1 1 1
OR
Sameera 1 1
and 1 0
Sanya 1 1
are 1 0
classmates 1 0
likes 0 1
dancing 0 1
but 0 1
loves 0 1
to 0 1
study 0 1
mathematics 0 1
Q20. Will it be valid to say that not all the devices which are termed as "smart" are AI-
enabled? Justify this statement. Explain any two examples from the daily life which are
commonly misunderstood as AI.
Ans. Not all "smart" devices are AI-enabled, as some rely on pre-programmed responses
or basic automation rather than intelligent learning. For instance:
1. A fully automatic Washing Machines: It works on its own, but it requires human
intervention to select the parameters of washing.
2. Wi-Fi enabled AC: It can be turned on and off remotely with the help of internet
but does not have decision-making abilities. This is an example of Internet of
Things (IoT)
Q21. Recently the country was shaken up by a series of earthquakes which has done a huge
damage to the people as well as the infrastructure. To address this issue, an AI model
has been created which can predict if there is a chance of earthquake or not. The
confusion matrix for the same is:
Reality
Confusion Matrix
Yes No
Yes 50 05
Predicted
No 25 20
(i) How many total cases are True Negative in the above scenario?
(ii) Calculate precision, recall and F1 score.
(ii)
ARTIFICIAL INTELLIGENCE
BOARD PAPER 2024
Section-A
Objective Type Questions
(1 Mark Questions)
(iv) The most important software in any computer is the ________. This is the
software that starts working as soon as we switch on a computer.
(a) Web Browsers
(b) Operating System
(c) Office Software
(d) Designing Software
(v) Which of the following types of communication takes place when one
individual address a large gathering?
(a) Written communication
(b) Public communication
(c) Small group communication
(d) Interpersonal communication
Ans. (b) Public communication
(vi) Assertion (A) : Organic farming technique is an example of a green skill that is
essential for sustainable agriculture.
Reason (R): Organic farming technique prioritise environment friendly and
sustainable practices such as using natural fertilisers, avoiding synthetic
pesticides, and promoting soil health.
(a) Both (A) and (R) are true and (R) is the correct explanation for (A).
(b) Both (A) and (R) are true and (R) is not the correct explanation of (A).
(c) (A) is true, but (R) is false.
(d) (A) is false, but (R) is true.
Ans. (a) Both (A) and (R) are true, and (R) is the correct explanation for (A).
(ii) This real-life application of NLP is used to provide an overview of a news item
or blog post, while avoiding redundancy from multiple sources and
maximising the diversity of content obtained. Which is this application?
(a) Chatbot
(b) Virtual Assistant
(c) Sentiment Analysis
(d) Automatic Summarisation
(iii) Which of the following represent a machine that is smart but not considered
Artificial Intelligence (AI) enabled?
(a) A robotic vacuum cleaner that can navigate and clean floors
autonomously.
(b) A chatbot that engages in natural language conversations and answers
questions.
(c) A smartphone with facial recognition for unlocking the device.
(d) A digital alarm clock that rings at a set time every morning.
Ans. (d) A digital alarm clock that rings at a set time every morning.
(iv) Which of the following words represent an example of a lemma resulting from
lemmatisation for "caring" in context to Natural Language Processing (NLP)?
(a) Care
(b) Cared
(c) Cares
(d) Car
Ans. (c) Describes the level of self-awareness someone has, starting from realizing
weaknesses, strengths, to recognizing their own feelings.
(vi) For Data Science, usually the data is collected in the form of tables. These
tabular datasets can be stored in different formats. Which of the following
formats is not used for storing data in a tabular format?
(a) CSV
(b) Website
(c) SQL
(d) Spreadsheet
Ans. (c) Problem Scoping, Data Acquisition, Data Exploration, Modelling, Evaluation
(iii) __________ is a concept to unify statistics, data analysis, machine learning and
their related methods in order to understand and analyse actual phenomena
with data.
(a) Computer Vision
(b) Natural Language Processing
(c) Data Science
(d) Computer Science
(iv) In computer vision which of the following tasks is used for multiple objects?
(a) Classification
(b) Classification + Localisation
(c) Instance Segmentation
(d) Localisation
(v) In spam email detection, which of the following will be considered as"False
Negative" ?
(a) When a legitimate email is accurately identified as not spam.
(b) When a spam email is mistakenly identified as legitimate.
(c) When an email is accurately recognised as spam.
(d) When an email is inaccurately labelled as important.
(vi) Which of the following applications is not associated with Natural Language
Processing (NLP)?
(a) Sentiment Analysis
(b) Speech Recognition
(c) Spam Filtering in emails
(d) Stock Market Analysis
(ii) Which form of unsupervised learning does the following diagram indicate?
(a) Clustering
(b) Regression
(c) Reinforcement learning
(d) Classification
(iii) Bag of Words is a model which helps in extracting features out of the text which
can be helpful in machine learning algorithms.
(a) Data Science (DS)
(b) Virtual Reality (VR)
(c) Natural Language Processing (NLP)
(d) Computer Vision (CV)
Ans. (d) A music streaming platform that suggests songs and playlists based on user
listening history.
Ans. (c) To assess how well the chosen model will work in future.
Ans. (a) Both (A) and (R) are true, and (R) is the correct explanation for (A).
(ii) When a machine possesses the ability to mimic human traits, i.e., make
decisions, predict the future, learn, and improve on its own, it is said to have:
(a) Computational Skills
(b) Learning Capability
(c) Artificial Intelligence
(d) Cognitive Processing
(iv) The concept of ________ is used to apply face filters on various social media
platforms.
(a) NLP
(b) Computer Vision
(c) Data Science
(d) Block chain Technology
(v) The 4 W's Problem Canvas helps in identifying the key elements related to the
given problem.
Which of the following is NOT one of the blocks of the Problem Canvas?
(a) When
(b) Where
(c) What
(d) Why
(vi) Which domain of AI is used for interacting with virtual assistants such as Siri
and Alexa?
(a) Machine Learning (ML)
(b) Computer Vision (CV)
(c) Natural Language Processing (NLP)
(d) Technical Vision (TV)
Section B
(Subjective Type Questions)
8. What is the importance of time management in effectively dealing with stress? Provide
any one strategy for improving time management skills to reduce stress.
Ans. Time management helps prioritize tasks, avoiding last-minute pressure and reducing
stress.
Strategy: Use a to-do list to organize tasks and set deadlines.
9. Mention any two measures that individuals or organisations can take to protect their
data from theft and viruses.
10. The method of communication that you choose could affect the relationship with your
peers, superiors, and customers. Write the four factors on the basis of which you can
choose the right method of communication.
12. What are the primary differences between Script-bots and Smart-bots?
13. What do you mean by Evaluation of an AI model? Also explain the concept of
overfitting with respect to AI model Evaluation.
Ans. Supervised learning is preferred as it uses labelled data to predict disease outbreaks
accurately, based on medical records.
15. What role does data play in AI based applications? Name any two sources of online
data collection for building any AI based application.
Ans. Data is used to train AI models to identify patterns and make decisions.
Sources: APIs (e.g., Twitter API), Open datasets (e.g., Kaggle), Open-sourced
Government Portals (any two)
Ans. 1. Grayscale images use shades of Gray, with one intensity channel.
2. RGB images combine red, green, and blue channels to display colours.
Ans. Neural networks are computational models inspired by the human brain, used for
tasks like classification and regression.
• Input Layer: Receives raw data for processing.
• Hidden Layer(s): Perform calculations and extract features.
• Output Layer: Produces the result or prediction.
18. Give any four examples of applications of AI that we see around us.
["ML", "and", "DL", "are", "part", "of", "AI", "is", "a", "subset"]
Document 1:
ML and DL are part of AI is a subset
1 1 1 1 1 1 1 0 0 0
Document 2:
ML and DL are part of AI is a subset
1 0 1 0 0 1 0 1 1 1
OR
Word Doc 1 Doc 2
ML 1 1
and 1 0
DL 1 1
are 1 0
part 1 0
of 1 1
AI 1 0
is 0 1
a 0 1
subset 0 1
20. Consider the following graphs (Figure 1 and Figure 2) that demonstrate the two types
of Supervised Learning Models of Artificial Intelligence. Identify and explain each
model giving suitable examples of each.
Ans. Figure 1:
Classification Model: Categorizes data into distinct classes, like spam detection or
image recognition.
Figure 2:
Regression Model: Predicts continuous values, such as house prices or stock market
trends.
21. A binary classification model has been developed to classify news articles as either
"Fake News" or "Real News". The model was tested on a dataset of 500 news articles,
and the resulting confusion matrix is as follows:
Reality
Confusion Matrix
Yes No
Yes 45 15
Predicted
No 20 420
(A) How many total cases are True Negative in the above scenario?
(B) Calculate Precision, Recall and F1-Score.
Ans.