Ai 2-Marks
UNIT 3
1. What is Reinforcement Learning?
Reinforcement Learning is a feedback-based machine learning technique in which an agent learns to
behave in an environment by performing actions and observing their results. For each good action,
the agent gets positive feedback (a reward), and for each bad action, it gets negative feedback (a penalty).
9. What is meant by passive and active reinforcement learning and how do we compare the two?
Both active and passive reinforcement learning are types of RL. In passive RL, the agent's policy
is fixed, which means that it is told what to do. In active RL, by contrast, the agent must decide
what to do, as there is no fixed policy that it can act on. Therefore, the goal of a passive RL agent is to
execute a fixed policy (a mapping from states to actions) and evaluate it, while the goal of an active RL
agent is to act and learn an optimal policy.
10. What is Adaptive Dynamic Programming (ADP)?
ADP is a smarter method than Direct Utility Estimation: it runs trials to learn a model of the environment and
estimates the utility of a state as the sum of the reward for being in that state and the expected discounted utility
of the next state:
Uπ(s) = R(s) + γ Σs' P(s'|s, π(s)) Uπ(s')
where R(s) = reward for being in state s, P(s'|s, π(s)) = transition model, γ = discount factor and Uπ(s') = utility of
being in the next state s'.
It can be solved using the value-iteration algorithm. The algorithm converges quickly but can become costly to
compute for large state spaces. ADP is a model-based approach: it needs the transition model of the
environment, which it estimates from the trials. A model-free alternative is Temporal Difference Learning.
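The ADP idea can be illustrated with a minimal Python sketch. It assumes a hypothetical three-state chain (A, B, GOAL) followed under a fixed policy; the state names, rewards, trial data and function names are illustrative assumptions and do not come from these notes. The sketch estimates the transition model from counts and then repeatedly applies the utility equation until the values settle.

```python
from collections import defaultdict

def learn_model(trials):
    """Estimate P(s'|s) under the fixed policy, and R(s), from observed trials.

    Each trial is a list of (state, reward) pairs in the order visited.
    """
    counts = defaultdict(lambda: defaultdict(int))
    rewards = {}
    for trial in trials:
        for (s, r), (s_next, _) in zip(trial, trial[1:]):
            counts[s][s_next] += 1
            rewards[s] = r
        last_state, last_reward = trial[-1]
        rewards[last_state] = last_reward
    model = {}
    for s, nexts in counts.items():
        total = sum(nexts.values())
        model[s] = {s2: n / total for s2, n in nexts.items()}
    return model, rewards

def evaluate_policy(model, rewards, gamma=0.9, sweeps=100):
    """Repeatedly apply U(s) = R(s) + gamma * sum over s' of P(s'|s) * U(s')."""
    utilities = {s: 0.0 for s in rewards}
    for _ in range(sweeps):
        utilities = {
            s: rewards[s] + gamma * sum(
                p * utilities[s2] for s2, p in model.get(s, {}).items()
            )
            for s in rewards
        }
    return utilities

# Hypothetical trials: the fixed policy always moves A -> B -> GOAL.
trials = [
    [("A", -0.1), ("B", -0.1), ("GOAL", 1.0)],
    [("A", -0.1), ("B", -0.1), ("GOAL", 1.0)],
]
model, rewards = learn_model(trials)
utilities = evaluate_policy(model, rewards)
```

In this toy case the terminal state keeps its own reward as its utility, and the utilities of B and A follow from it by one and two applications of the equation.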
11. What is Temporal Difference Learning (TD)?
TD learning does not require the agent to learn the transition model. The update occurs between successive states,
and the agent only updates the states that are directly affected by the observed transition.
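A common form of this update is the TD(0) rule, Uπ(s) ← Uπ(s) + α (R(s) + γ Uπ(s') − Uπ(s)), where α is a learning rate. The Python sketch below applies it to a hypothetical chain A -> B -> GOAL; the state names, rewards, α and γ values and the function name are assumptions chosen for illustration only.

```python
def td_update(utilities, s, reward, s_next, alpha=0.1, gamma=0.9):
    """Apply one TD(0) update to U(s) after observing the transition s -> s_next.

    Only the utility of s changes; no transition model is needed.
    """
    u_s = utilities.get(s, 0.0)
    u_next = utilities.get(s_next, 0.0)
    utilities[s] = u_s + alpha * (reward + gamma * u_next - u_s)
    return utilities[s]

# Hypothetical setup: the terminal state's utility is fixed at its reward.
utilities = {"GOAL": 1.0}
for _ in range(2000):
    td_update(utilities, "B", -0.1, "GOAL")  # observed transition B -> GOAL
    td_update(utilities, "A", -0.1, "B")     # observed transition A -> B
```

With deterministic transitions like these, the estimates approach the values that satisfy the utility equation for the fixed policy, without the agent ever storing P(s'|s, π(s)).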
A language can be defined as a set of strings. “print(2 + 2)” is a legal program in the language Python, whereas
“2)+(2 print” is not. Since there are an infinite number of legal programs, they cannot be enumerated; instead they
are specified by a set of rules called a grammar. Formal languages also have rules that define the meaning or
semantics of a program.
An N-gram language model assigns a probability to each N-gram (a sequence of N words) in the
language. If we have a good N-gram model, we can predict p(w | h), the probability of seeing the word w
given a history of previous words h, where the history contains N-1 words.
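As a small illustration, the Python sketch below builds a bigram model (N = 2, so the history h is a single word) from a toy corpus and estimates p(w | h) by relative frequency. The corpus and the function names are assumptions made for this example, and no smoothing is applied.

```python
from collections import Counter, defaultdict

def train_bigram(tokens):
    """Count how often each word follows each one-word history."""
    counts = defaultdict(Counter)
    for prev, word in zip(tokens, tokens[1:]):
        counts[prev][word] += 1
    return counts

def prob(counts, word, history):
    """Estimate p(word | history) as count(history, word) / count(history, *)."""
    following = counts.get(history, Counter())
    total = sum(following.values())
    if total == 0:
        return 0.0  # unseen history: no estimate without smoothing
    return following[word] / total

tokens = "the cat sat on the mat the cat ran".split()
counts = train_bigram(tokens)
p_cat = prob(counts, "cat", "the")  # "the" is followed by "cat" 2 times out of 3
p_mat = prob(counts, "mat", "the")  # and by "mat" 1 time out of 3
```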
Adaptive dynamic programming (ADP) learns a model and a reward function from observations and then uses
value or policy iteration to obtain the utilities or an optimal policy. ADP makes optimal use of the local constraints
on utilities of states imposed through the neighborhood structure of the environment.
17. What are the merits and demerits of Direct Utility Estimation?
Advantages
It eventually computes the correct average values, using just sample transitions
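Disadvantages
It ignores the fact that the utilities of neighbouring states are connected (the constraint that ADP exploits), so it
often converges slowly.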
18. What is NLP?
NLP stands for Natural Language Processing, a field at the intersection of computer science, linguistics (human
language), and artificial intelligence. It is the technology that machines use to understand, analyse, manipulate,
and interpret human languages. It helps developers to organize knowledge for performing tasks such as translation,
automatic summarization, Named Entity Recognition (NER), speech recognition, relationship extraction, and topic
segmentation.
NLP lets users ask questions about any subject and get a direct response within seconds.
NLP offers exact answers to a question; that is, it does not return unnecessary or unwanted information.
Many companies use NLP to improve the efficiency and accuracy of documentation processes and to identify
information in large databases.
1. Question Answering
Question Answering focuses on building systems that automatically answer the questions asked by humans in a
natural language.
2. Spam Detection
3. Sentiment Analysis
Sentiment Analysis is also known as opinion mining. It is used on the web to analyse the attitude, behaviour, and
emotional state of the sender. This application is implemented through a combination of NLP (Natural Language
Processing) and statistics: values are assigned to the text (positive, negative, or neutral) and the mood of the
context (happy, sad, angry, etc.) is identified.
4. Machine Translation
Machine translation is used to translate text or speech from one natural language to another natural language.
5. Spelling correction
Microsoft Corporation provides software such as MS Word and PowerPoint that performs spelling correction.
6. Information extraction
Information extraction is one of the most important applications of NLP. It is used for extracting structured
information from unstructured or semi-structured machine-readable documents.
UNIT – IV
1. Define Language.
Language enables us to communicate most of what we have observed
about the environment.
7. Define Parsing.
Parsing is the process of finding a parse tree for a given input string.
It is also known as syntactic analysis.
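To make the definition concrete, here is a minimal Python sketch of a recursive-descent parser for a toy grammar: S -> NP VP, NP -> Det N, VP -> V NP. The grammar, the lexicon and the function names are hypothetical and chosen only for illustration. The parse tree is returned as nested tuples, and a string outside the language yields None.

```python
# Hypothetical toy lexicon mapping words to their categories.
LEXICON = {
    "the": "Det", "a": "Det",
    "dog": "N", "cat": "N",
    "sees": "V", "chases": "V",
}

def parse_np(words, i):
    """NP -> Det N. Return (tree, next_index), or None if no NP starts at i."""
    if (i + 1 < len(words)
            and LEXICON.get(words[i]) == "Det"
            and LEXICON.get(words[i + 1]) == "N"):
        return ("NP", ("Det", words[i]), ("N", words[i + 1])), i + 2
    return None

def parse_vp(words, i):
    """VP -> V NP. Return (tree, next_index), or None if no VP starts at i."""
    if i < len(words) and LEXICON.get(words[i]) == "V":
        result = parse_np(words, i + 1)
        if result:
            np_tree, j = result
            return ("VP", ("V", words[i]), np_tree), j
    return None

def parse_sentence(text):
    """S -> NP VP. Return a parse tree, or None if the input is not a sentence."""
    words = text.lower().split()
    result = parse_np(words, 0)
    if not result:
        return None
    np_tree, i = result
    result = parse_vp(words, i)
    if not result:
        return None
    vp_tree, j = result
    # The whole input must be consumed for the parse to succeed.
    return ("S", np_tree, vp_tree) if j == len(words) else None

tree = parse_sentence("the dog chases a cat")
```

Here `tree` has S at the root with an NP subtree for "the dog" and a VP subtree for "chases a cat", while an input such as "dog the chases" is rejected.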
UNIT – V
1. What exactly is a ROBOT?
A robot is a machine capable of sensing and interacting with its
environment.
2. Define manipulators.
In robotics, a manipulator is a device used to manipulate materials without direct
physical contact by the operator.
3. Define Active Sensor.
Active sensors involve the emission of energy by a sensor apparatus into the environment, which is then
reflected back in some manner to the robot.
Ultrasonic sensing and laser range finding are two common active sensor modalities.
▪ rotation counter
▪ light sensor
Passive sensors use energy naturally present in the environment to obtain information.