0% found this document useful (0 votes)
27 views41 pages

AI Study Note

Use this to educate yourself on how AI WORKS

Uploaded by

jeffrey chika
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
27 views41 pages

AI Study Note

Use this to educate yourself on how AI WORKS

Uploaded by

jeffrey chika
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 41

ARTIFICIAL INTELLIGENCE.

Artificial intelligence is the simulation of human intelligence processes by


machines, especially computer systems. Specific applications of AI include expert
systems, natural language processing, speech recognition and machine vision.
What Are the 4 Types of AI?
Artificial intelligence can be categorized into one of four types.
1. Reactive AI uses algorithms to optimize outputs based on a set of inputs.

Chess-playing AIs, for example, are reactive systems that optimize the best
strategy to win the game. Reactive AI tends to be fairly static, unable to
learn or adapt to novel situations. Thus, it will produce the same output
given identical inputs.
2. Limited memory AI can adapt to past experience or update itself based on

new observations or data. Often, the amount of updating is limited (hence


the name), and the length of memory is relatively short. Autonomous
vehicles, for example, can "read the road" and adapt to novel situations, even
"learning" from past experience.
3. Theory-of-mind AI are fully-adaptive and have an extensive ability to learn

and retain past experiences. These types of AI include advanced chat-bots


that could pass the Turing Test, fooling a person into believing the AI was a
human being. While advanced and impressive, these AI are not self-aware.
4. Self-aware AI, as the name suggests, become sentient and aware of their

own existence. Still in the realm of science fiction, some experts believe that
an AI will never become conscious or "alive".
What are the Applications of Artificial Intelligence?
Here is the list of the top 18 applications of AI (Artificial Intelligence):
1. AI Application in E-Commerce
Personalized Shopping
Artificial Intelligence technology is used to create recommendation engines
through which you can engage better with your customers. These
recommendations are made in accordance with their browsing history, preference,
and interests. It helps in improving your relationship with your customers and their
loyalty towards your brand.
AI-Powered Assistants
Virtual shopping assistants and chat bots help improve the user experience while
shopping online. Natural Language Processing is used to make the conversation
sound as human and personal as possible. Moreover, these assistants can have real-
time engagement with your customers. Did you know that on amazon.com, soon,
customer service could be handled by chat bots?
Fraud Prevention
Credit card frauds and fake reviews are two of the most significant issues that E-
Commerce companies deal with. By considering the usage patterns, AI can help
reduce the possibility of credit card fraud taking place. Many customers prefer to
buy a product or service based on customer reviews. AI can help identify and
handle fake reviews.
2. Applications of Artificial Intelligence in Education
Although the education sector is the one most influenced by humans, Artificial
Intelligence has slowly begun to seep its roots into the education sector as well.
Even in the education sector, this slow transition of Artificial Intelligence has
helped increase productivity among faculties and helped them concentrate more on
students than office or administration work.
Some of these applications in this sector include:
Administrative Tasks Automated to Aid Educators
Artificial Intelligence can help educators with non-educational tasks like task-
related duties like facilitating and automating personalized messages to students,
back-office tasks like grading paperwork, arranging and facilitating parent and
guardian interactions, routine issue feedback facilitating, managing enrollment,
courses, and HR-related topics.
Creating Smart Content
Digitization of content like video lectures, conferences, and textbook guides can be
made using Artificial Intelligence. We can apply different interfaces like
animations and learning content through customization for students from different
grades.
Artificial Intelligence helps create a rich learning experience by generating and
providing audio and video summaries and integral lesson plans.
Voice Assistants
Without even the direct involvement of the lecturer or the teacher, a student can
access extra learning material or assistance through Voice Assistants. Through this,
printing costs of temporary handbooks and also provide answers to very common
questions easily.
Personalized Learning
Using top AI technologies, hyper-personalization techniques can be used to
monitor students’ data thoroughly, and habits, lesson plans, reminders, study
guides, flash notes, frequency or revision, etc., can be easily generated.
3. Applications of Artificial Intelligence in Lifestyle
Artificial Intelligence has a lot of influence on our lifestyle. Let us discuss a few of
them.
 Autonomous Vehicles
Automobile manufacturing companies like Toyota, Audi, Volvo, and Tesla use
machine learning to train computers to think and evolve like humans when it
comes to driving in any environment and object detection to avoid accidents.
 Spam Filters
The email that we use in our day-to-day lives has AI that filters out spam emails
sending them to spam or trash folders, letting us see the filtered content only. The
popular email provider, Gmail, has managed to reach a filtration capacity of
approximately 99.9%.
 Facial Recognition
Our favorite devices like our phones, laptops, and PCs use facial recognition
techniques by using face filters to detect and identify in order to provide secure
access. Apart from personal usage, facial recognition is a widely used Artificial
Intelligence application even in high security-related areas in several industries.
 Recommendation System
Various platforms that we use in our daily lives like e-commerce,
entertainment websites, social media, video sharing platforms, like
YouTube, etc., all use the recommendation system to get user data and
provide customized recommendations to users to increase engagement. This
is a very widely used Artificial Intelligence application in almost all
industries.

4. Applications of Artificial Intelligence in Navigation


Based on research from MIT, GPS technology can provide users with accurate,
timely, and detailed information to improve safety. The technology uses a
combination of Convolutional Neural Networks and Graph Neural Networks,
which makes lives easier for users by automatically detecting the number of lanes
and road types behind obstructions on the roads. AI is heavily used by Uber and
many logistics companies to improve operational efficiency, analyze road traffic,
and optimize routes.

5. Applications of Artificial Intelligence in Robotics


Robotics is another field where Artificial Intelligence applications are commonly
used. Robots powered by AI use real-time updates to sense obstacles in its path and
pre-plan its journey instantly.
It can be used for:
 Carrying goods in hospitals, factories, and warehouses
 Cleaning offices and large equipment
 Inventory management

6. Applications of Artificial Intelligence in Human Resource


Did you know that companies use intelligent software to ease the hiring process?
Artificial Intelligence helps with blind hiring. Using machine learning software,
you can examine applications based on specific parameters. AI drive systems can
scan job candidates' profiles, and resumes to provide recruiters an understanding of
the talent pool they must choose from.
7. Applications of Artificial Intelligence in Healthcare
Artificial Intelligence finds diverse applications in the healthcare sector to build
sophisticated machines that can detect diseases and identify cancer cells. AI
applications are used in healthcare. Artificial Intelligence can help analyze chronic
conditions with lab and other medical data to ensure early diagnosis. AI uses the
combination of historical data and medical intelligence for the discovery of new
drugs.

8. Applications of Artificial Intelligence in Agriculture


Artificial Intelligence is used to identify defects and nutrient deficiencies in the
soil. This is done using computer vision, robotics, and machine learning
applications, AI can analyze where weeds are growing. AI bots can help to harvest
crops at a higher volume and faster pace than human laborers.

9. Applications of Artificial Intelligence in Gaming


Another sector where Artificial Intelligence applications have found prominence is
the gaming sector. AI can be used to create smart, human-like NPCs to interact
with the players.
It can also be used to predict human behavior using which game design and testing
can be improved. The Alien Isolation game released in 2014 uses AI to stalk the
player throughout the game. The game uses two Artificial Intelligence systems -
‘Director AI’ that frequently knows your location and the ‘Alien AI,’ driven by
sensors and behaviors that continuously hunt the player.
10. Applications of Artificial Intelligence in Automobiles
Artificial Intelligence is used to build self-driving vehicles. AI can be used along
with the vehicle’s camera, radar, cloud services, GPS, and control signals to
operate the vehicle. AI can improve the in-vehicle experience and provide
additional systems like emergency braking, blind-spot monitoring, and driver-
assist steering.

11. Applications of Artificial Intelligence in Social Media


 Instagram
On Instagram, AI considers your likes and the accounts you follow to determine
what posts you are shown on your explore tab.
 Facebook
Artificial Intelligence is also used along with a tool called Deep-Text. With this
tool, Facebook can understand conversations better. It can be used to translate
posts from different languages automatically.
 Twitter
AI is used by Twitter for fraud detection, for removing propaganda, and hateful
content. Twitter also uses AI to recommend tweets that users might enjoy, based
on what type of tweets they engage with.

12. Applications of Artificial Intelligence in Marketing


Artificial Intelligence (AI) applications are popular in the marketing domain as
well.
 Using AI, marketers can deliver highly targeted and personalized ads with
the help of behavioral analysis, and pattern recognition in ML, etc. It also
helps with retargeting audiences at the right time to ensure better results and
reduced feelings of distrust and annoyance.
 AI can help with content marketing in a way that matches the brand's style
and voice. It can be used to handle routine tasks like performance, campaign
reports, and much more.
 Chatbots powered by AI, Natural Language Processing (NLP), Natural
Language Generation (NLG), and Natural Language Understanding (NLU)
can analyze the user's language and respond in the ways humans do.
 AI can provide users with real-time personalization’s based on their
behavior and can be used to edit and optimize marketing campaigns to fit a
local market's needs.

13. Applications of Artificial Intelligence in Chatbots


AI chat bots can comprehend natural language and respond to people online who
use the "live chat" feature that many organizations provide for customer service.
AI chat bots are effective with the use of machine learning and can be integrated in
an array of websites and applications. AI chat bots can eventually build a database
of answers, in addition to pulling information from an established selection of
integrated answers. As AI continues to improve, these chat bots can effectively
resolve customer issues, respond to simple inquiries, improve customer service,
and provide 24/7 support. All in all, these AI chat bots can help to improve
customer satisfaction.

14. Applications of Artificial Intelligence in Finance


It has been reported that 80% of banks recognize the benefits that AI can provide.
Whether it’s personal finance, corporate finance, or consumer finance, the highly
evolved technology that is offered through AI can help to significantly improve a
wide range of financial services. For example, customers looking for help
regarding wealth management solutions can easily get the information they need
through SMS text messaging or online chat, all AI-powered. Artificial Intelligence
can also detect changes in transaction patterns and other potential red flags that can
signify fraud, which humans can easily miss, and thus saving businesses and
individuals from significant loss. Aside from fraud detection and task automation,
AI can also better predict and assess loan risks.
15. AI in Astronomy
Astronomy, on the other hand, is a largely unexplored topic that is just as
intriguing and thrilling as the rest. When it comes to astronomy, one of the most
difficult problems is analyzing the data. As a result, astronomers are turning to
machine learning and Artificial Intelligence (AI) to create new tools. Having said
that, consider how Artificial Intelligence has altered astronomy and is meeting the
demands of astronomers.
 The changing sky has captured everyone's attention as one of the most

astounding projects of all time. This project seeks to survey the whole night
sky every night, gathering over 80 terabytes of data in one go to study how
stars and galaxies in the cosmos change over time.

16. AI in Data Security


Data security, which is one of the most important assets of any tech-oriented firm,
is one of the most prevalent and critical applications of AI. With confidential data
ranging from consumer data (such as credit card information) to organizational
secrets kept online, data security is vital for any institution to satisfy both legal and
operational duties. This work is now as difficult as it is vital, and many businesses
deploy AI-based security solutions to keep their data out of the wrong hands.
Because the world is smarter and more connected than ever before, the function of
Artificial Intelligence in business is critical today. According to several estimates,
cyberattacks will get more tenacious over time, and security teams will need to rely
on AI solutions to keep systems and data under control.

 Identifies Unknown Threats


A human may not be able to recognize all of the hazards that a business confronts.
Every year, hackers launch hundreds of millions of assaults for a variety of
reasons. Unknown threats can cause severe network damage. Worse, they can have
an impact before you recognize, identify, and prevent them.
As attackers test different tactics ranging from malware assaults to sophisticated
malware assaults, contemporary solutions should be used to avoid them. Artificial
Intelligence has shown to be one of the most effective security solutions for
mapping and preventing unexpected threats from wreaking havoc on a corporation.
 Flaw Identification

AI assists in detecting data overflow in a buffer. When programs consume more


data than usual, this is referred to as ‘buffer overflow’. Aside from the fault
caused by human triggers breaking crucial data. These blunders are also observable
by AI, and they are detected in real-time, preventing future dangers.
AI can precisely discover cyber security weaknesses, faults, and other problems
using Machine Learning. Machine Learning also assists AI in identifying
questionable data provided by any application. Malware or virus used by hackers
to gain access to systems as well as steal data is carried out via programming
language flaws.
 Threat Prevention

Artificial Intelligence technology is constantly being developed by cyber security


vendors. In its advanced version, AI is designed to detect flaws in the system or
even the update. It’d instantly exclude anybody attempting to exploit those issues.
AI would be an outstanding tool for preventing any threat from occurring. It may
install additional firewalls as well as rectify code faults that lead to dangers.

17. AI in Travel and Transport


Intelligent technology has become a part of our daily lives in recent years. And, as
technology advances across society, new uses of AI, notably in transportation, are
becoming mainstream. This has created a new market for firms and entrepreneurs
to develop innovative solutions for making public transportation more comfortable,
accessible, and safe.
Intelligent transportation systems have the potential to become one of the most
effective methods to improve the quality of life for people all around the world.
There are multiple instances of similar systems in use in various sectors.
 Traffic Management

Clogged city streets are a key impediment to urban transportation all around the
world. Cities throughout the world have enlarged highways, erected bridges, and
established other modes of transportation such as train travel, yet the traffic
problem persists. However, AI advancements in traffic management provide a
genuine promise of changing the situation.
Intelligent traffic management may be used to enforce traffic regulations and
promote road safety. For example, Alibaba's City Brain initiative in China uses AI
technologies such as predictive analysis, big data analysis, and a visual search
engine in order to track road networks in real-time and reduce congestion.
Building a city requires an efficient transformation system, and AI-based traffic
management technologies are powering next-generation smart cities.
 Ride-Sharing

Platforms like Uber and OLA leverage AI to improve user experiences by


connecting riders and drivers, improving user communication and messaging, and
optimizing decision-making. For example, Uber has its own proprietary ML-as-a-
service platform called Michelangelo that can anticipate supply and demand,
identify trip abnormalities like wrecks, and estimate arrival timings.
 Route Planning

AI-enabled route planning using predictive analytics may help both businesses and
people. Ride-sharing services already achieve this by analyzing numerous real-
world parameters to optimize route planning.
AI-enabled route planning is a terrific approach for businesses, particularly
logistics and shipping industries, to construct a more efficient supply network by
anticipating road conditions and optimizing vehicle routes. Predictive analytics in
route planning is the intelligent evaluation by a machine of a number of road usage
parameters such as congestion level, road restrictions, traffic patterns, consumer
preferences, and so on.
Cargo logistics companies, such as vehicle transport services or other general
logistics firms, may use this technology to reduce delivery costs, accelerate
delivery times, and better manage assets and operations.
18. AI in Automotive Industry
The importance of Artificial Intelligence and machine learning in the automotive
sector cannot be overstated. With Artificial Intelligence driving more applications
to the automotive sector, more businesses are deciding to implement Artificial
Intelligence and machine learning models in production.
 Manufacturing

Infusing AI into the production experience allows automakers to benefit from


smarter factories, boosting productivity and lowering costs. AI may be utilized in
automobile assembly, supply chain optimization, employing robots on the
manufacturing floor, improving performance using sensors, designing cars, and in
post-production activities.
 Passenger and Driver Experience

We all wish to have a pleasant journey in our vehicles. Artificial Intelligence can
also help with this. When driving, Artificial Intelligence (AI) may assist drivers in
remaining focused by decreasing distractions, analyzing driving behaviors, and
enhancing the entire customer experience. Passengers can benefit from customized
accessibility as well as in-car delivery services thanks to AI.
 Inspections

The procedure of inspecting an automobile by a rental agency, insurance provider,


or even a garage is very subjective and manual. With AI, car inspection may go
digital, with modern technology being able to analyze a vehicle, identify where the
flaws are, and produce a thorough status report.
 Quality Control

Everyone desires a premium vehicle and experience. Wouldn't you prefer to know
if something is wrong with your automobile before it breaks down? In this
application, AI enables extremely accurate predictive monitoring, fracture
detection, and other functions.
Why Is AI Used?
People leverage the strength of Artificial Intelligence because the work they need
to carry out is rising on a daily basis. So automating everyday tasks is a smart idea.
This saves the organization's staff while also increasing output. Furthermore, the
organization may obtain competent individuals for the company's development
through the use of Artificial Intelligence.
Also, businesses nowadays believe that they wish to automate all regular and
ordinary tasks. And they believe they can automate such routine tasks using a
simple application. Because automation is becoming more widespread as data
science advances. This AI is most commonly used in online chat portals. You've
undoubtedly seen the welcome message when you visit the websites. After that, the
genuine talk generally begins.
LEARNING IN AI.
Learning is “a process that leads to change, which occurs as a result of experience
and increases the potential for improved performance and future learning.

Goals of Artificial Intelligence


Here are the main Goals of AI:
1. It helps you reduce the amount of time needed to perform specific tasks.

2. Making it easier for humans to interact with machines.

3. Facilitating human-computer interaction in a way that is more natural and

efficient.
4. Improving the accuracy and speed of medical diagnoses.

5. Helping people learn new information more quickly.

6. Enhancing communication between humans and machines.

Subfields of Artificial Intelligence


Here, are some important subfields of Artificial Intelligence:
 Machine Learning: Machine learning is the art of studying algorithms that
learn from examples and experiences. Machine learning is based on the idea
that some patterns in the data were identified and used for future predictions.
The difference from hard coding rules is that the machine learns to find such
rules.
 Deep Learning: Deep learning is a sub-field of machine learning. Deep
learning does not mean the machine learns more in-depth knowledge; it uses
different layers to learn from the data. The depth of the model is represented
by the number of layers in the model. For instance, the Google LeNet model
for image recognition counts 22 layers.
 Natural Language Processing: A neural network is a group of connected
I/O units where each connection has a weight associated with its computer
programs. It helps you to build predictive models from large databases. This
model builds upon the human nervous system. You can use this model to
conduct image understanding, human learning, computer speech, etc.
 Expert Systems: An expert system is an interactive and reliable computer-
based decision-making system that uses facts and heuristics to solve
complex decision-making problems. It is also considered at the highest level
of human intelligence. The main goal of an expert system is to solve the
most complex issues in a specific domain.
 Fuzzy Logic: Fuzzy Logic is defined as a many-valued logic form that may
have truth values of variables in any real number between 0 and 1. It is the
handle concept of partial truth. In real life, we may encounter a situation
where we can’t decide whether the statement is true or false.
FORMS OF LEARNING.
There are two main forms of learning in AI. They include:
1. Decision Tree Classification and the ID3Algorithm.
2. Statistical Learning
Decision Tree Classification and the ID3Algorithm.
1. Decision Tree is a supervised learning technique that can be used for both
classification and Regression problems, but mostly it is preferred for solving
Classification problems. It is a tree-structured classifier, where internal
nodes represent the features of a dataset, branches represent the
decision rules and each leaf node represents the outcome.
2. In a Decision tree, there are two nodes, which are the Decision Node and
Leaf Node. Decision nodes are used to make any decision and have multiple
branches, whereas Leaf nodes are the output of those decisions and do not
contain any further branches.
3. The decisions or the test are performed on the basis of features of the given
dataset.
4. It is a graphical representation for getting all the possible solutions to a problem/decision based
on given conditions.
5. It is called a decision tree because, similar to a tree, it starts with the root
node, which expands on further branches and constructs a tree-like structure.
6. In order to build a tree, we use the CART algorithm, which stands for
Classification and Regression Tree algorithm.
7. A decision tree simply asks a question, and based on the answer (Yes/No), it
further split the tree into sub trees.
Below diagram explains the general structure of a decision tree:

Note: A decision tree can contain categorical data (YES/NO) as well as numeric
data.

Why use Decision Trees?


There are various algorithms in Machine learning, so choosing the best algorithm
for the given dataset and problem is the main point to remember while creating a
machine learning model. Below are the two reasons for using the Decision tree:
 Decision Trees usually mimic human thinking ability while making a

decision, so it is easy to understand.


 The logic behind the decision tree can be easily understood because it shows
a tree-like structure.
Decision Tree Terminologies
1. Root Node: Root node is from where the decision tree starts. It represents
the entire dataset, which further gets divided into two or more homogeneous
sets.
2. Leaf Node: Leaf nodes are the final output node, and the tree cannot be
segregated further after getting a leaf node.
3. Splitting: Splitting is the process of dividing the decision node/root node
into sub-nodes according to the given conditions.
4. Branch/Sub Tree: A tree formed by splitting the tree.
5. Pruning: Pruning is the process of removing the unwanted branches from
the tree.
6. Parent/Child node: The root node of the tree is called the parent node, and
other nodes are called the child nodes.
How does the Decision Tree algorithm Work?
In a decision tree, for predicting the class of the given dataset, the algorithm starts
from the root node of the tree. This algorithm compares the values of root attribute
with the record (real dataset) attribute and, based on the comparison, follows the
branch and jumps to the next node.
For the next node, the algorithm again compares the attribute value with the other
sub-nodes and move further. It continues the process until it reaches the leaf node
of the tree. The complete process can be better understood using the below
algorithm:
Decision tree Example

The picture above depicts a decision tree that is used to classify whether a person is
Fit or Unfit.
The decision nodes here are questions like ‘’‘Is the person less than 30 years of age?’,
‘Does the person eat junk?’, etc.and the leaves are one of the two possible outcomes
viz. Fit and Unfit.
Looking at the Decision Tree we can say make the following decisions:
if a person is less than 30 years of age and doesn’t eat junk food then he is Fit, if a
person is less than 30 years of age and eats junk food then he is Unfit and so on.
The initial node is called the root node (colored in blue), the final nodes are called
the leaf nodes (colored in green) and the rest of the nodes are called intermediate or
internal nodes.
The root and intermediate nodes represent the decisions while the leaf nodes
represent the outcomes.
ID3 in brief
ID3 stands for Iterative Dichotomiser 3 and is named such because the algorithm
iteratively (repeatedly) dichotomizes (divides) features into two or more groups at
each step.
Invented by Ross Quinlan, ID3 uses a top-down greedy approach to build a
decision tree. In simple words, the top-down approach means that we start
building the tree from the top and the greedy approach means that at each iteration
we select the best feature at the present moment to create a node.
Most generally ID3 is only used for classification problems with nominal features
only.
Statistical Learning.
Statistical Learning is a set of tools for understanding data. These tools broadly
come under two classes: supervised learning & unsupervised learning.
Generally, supervised learning refers to predicting or estimating an output based on
one or more inputs.
Supervised learning: Supervised learning, as the name indicates, has the
presence of a supervisor as a teacher. Basically supervised learning is when we
teach or train the machine using data that is well-labeled. Which means some data
is already tagged with the correct answer. After that, the machine is provided with
a new set of examples (data) so that the supervised learning algorithm analyses the
training data (set of training examples) and produces a correct outcome from
labeled data.
For instance, suppose you are given a basket filled with different kinds of fruits.
Now the first step is to train the machine with all the different fruits one by one
like this:

 If the shape of the object is rounded and has a depression at the top, is red in
color, then it will be labeled as –Apple.
 If the shape of the object is a long curving cylinder having Green-Yellow

color, then it will be labeled as –Banana.


Now suppose after training the data, you have given a new separate fruit, say
Banana from the basket, and asked to identify it.
Since the machine has already learned the things from previous data and this time
has to use it wisely. It will first classify the fruit with its shape and color and would
confirm the fruit name as BANANA and put it in the Banana category. Thus the
machine learns the things from training data (basket containing fruits) and then
applies the knowledge to test data (new fruit).
Supervised learning is classified into two categories of algorithms:
 Classification: A classification problem is when the output variable is a

category, such as “Red” or “blue”, “disease” or “no disease”.


 Regression: A regression problem is when the output variable is a real

value, such as “dollars” or “weight”.


Supervised learning deals with or learns with “labeled” data. This implies that
some data is already tagged with the correct answer.
Types:-
1. Regression

2. Logistic Regression

3. Classification

4. Naive Bayes Classifiers

5. K-NN (k nearest neighbors)

6. Decision Trees

7. Support Vector Machine

Advantages:-
1. Supervised learning allows collecting data and produces data output from

previous experiences.
2. Helps to optimize performance criteria with the help of experience.

3. Supervised machine learning helps to solve various types of real-world

computation problems.
4. It performs classification and regression tasks.

5. It allows estimating or mapping the result to a new sample.

6. We have complete control over choosing the number of classes we want in

the training data.


Disadvantages:-
1. Classifying big data can be challenging.

2. Training for supervised learning needs a lot of computation time. So, it

requires a lot of time.


3. Supervised learning cannot handle all complex tasks in Machine Learning.

4. Computation time is vast for supervised learning.

5. It requires a labeled data set.

6. It requires a training process.

Steps
Unsupervised learning
Unsupervised learning is the training of a machine using information that is neither
classified nor labeled and allowing the algorithm to act on that information without
guidance. Here the task of the machine is to group unsorted information according
to similarities, patterns, and differences without any prior training of data.
Unlike supervised learning, no teacher is provided that means no training will be
given to the machine. Therefore the machine is restricted to find the hidden
structure in unlabeled data by itself.
For instance, suppose it is given an image having both dogs and cats which it has
never seen.

Thus the machine has no idea about the features of dogs and cats so we can’t
categorize it as ‘dogs and cats ‘. But it can categorize them according to their
similarities, patterns, and differences, i.e., we can easily categorize the above
picture into two parts. The first may contain all pics having dogs in them and the
second part may contain all pics having cats in them. Here you didn’t learn
anything before, which means no training data or examples.
It allows the model to work on its own to discover patterns and information that
was previously undetected. It mainly deals with unlabeled data.
Unsupervised learning is classified into two categories of algorithms:
 Clustering: A clustering problem is where you want to discover the inherent

groupings in the data, such as grouping customers by purchasing behavior.


 Association: An association rule learning problem is where you want to

discover rules that describe large portions of your data, such as people that
buy X also tend to buy Y.
Types of Unsupervised Learning:-
Clustering
1. Exclusive (partitioning)
2. Agglomerative
3. Overlapping
4. Probabilistic
Clustering Types:-
1. Hierarchical clustering
2. K-means clustering
3. Principal Component Analysis
4. Singular Value Decomposition
5. Independent Component Analysis

Supervised vs. Unsupervised Machine Learning:


Unsupervised machine
Parameters Supervised machine learning
learning
Algorithms are trained using Algorithms are used against data
Input Data
labeled data. that is not labeled
Computational
Simpler method Computationally complex
Complexity
Accuracy Highly accurate Less accurate
No. of classes No. of classes is known No. of classes is not known
Data Analysis Uses offline analysis Uses real-time analysis of data
Algorithms used Linear and Logistics K-Means clustering,
regression, Random forest,
Hierarchical clustering,
Support Vector Machine,
Apriority algorithm, etc.
Neural Network, etc.
Output Desired output is given. Desired output is not given.
Use training data to infer
Training data No training data is used.
model.
It is not possible to learn larger It is possible to learn larger and
Complex model and more complex models than more complex models with
with supervised learning. unsupervised learning.
Model We can test our model. We cannot test our model.
Supervised learning is also Unsupervised learning is also
Called as
called classification. called clustering.
Example: Find a face in an
Example: Optical character
Example image.
recognition.

Advantages of unsupervised learning:


1. It does not require training data to be labeled.
2. Dimensionality reduction can be easily accomplished using unsupervised

learning.
3. Capable of finding previously unknown patterns in data.

4. Flexibility: Unsupervised learning is flexible in that it can be applied to a

wide variety of problems, including clustering, anomaly detection, and


association rule mining.
5. Exploration: Unsupervised learning allows for the exploration of data and

the discovery of novel and potentially useful patterns that may not be
apparent from the outset.
6. Low cost: Unsupervised learning is often less expensive than supervised

learning because it doesn’t require labeled data, which can be time-


consuming and costly to obtain.
Disadvantages of unsupervised learning:
1. Difficult to measure accuracy or effectiveness due to lack of predefined
answers during training.
2. The results often have lesser accuracy.
3. The user needs to spend time interpreting and label the classes which follow
that classification.
4. Lack of guidance: Unsupervised learning lacks the guidance and feedback
provided by labeled data, which can make it difficult to know whether the
discovered patterns are relevant or useful.
5. Sensitivity to data quality: Unsupervised learning can be sensitive to data
quality, including missing values, outliers, and noisy data.
6. Scalability: Unsupervised learning can be computationally expensive,
particularly for large datasets or complex algorithms, which can limit its
scalability.
LESSON 3.
Problem Solving Techniques in Artificial
Intelligence (AI)

Problem-solving is commonly known as the method to reach the desired goal or find a
solution to a given situation. In computer science, problem-solving refers to artificial
intelligence techniques, including various techniques such as forming efficient
algorithms, heuristics, and performing root cause analysis to find desirable solutions.
Problem-solving in Artificial Intelligence usually refers to researching a solution to a
problem by performing logical algorithms, utilizing polynomial and differential equations,
and executing them using modeling paradigms. There can be various solutions to a
single problem, which are achieved by different heuristics. Also, some problems have
unique solutions. It all rests on the nature of the given problem.
1. Steps of problem solving in AI
2. AI methods of problem solving

Examples of Problems in Artificial Intelligence


Developers worldwide are using artificial intelligence to automate systems for efficient
utilization of time and resources. Some of the most common problems encountered in
day-to-day life are games and puzzles. These can be solved efficiently by using artificial
intelligence algorithms. Ranging from mathematical puzzles.
What is a Reflex Agent?
There are five primary agents used in Artificial Intelligence based on their capability of
perceiving intelligence. These agents are the following:
1. Simple Reflex Agents
2. Model-Based Reflex Agents
3. Goal-Based Agents
4. Utility-Based Agents
5. Learning Agents
These agents prove helpful in the mapping of states and actions. While solving a
complex problem, these agents often fail to comprehend the next step adequately; thus,
problem-solving agents solve such scenarios. These agents use techniques like B-tree
and heuristic algorithms to solve problems using artificial intelligence.
Problem Solving Techniques
Artificial Intelligence is beneficial for solving complex problems due to its efficient
methods of solving. Following are some of the standard problem-solving techniques
used in AI. You can explore about other problem-solving techniques apart from
searching.
Heuristics
The heuristic method helps comprehend a problem and devises a solution based purely
on experiments and trial and error methods. However, these heuristics do not often
provide the best optimal solution to a specific problem. Instead, these undoubtedly offer
efficient solutions to attain immediate goals. Therefore, the developers utilize these
when classic methods do not provide an efficient solution for the problem. Since
heuristics only provide time-efficient solutions and compromise accuracy, these are
combined with optimization algorithms to improve efficiency.
Example: Travelling Salesman Problem
The most common example of using heuristic is the Travelling Salesman problem.
There is a provided list of cities and their distances. The user has to find the optimal
route for the Salesman to return to the starting city after visiting every city on the list.
The greedy algorithms solve this NP-Hard problem by finding the optimal solution.
According to this heuristic, picking the best next step in every current city provides the
best solution.
Searching Algorithms
Searching is one of the primary methods of solving any problem in AI. Rational agents
or problem-solving agents use these searching algorithms to find optimal solutions.
These problem-solving agents are often goal-based and utilize atomic representation.
Moreover, these searching algorithms possess completeness, optimality, time
complexity, and space complexity properties based on the quality of the solution
provided by them.
Types of Searching Algorithms
There are following two main types of searching algorithms:
1. Informed Search
2. Uninformed Search
Informed Search
These algorithms use basic domain knowledge and comprehend available information
regarding a specified problem as a guideline for optimal solutions. The solutions
provided by informed search algorithms are more efficient than uninformed search
algorithms.
NB: Domain knowledge is used to refer to the general background knowledge of
the field or environment to which the methods of data science are being applied.
There are following main two types of informed search algorithms:
1. Greedy Search: A greedy search is an approach for solving a problem by selecting the
best option available at the moment.
2. A * Search: A * algorithm is a searching algorithm that searches for the shortest path
between the initial and the final state. It is used in various applications, such as maps. In
maps the A* algorithm is used to calculate the shortest distance between the source
(initial state) and the destination (final state).
Uninformed Search
These algorithms do not have the privilege of using basic domain knowledge, such as
the desired goal’s closeness. It contains information regarding traversing a tree and
identifying leaf and goal nodes. Uninformed search also goes by the name of blind
search because while traversing, there is no specific information about the initial state
and test for the goal. This search goes through every node till reaching the desired
destination.
Types of Uninformed Searching Algorithms
There are the following main five types of uninformed search algorithms:
1. Breadth-First Search
2. Depth First Search
3. Uniform Cost Search
4. Iterative Deepening Depth First Search
5. Bidirectional Search

Evolutionary Computation
This problem-solving method utilizes the well-known evolution concept. The theory of
evolution works on the principle of survival of the fittest. It states that the organism
which can cope well with their environment in a challenging or changing environment
and reproduce, their future generations gradually inherit the coping mechanism,
generating the diversity in new child organisms. In this way, the new organisms are not
mere copies of the old ones but have the mixes of characteristics that go along with that
harsh environment. Humans are the most prominent example of the evolution process
that has evolved and diversified because of the accumulation of favorable mutations
over countless generations.
In AI, the evolution concept refers to the trial and error technique:
1. Randomly change the old solution to come up with the new one. Does it give better
results? If yes, then keep and throw away the rest of the solutions. If not, then discard it.
2. Go to point 1.

Genetic Algorithms
The evolution theory is the basis of genetic algorithms. These algorithms use the direct
random search method. The developers calculate the fit function to cross the two fittest
options to create a favorable child. The developers gather the population data and then
evaluate each member to calculate everyone’s fitness. It is estimated by how well each
member fits with the desired requirement. Then the developers use various selection
methods to keep the best members. Some of the ways are the following:
1. Rank Selection
2. Tournament Selection
3. Steady Selection
4. Roulette Wheel Selection (Fitness Proportionate Selection)
5. Elitism (A strategy in evolutionary algorithms where the best one or more
solutions, called the elites, in each generation, are inserted into the next, without
undergoing any change)
Although genetic algorithms do not always work best, they do not break easily, and the
inputs change slightly. The developers commonly use genetic algorithms to generate a
high-level solution to optimization and search problems by relying on bio-inspired
operations such as mutation, crossover, and selection.
The problem-solving techniques help in improving the performance of programs. The
AI-based searching algorithms provide high precision and maximum accuracy to results.
These algorithms are faster than others in execution and offer multiple searching
methods depending upon the problem faced. Implementing heuristics allows the
devising to conceptually more straightforward algorithms with cheaper computational
costs compared to using optimal algorithms. Evolutionary computations also help in
optimization and search problems. Overall, these techniques are the basis for solving
high-level problems in AI such as chess algorithms, and hill-climbing problems.
Why Problem Solving is Important in AI?
Decoding any type of problem needs specific organized measures to be observed.
Identical is the matter of solving issues by AI. The following are the details:
 Goal –In this phase as soon as a crisis appears, the AI agent puts a goal or a mark. This
needs the agent to thoroughly examine and clarify the issue. This is a vital action as if
the goal for the issue is poorly developed then all the actions carried out to achieve the
goal would be useless.
 Problem Description–This is one major stage of problem-solving. Whenever a problem
occurs, then the agent chooses what measures must be carried out to run to the
developed goal. This is accomplished in the subsequent actions:
 Describing the State–A state area can be described as a group of all the accurate
conditions in which an agent can be joined when discovering a key to the crisis.

o
 Specifying Primary State–For an agent to begin cracking the issue, it
must begin from a state. The primary state from where the agent begins
performing is directed to the primary state.
 Collect Details–Now the agent collects data and utilizes the data needed
by it to fix the issue. These details will be collected with one-time
incidents as well as present pieces of knowledge.
 Designing the Changes–Some issues are undersized and so these can
be deciphered efficiently. But most of the time issues will be such where
sound planning and implementation are required. Hence this needs
appropriate data structures and managing processes well in advance.

Problem Formulation in AI
It is one of the basic stages of problem-solving that determines what measure should be
brought to fulfill the developed goal. Problem formulation is the stage in problem
description that is utilized to comprehend and choose a course of activity that must be
evaluated to reach a goal. If there is more than one method an agent can attain its
objective, then it generates intricacy in terms of truly reaching the goal as there would
be too numerous measures and courses that the AI entity can carry to achieve the goal
that it induces chaos and a tremendous decline in the efficiency. Problem formulation
can be accomplished in many stages such as the description of the initial condition of
the agent, choosing probable steps that the agent can bear, and design of transition
standards to define the efforts of the agent.
Problem-solving agents in artificial intelligence
Here the issue is split into sub-issues. The effects of the different measures carried out
in cracking the last sub-problem are delivered to the following sub-problem and the
integrated outcome of the sub-problems ushers to the definitive solution. This needs
appropriate planning and implementation of changes.
 Testing with the Goal State–In this phase, the outcomes generated from the agent are
analogized with that of the objective state. If the goal has been achieved, then the
agents block any additional activities and the issue arrives at the final state. But if the
goal is not achieved then the agent persists to discover activities to run to the goal.
 Estimating the Expense of Path carried–Whenever an agent carries a course to
decipher a situation it permits a numeric value (or price) to that course. These prices are
then estimated by utilizing a price function. The estimated consequence is hence
employed in the agent’s implementation action. The solution which is achieved with the
minimum or most subordinate price of the path is called the perfect solution.

Steps of problem solving in AI


Performance benchmark is one of the vital things in AI problem solving which
determines the value of the algorithm utilized to fix the issue. There are four methods in
which the execution of an algorithm is calculated. These are as follows:
 Totality – Totality calculates the algorithm’s assurance to discover the solution for the
issue if there is any solution for it.
 Optimality – This step is utilized to calculate the tracking methods which discover an
optimal solution to the presented issue.
 Duration Complexity – This calculates the portion of time the algorithm abides to run till
the key for the assigned issue.
 Space Intricacy – This is the standard that is employed to specify the quantity of area

(in memory) that the algorithm needs to execute the quest .


Basics to AI, ML And DL
Artificial Intelligence (AI)

Artificial Intelligence (AI) can make lots of difference to human intelligence and their life. AI
helps machines behave more like humans in:

 Talking
 Thinking
 Learning
 Planning
 Understanding

AI is also known as machine/Computer intelligence and is the result of the human need to
perform complex calculations, and is achieved by high-speed computers. AI brings intelligence
to machines.

The purpose of making machines intelligent is to make a person's life better, improve
productivity, relieve people from complex tasks, and reach remote and dangerous places while
keeping humans safe.

Machine Learning (ML)

Machine learning (ML) is a type of AI that allows software applications to predict outcomes
accurately without the necessity of complex programming. Most ML algorithms use historical
data as input to predict new output values.

Hence, we can say ML is a subset of AI, i.e., all ML algorithms are classified as being part of AI.
ML enables the machine to automatically learn from data, improve performance from past
experiences, and make predictions. ML contains a set of algorithms that work on a huge amount
of data. ML algorithms use these data to train, build the model and generate new output
predictions from the built model.

Deep Learning (DL)

Deep Learning (DL) is a subset of ML. It was inspired by the working of human brains. The term
Deep Learning usually refers to Deep Artificial Neural Networks.

Artificial Neural Networks (ANN) is a depiction of natural neurons in the brain. ANNs are
mainly involved with deep learning algorithms and mimic the functionality of the human brain.
The ANN can be used on all types of ML algorithms based on its functionality.

Hence, DL effectively teaches computers to do what humans naturally do, say learning by
example. It uses Neural Networks (NNs) to extract higher-level data. NNs consist of 3 layers, say
the input layer, the hidden layer, and the output layer. Each successive layer uses the output of
the preceding layer as its input. For example, optical reading uses lower layers to identify edges
and higher layers to identify letters.

ML and its types

There are many types of ML that can be used in different applications based on the type of
datasets used. ML type is determined by different criteria like the type and amount of data, ML
model requirement, how to train the ML model, etc.

Based on this, ML is divided into four types:

 Supervised Learning
 Unsupervised Learning
 Semi-Supervised Learning
 Reinforcement Learning
DEEP LEARNING
What is Deep Learning?
Deep learning is a type of machine learning that uses artificial neural networks to learn from data.
Artificial neural networks are inspired by the human brain, and they can be used to solve a wide variety of
problems, including image recognition, natural language processing, and speech recognition.

Deep learning algorithms


Deep learning algorithms are typically trained on large datasets of labeled data. The algorithms learn to
associate features in the data with the correct labels. For example, in an image recognition task, the
algorithm might learn to associate certain features in an image (such as the shape of an object or the color
of an object) with the correct label (such as "dog" or "cat").

Once a deep learning algorithm has been trained, it can be used to make predictions on new data. For
example, a deep learning algorithm that has been trained to recognize images of dogs can be used to
identify dogs in new images.

How does deep learning work?


Deep learning works by using artificial neural networks to learn from data. Neural networks are made up
of layers of interconnected nodes, and each node is responsible for learning a specific feature of the data.
Building on our previous example with images – in an image recognition network, the first layer of nodes
might learn to identify edges, the second layer might learn to identify shapes, and the third layer might
learn to identify objects.

As the network learns, the weights on the connections between the nodes are adjusted so that the network
can better classify the data. This process is called training, and it can be done using a variety of
techniques, such as supervised learning, unsupervised learning, and reinforcement learning.
Once a neural network has been trained, it can be used to make predictions with new data it’s received.

Deep learning applications


Deep learning can be used in a wide variety of applications, including:
1. Image recognition: To identify objects and features in images, such as people, animals, places,
etc.
2. Natural language processing: To help understand the meaning of text, such as in customer service
chat bots and spam filters.
3. Finance: To help analyze financial data and make predictions about market trends
4. Text to image: Convert text into images, such as in the Google Translate app.
Types of deep learning
There are many different types of deep learning models. Some of the most common types include:

 Convolutional neural networks (CNNs)


CNNs are used for image recognition and processing. They are particularly good at identifying objects in
images, even when those objects are partially obscured or distorted.

 Deep reinforcement learning


Deep reinforcement learning is used for robotics and game playing. It is a type of machine learning that
allows an agent to learn how to behave in an environment by interacting with it and receiving rewards or
punishments.

 Recurrent neural networks (RNNs)


RNNs are used for natural language processing and speech recognition. They are particularly good at
understanding the context of a sentence or phrase, and they can be used to generate text or translate
languages.

Benefits of using deep learning models


There are a number of benefits to using deep learning models, including:
 Can learn complex relationships between features in data: This makes them more powerful than
traditional machine learning methods.
 Large dataset training: This makes them very scalable, and able to learn from a wider range of
experiences, making more accurate predictions.
 Data-driven learning: DL models can learn in a data-driven way, requiring less human
intervention to train them, increasing efficiency and scalability. These models learn from data that
is constantly being generated, such as data from sensors or social media.
Challenges in Deep Learning
Deep learning has made significant advancements in various fields, but there are still some challenges
that need to be addressed. Here are some of the main challenges in deep learning:

 Data availability: It requires large amounts of data to learn from. For using deep learning it’s a
big concern to gather as much data for training.
 Computational Resources: For training the deep learning model, it is computationally expensive
because it requires specialized hardware like GPUs and TPUs.
 Time-consuming: While working on sequential data depending on the computational resource it
can take very large even in days or months.
 Interpretability: Deep learning models are complex; it works like a black box. It is very difficult
to interpret the result.
 Over fitting: when the model is trained again and again, it becomes too specialized for the
training data, leading to over fitting and poor performance on new data.

Advantages of Deep Learning:


1. High accuracy: Deep Learning algorithms can achieve state-of-the-art performance in various
tasks, such as image recognition and natural language processing.
2. Automated feature engineering: Deep Learning algorithms can automatically discover and learn
relevant features from data without the need for manual feature engineering.
3. Scalability: Deep Learning models can scale to handle large and complex datasets, and can learn
from massive amounts of data.
4. Flexibility: Deep Learning models can be applied to a wide range of tasks and can handle various
types of data, such as images, text, and speech.
5. Continual improvement: Deep Learning models can continually improve their performance as
more data becomes available.
Disadvantages of Deep Learning:
1. High computational requirements: Deep Learning AI models require large amounts of data and
computational resources to train and optimize.
2. Requires large amounts of labeled data: Deep Learning models often require a large amount of
labeled data for training, which can be expensive and time- consuming to acquire.
3. Interpretability: Deep Learning models can be challenging to interpret, making it difficult to
understand how they make decisions.
4. Over fitting: Deep Learning models can sometimes over fit to the training data, resulting in poor
performance on new and unseen data.
5. Black-box nature: Deep Learning models are often treated as black boxes, making it difficult to
understand how they work and how they arrived at their predictions.
Deep Learning Architectures
1. RNN: Recurrent Neural Networks (RNNs)
RNN is one of the fundamental network architectures from which other deep learning
architectures are built. RNNs consist of a rich set of deep learning architectures. They can
use their internal state (memory) to process variable-length sequences of inputs. Let’s say
that RNNs have a memory. Every processed information is captured, stored, and utilized to
calculate the final outcome. This makes them useful when it comes to, for instance, speech
recognition.
Currently, we can indicate two types of RNN:
 Bidirectional RNN: They work two ways; the output layer can get information from past
and future states simultaneously[2].
 Deep RNN: Multiple layers are present. As a result, the DL model can extract more
hierarchical information.

2. LSTM: Long Short-Term Memory


It’s also a type of RNN. However, LSTM has feedback connections. This means that it can
process not only single data points (such as images) but also entire sequences of data (such as
audio or video files)[3].
LSTM derives from neural network architectures and is based on the concept of a memory
cell. The memory cell can retain its value for a short or long time as a function of its inputs,
which allows the cell to remember what’s essential and not just its last computed value.
A typical LSTM architecture is composed of a cell, an input gate, an output gate, and a forget
gate. The cell remembers values over arbitrary time intervals, and these three gates regulate
the flow of information into and out of the cell.
 The input gate controls when new information can flow into the memory.
 The output gate controls when the information that is contained in the cell is used in the
output.
 The forget gate controls when a piece of information can be forgotten, allowing the cell
to process new data.
Today, LSTMs are commonly used in such fields as text compression, handwriting
recognition, speech recognition, gesture recognition, and image captioning.

3. GRU
This abbreviation stands for Gated Recurrent Unit. It’s a type of LSTM. The major
difference is that GRU has fewer parameters than LSTM, as it lacks an output gate.
GRUs are used for smaller and less frequent datasets, where they show better performance.

4. CNN: Convolutional Neural Networks (CNNs)


This architecture is commonly used for image processing, image recognition, video analysis,
and NLP.
CNN can take in an input image, assign importance to various aspects/objects in the image,
and be able to differentiate one from the others. The name ‘convolutional’ derives from a
mathematical operation involving the convolution of different functions. CNNs consist of an
input and an output layer, as well as multiple hidden layers.
5. DBN: Deep Belief Network
DBN is a multilayer network (typically deep, including many hidden layers) in which each
pair of connected layers is a Restricted Boltzmann Machine (RBM). Therefore, we can state
that DBN is a stack of RBMs. DBN is composed of multiple layers of latent variables
(“hidden units”), with connections between the layers but not between units within each
layer. DBNs use probabilities and unsupervised learning to produce outputs. Unlike
other models, each layer in DBN learns the entire input. In CNNs, the first layers only filter
inputs for basic features, and the latter layers recombine all the simple patterns found by the
previous layers. DBNs work holistically and regulate each layer in order.
6. DSN: Deep Stacking Network
We saved DSN for last because this deep learning architecture is different from the others.
DSNs are also frequently called DCN–Deep Convex Network. DSN/DCN comprises a deep
network, but it’s actually a set of individual deep networks. Each network within DSN has its
own hidden layers that process data. This architecture has been designed in order to improve
the training issue, which is quite complicated when it comes to traditional deep learning
models. Thanks to many layers, DSNs consider training, not a single problem that has to be
solved but a set of individual problems.

7. Transformer
The Transformer is a powerful deep learning architecture that has significantly impacted the
field of natural language processing (NLP). It was first introduced in a 2017 paper by Google
researchers and has since become a cornerstone in various advanced language models.
Unlike traditional models that rely on Recurrent Neural Networks (RNNs) for sequential
information extraction, Transformers leverage self-attention mechanisms to understand
context and relationships between different elements in a sequence.
Key points about the Transformer architecture include:
 Self-Attention Mechanism
Transformers apply self-attention to model relationships between all elements in a
sequence, allowing them to capture dependencies regardless of position.
 Encoder-Decoder Structure
They consist of an encoder that processes the input sequence and a decoder that generates
the output sequence, both utilizing self-attention.
 Parallel Processing
Transformers can process input sequences in parallel, enhancing computational
efficiency.
 Evolution of NLP
Transformers have led to the development of advanced models like BERT, GPT, and
LaMDA, which excel in tasks such as language understanding, generation, and
translation.
 Applications
Transformers are widely used in tasks like machine translation, text generation, question-
answering, and more, showcasing their versatility and effectiveness in handling
sequential data.
The Transformer’s ability to capture complex relationships in data, its parallel processing
capabilities, and its impact on various NLP tasks make it a fundamental architecture in
modern deep learning research, driving advancements in language understanding and
generation.
8. Generative Adversarial Networks (GANs)
Generative Adversarial Networks (GANs) are a powerful class of deep learning models used
for generative tasks, where they automatically learn and generate new data instances that
resemble the original dataset. GANs consist of two primary components:
1. Generator: The generator network creates new data instances, such as images, based on
random input. It aims to generate outputs that are realistic and indistinguishable from real
data.
2. Discriminator: The discriminator network acts as a classifier, distinguishing between real
data instances from the original dataset and fake data generated by the generator. It assigns a
probability score to each input, indicating the authenticity of the data.
Key points about GANs include:
 Adversarial Training: GANs operate in a competitive manner where the generator and
discriminator are trained simultaneously. The generator aims to produce realistic outputs
to fool the discriminator, while the discriminator learns to differentiate between real and
generated data.
 Applications: GANs have diverse applications, including image generation, virtual
reality, predictive imagery, text-based image generation, and more. They are particularly
useful for tasks requiring the creation of new data based on existing patterns.
 Conditional GANs: Conditional Generative Adversarial Networks (cGANs) are a type
of GAN that generates outputs based on additional auxiliary information, enhancing the
control and specificity of the generated data.
GANs have revolutionized generative modeling by enabling the creation of high-quality, realistic
data that can be used in various domains such as image synthesis, content creation, and pattern
recognition. Their ability to learn complex patterns and generate new data has made them a
fundamental tool in the field of deep learning.

You might also like