Generative AI
LEARNING OBJECTIVES
Timeline of Generative AI
Characteristics of Generative AI
Importance of Generative AI
Types of Generative AI
GANPaint
Generative AI Tools
Benefits of Generative AI
Limitations of Generative AI
These models use algorithms and deep learning techniques to find patterns and structures in
existing data and generate new content similar to the input data.
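The idea of finding patterns in existing data and generating similar new content can be sketched with a very small word-level model. This is only an illustrative toy, not how large generative models are built: the corpus, the function names `learn_patterns` and `generate`, and the seed are all assumptions made for this example.

```python
import random
from collections import defaultdict

def learn_patterns(text):
    """Record, for every word, which words followed it in the existing data."""
    words = text.split()
    followers = defaultdict(list)
    for current, following in zip(words, words[1:]):
        followers[current].append(following)
    return followers

def generate(followers, start, length, seed=0):
    """Produce new text by repeatedly sampling a word that followed the previous one."""
    rng = random.Random(seed)
    output = [start]
    for _ in range(length - 1):
        options = followers.get(output[-1])
        if not options:  # dead end: this word never had a follower
            break
        output.append(rng.choice(options))
    return " ".join(output)

# Hypothetical "existing data" for the model to learn from.
corpus = "the cat sat on the mat and the dog sat on the rug"
model = learn_patterns(corpus)
sample = generate(model, start="the", length=8)
print(sample)
```

Every pair of neighbouring words in the printed sentence also appears somewhere in the corpus, yet the sentence as a whole may be new. Real generative models learn far richer patterns, but the principle of learning from data and then sampling is the same.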
AI Fact File!
DALL-E is a neural network model developed by OpenAI that generates images from textual
descriptions. It belongs to the class of generative AI models capable of creating visual
content based on the input provided.
In supervised learning, the model is trained using labelled data, where each data point has
an input (features) and a corresponding output (label). The algorithm learns to map these
inputs to their desired outputs based on the labelled examples it has been trained on.
Discriminative modelling specifically focuses on learning how to distinguish between different
classes or categories based on the features and characteristics present in the input data. It
aims to build a model that can accurately predict the class of data it has not seen before,
using what it has learned from training examples.
In unsupervised learning, the model is trained using unlabelled data; it discovers patterns
and hidden structures. The model learns to represent the distribution of data by itself as it
has no specific output structure. Generative modelling focuses on generating new data points
that resemble the training data. Using this learned distribution, generative models create new
images or text based on learned patterns, allowing for the creation of unique and realistic outputs.
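The difference between the two kinds of modelling can be sketched with a few numbers. The example below is a simplified illustration under stated assumptions: the height values are made up, and the midpoint boundary and the Gaussian fit are stand-ins chosen for brevity, not the methods used by real AI systems.

```python
import random
import statistics

# Hypothetical toy data: heights (cm) of two labelled classes.
cats = [23.0, 25.0, 24.0, 26.0, 22.0]
dogs = [55.0, 60.0, 58.0, 62.0, 57.0]

# Discriminative view: learn only a boundary that separates the classes.
boundary = (statistics.mean(cats) + statistics.mean(dogs)) / 2

def classify(height):
    """Predict a label for an unseen height using the learned boundary."""
    return "cat" if height < boundary else "dog"

# Generative view: learn the distribution of the data, then sample from it.
cat_mean = statistics.mean(cats)
cat_std = statistics.stdev(cats)

def generate_cat_height(rng):
    """Draw a new data point that resembles the cat training data."""
    return rng.gauss(cat_mean, cat_std)

rng = random.Random(0)
print(classify(30.0))  # prints "cat"
print([round(generate_cat_height(rng), 1) for _ in range(3)])
```

The classifier can only say which class a value belongs to, while the generative part produces new values that look like the training data.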
Timeline of Generative AI
[Figure: timeline of Generative AI milestones. Legible labels include TensorFlow, GPT-2 (2019), GPT (2020), Midjourney (2022), Stable Diffusion, ChatGPT (2022), Google Bard (2023), LLaMA, and Adobe Firefly.]
In 2011, IBM's Watson won Jeopardy!, defeating human champions on the quiz show.
In 2014, Generative Adversarial Networks (GANs) were developed by Ian Goodfellow and his
colleagues in June 2014.
From 2016 to 2019, OpenAI Five was developed. It is a team of artificial intelligence agents
that plays the multiplayer online battle arena game Dota 2. The learning from this was used to
develop ChatGPT.
In 2016, Google DeepMind's AlphaGo beat the highest-level players of Go.
In 2017, Transformers were introduced in the paper "Attention Is All You Need" by Vaswani et al.
In 2018, the GPT (Generative Pre-trained Transformer) architecture was introduced, which may
have been the most advanced AI model at that time.
Subsequently, newer GPT models, including ChatGPT, were released in 2019, 2020, 2022, and
2023 with updated architectures.
In 2022, there was a rise of Generative AI technologies such as DALL-E, CLIP, Imagen, and
Stable Diffusion. These models made significant contributions to the field of artificial image
generation.
In 2023, Bing Chat, Google Bard, LLaMA, and Adobe Firefly were launched, each making
significant contributions to their field as advanced AI language models and creative tools.
• Bing Chat is an AI chatbot by Microsoft for information retrieval and task assistance.
• Google Bard is a Google-developed AI language model for generating coherent text and
storytelling.
• LLaMA is a large language model from Meta (previously known as Facebook) for generating
and understanding text across various tasks.
• Adobe Firefly is an AI tool for image and video creation and enhancement.
AI Review
Column A
Advancement in AI
1. Watson
2. Variational Autoencoder
4. TensorFlow
5. OpenAI Five
6. AlphaGo
7. Transformers
8. ChatGPT
Column B
Description
Characteristics of Generative AI
1. Content Creation: It generates new and unique content using learned patterns in text,
images, audio, video, and code.
2. Pattern Recognition: It identifies and learns from patterns found in large datasets through
machine learning techniques.
5. Creativity: It produces creative and original content through advanced pattern recognition
and adaptation.
6. Versatility: It is applicable across various domains and industries, from creative arts to
scientific research, adapting its methods to different contexts.
7. Scalability: It is capable of handling large datasets and complex tasks.
8. Adaptability: It adapts to new trends and industry information, adjusting its models and
outputs to reflect current standards and user preferences.
AI Fact File!
Generative AI has gained traction in recent times. Companies like LinkedIn are using AI to
enhance job hunting, while Amazon is investing hundreds of millions of dollars in generative
AI startups.
Importance of Generative AI
1. Content Creation: Generative AI's primary importance lies in content creation. We can use
it to generate innovative, original, and creative content in various mediums, such as text,
images, music, and videos.
2. Personalisation: Generative AI enhances the user experience by using users' past data,
preferences, and history to create customised content tailored to their
needs. Streaming platforms like Netflix use it to provide recommendations based on the
user's viewing and search history.
Healthcare: Generative AI helps doctors analyse medical images, diagnose diseases, and create
treatments and improve their effectiveness and efficiency.
Business: Generative AI creates targeted content and personalised customer
interactions to boost marketing efforts. It also helps improve decision-making by analysing
demand, managing inventory, and making logistics more efficient.
Education: Generative AI can be used to adapt lessons to fit students' needs, helping them
understand concepts more comprehensively. It also empowers teachers to create interactive
learning materials and automate administrative tasks, thereby enhancing classroom teaching and
improving student outcomes.
AI Brainstorming!
List a few places where you have encountered Generative AI in your daily
routine, and compare your findings with those of your classmates to see if there are common
or different experiences with Generative AI.
Conventional AI
Conventional AI, also known as Narrow AI, is used for performing specific tasks by analysing
data it has been trained on previously. This type of AI uses techniques like machine learning,
where algorithms learn from large sets of labelled data to recognise patterns and make
predictions. For example, in image recognition, a conventional AI system could learn to
distinguish between different objects such as cats and dogs by analysing thousands of
labelled images. Similarly, in speech recognition, it can understand the user's
speech and convert it into text by recognising the patterns in audio data.
Conventional AI's main strength lies in its ability to operate within well-defined parameters
and rules. This makes it reliable for performing tasks that require precise analysis and
decision-making based on existing data patterns.
The applications of conventional AI are vast, ranging from recommendation systems that
suggest products based on user preferences to medical diagnostic tools that analyse
symptoms and predict potential illnesses. By leveraging its ability to identify and utilise
patterns, it enhances efficiency and accuracy in various fields.
similar to how a human creates things. One of the notable techniques used in generative AI
is Generative Adversarial Networks (GANs).
Generative AI finds applications in various fields, as discussed earlier. Writers can use it to
improve their writing style, or even draft entire articles. Artists and designers can employ
generative AI to explore new visual styles, generate artwork, or assist in architectural design.
Conventional AI vs Generative AI
Conventional AI and Generative AI both have merits and demerits; there is no definitive way
to label one approach as better than the other since each has its own use cases.
Conventional AI excels in problem-solving and accuracy based on data analysis, making it
the preferred choice for tasks such as fraud detection, recommendation systems, and
data-driven decision-making. In contrast, Generative AI is more versatile and creative,
suitable for applications that require the generation of new and unique content, such as
writing, art, and music. By utilising large datasets and advanced neural networks, Generative
AI can produce innovative outputs beyond the capabilities of Conventional AI. AI models often
combine both approaches to create a well-rounded solution, capable of addressing diverse
user needs, integrating the analytical strengths of Conventional AI with the creative potential
of Generative AI.
Conventional AI: Analyses data to recognise patterns and make predictions within a specific domain.
Generative AI: Creates new content based on learned patterns and user input, enhancing creativity and
producing unique outputs.
Aspects compared: Learning Approach, Scope, Output Quality, Flexibility, User Involvement,
Data Dependency, Applications.
Applications (Generative AI): Used in content creation applications such as writing, art generation, and music composition.
AI Review
Which image in each pair is the real image, and which one is generated by AI?
1.
2.
Types of Generative AI
Generative AI can be classified in various ways, with the most common method based on
architecture and mechanisms. However, from a user's perspective, it is more practical to
classify Generative AI models by the type of content they generate (text, images, music,
videos, or speech), as users are more concerned with the output than the underlying
technology.
[Figure: types of Generative AI by output type (image, music, video, speech, code) and by architecture (VAEs, Transformers, autoregressive models, RNNs)]
Classification based on output type focuses on the data generated by the Generative AI
models, such as text, images, music, videos, or speech. This classification is focused on
what is being generated.
1. Text Generation: Text generation models are designed to understand and generate
human-like text. Models like GPT-3, GPT-4, and BERT excel at tasks such as answering
queries, engaging in conversations, and creating text-based questions.
2. Image Generation: Image generation models are used to generate realistic or artistic
images, providing visual content. Models like DALL-E, Midjourney, and StyleGAN are
utilised to create artwork, graphic content, and realistic images. Tools such as GANPaint
enable users to edit and modify images using GAN technology.
3. Music Generation: Music generation models are used to compose music; they create
original compositions or improve upon existing ones. Models like OpenAI's MuseNet and
AIVA specialise in generating music of various styles and genres.
4. Video Generation: Video generation models can generate or modify video content,
creating dynamic visual media. Generative AI models like Synthesia are used to create
AI-driven videos with avatars that can speak multiple languages based on text inputs.
Another example is DeepDream, which modifies existing videos using neural network-based
artistic enhancements.
5. Speech Generation: Speech generation models are used to produce human-like speech,
which helps to enhance text-to-speech applications. Models like DeepMind's WaveNet
generate natural-sounding and highly intelligible speech from text. This has helped improve
human-machine interactions.
6. Code Generation: Code generation models are used to generate code snippets or entire
programs based on natural language prompts or code context. Models like GitHub Copilot,
OpenAI's Codex, and CodeBERT help developers by generating code suggestions,
auto-completing code, and debugging the program.
GitHub Copilot
Before we learn more about the types of Generative AI on the basis of their architecture, we
must first understand what a Neural Network is.
A Neural Network is a type of computer program designed to recognise patterns and make
decisions, similar to how the human brain works. It is essentially a network made up of
neurons. These neurons are similar to brain cells and are used to process information. Each
neuron takes input, processes it, and passes the output to the next neuron.
AI Fact File!
In 2017, the album "I AM AI" was released, which was the first ever music album created by
AI. This album was created using the AI software Amper Music.
A neural network consists of various layers: the input layer, which takes the initial input in the
form of data; the hidden layers, which process the information and recognise patterns; and the
output layer, which provides the final output. The data is processed through each of these
layers by passing information from one neuron to the next until it reaches the output layer.
[Figure: a neural network with an input layer, a hidden layer, and an output layer]
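The flow of data from the input layer through a hidden layer to the output layer can be sketched in a few lines. This is a minimal illustration, assuming fixed, made-up weights and a sigmoid activation; a real network would learn its weights from data, and the names `neuron`, `layer`, and `forward` are chosen only for this example.

```python
import math

def sigmoid(x):
    """Squash any number into the range 0 to 1."""
    return 1.0 / (1.0 + math.exp(-x))

def neuron(inputs, weights, bias):
    """One neuron: weigh the inputs, add a bias, and squash the result."""
    total = sum(i * w for i, w in zip(inputs, weights)) + bias
    return sigmoid(total)

def layer(inputs, weight_rows, biases):
    """A layer is a group of neurons that all read the same inputs."""
    return [neuron(inputs, w, b) for w, b in zip(weight_rows, biases)]

# Hypothetical fixed weights: 2 inputs -> 3 hidden neurons -> 1 output neuron.
hidden_weights = [[0.5, -0.2], [0.3, 0.8], [-0.6, 0.1]]
hidden_biases = [0.0, 0.1, -0.1]
output_weights = [[0.7, -0.4, 0.2]]
output_biases = [0.05]

def forward(inputs):
    """Pass the input through the hidden layer and then the output layer."""
    hidden = layer(inputs, hidden_weights, hidden_biases)
    return layer(hidden, output_weights, output_biases)

print(forward([1.0, 2.0]))
```

Each neuron takes input, processes it, and passes its output on, so the final value depends on every layer in between.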
Now that you have a fundamental understanding of what a neural network and neurons are,
let us discuss the types of generative AI on the basis of the neural network architectures and
mechanisms.
• Generator: The task of the generator is to create synthetic data or content that could be
used in place of the real data. It learns to generate realistic data through feedback received
from the discriminator.
• Discriminator: The task of the discriminator is to
distinguish between synthetic data (produced by the generator) and real data (from the
training set). It is supposed to discriminate between the two to determine which outputs are
artificially manufactured. You performed a task similar to that of the discriminator in Activity
2, where you had to compare two images, one real and one fake.
GANs perform this process iteratively, with the generator producing fake data and improving
it in each iteration based on feedback from the discriminator, while the discriminator gets
better at distinguishing between real and fake data.
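The back-and-forth between generator and discriminator can be sketched with a deliberately tiny example that works on single numbers instead of images. Everything here is an assumption made for illustration: the real data is drawn from a bell curve centred on 4.0, the generator has one adjustable offset `theta`, the discriminator is a single logistic unit, and the function name `train_toy_gan` is hypothetical.

```python
import math
import random

def sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)

def train_toy_gan(steps=500, lr=0.05, real_mean=4.0, seed=0):
    rng = random.Random(seed)
    theta = 0.0        # generator parameter: fake sample = noise + theta
    w, b = 0.0, 0.0    # discriminator parameters: D(x) = sigmoid(w * x + b)
    history = []
    for _ in range(steps):
        real = rng.gauss(real_mean, 0.5)      # a sample from the training data
        fake = rng.gauss(0.0, 0.5) + theta    # the generator's output

        # Discriminator step: raise D(real) and lower D(fake).
        d_real, d_fake = sigmoid(w * real + b), sigmoid(w * fake + b)
        w += lr * ((1 - d_real) * real - d_fake * fake)
        b += lr * ((1 - d_real) - d_fake)

        # Generator step: use the discriminator's feedback to raise D(fake).
        d_fake = sigmoid(w * fake + b)
        theta += lr * (1 - d_fake) * w
        history.append(theta)
    return theta, (w, b), history

theta, (w, b), history = train_toy_gan()
print(f"generator offset after training: {theta:.2f} (real mean is 4.0)")
```

In a toy like this the offset typically drifts towards the real mean and may keep oscillating around it rather than settling, since simple GAN training is not guaranteed to converge. Practical GANs use deep networks and additional stabilising techniques.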
To illustrate this concept, consider the encoder as a translator that converts the input data into a
simpler language known as the latent space. This encoding process provides a range of
interpretations rather than a single translation, allowing for multiple representations of the
data.
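The "range of interpretations" idea can be sketched as follows, assuming the passage describes the encoder of a variational autoencoder (VAE). The `encode` function below is a hypothetical stand-in with fixed formulas; a real encoder is a trained neural network.

```python
import math
import random

def encode(data_point):
    """Hypothetical encoder: map an input to a latent mean and log-variance.

    A real VAE learns this mapping; fixed formulas are used here for illustration.
    """
    mean = sum(data_point) / len(data_point)
    log_var = -1.0
    return mean, log_var

def sample_latent(mean, log_var, rng):
    """Draw one latent code: z = mean + std * noise, so each call can differ."""
    std = math.exp(0.5 * log_var)
    return mean + std * rng.gauss(0.0, 1.0)

rng = random.Random(0)
mean, log_var = encode([2.0, 4.0, 6.0])
codes = [sample_latent(mean, log_var, rng) for _ in range(3)]
print(mean, codes)
```

The same input yields several nearby latent codes instead of one fixed code, which is what lets a decoder produce varied outputs from a single input.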
• GANPaint
GANPaint enables users to manipulate images by directly drawing on them. These drawings
trigger the GAN to generate realistic modifications by manipulating neurons with each brush
stroke. Users can add or remove objects such as trees, doors, and grass, with each object
corresponding to specific neural activations. It also enables users to modify
attributes, such as adding windows to buildings.
GANPaint operates by selectively activating and deactivating neurons in a deep network that
correspond to objects and attributes within the image. This modification process generates a
new output image based on the user's interactions.
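The idea of switching neurons on and off to add or remove objects can be sketched with a toy model. This is a heavy simplification and not GANPaint's actual implementation: the mapping `UNIT_OBJECTS`, the threshold of 0.5, and the functions `render` and `set_unit` are all hypothetical, and the "image" is just a list of object names.

```python
# Hypothetical mapping from hidden units to the objects they draw.
UNIT_OBJECTS = {0: "tree", 1: "door", 2: "grass"}

def render(activations):
    """Toy 'generator' output: list every object whose unit is active."""
    return [UNIT_OBJECTS[i] for i, a in enumerate(activations) if a > 0.5]

def set_unit(activations, unit, on):
    """Return a copy of the activations with one unit switched on or off."""
    edited = list(activations)
    edited[unit] = 1.0 if on else 0.0
    return edited

scene = [1.0, 0.0, 1.0]                       # tree and grass are present
print(render(scene))                          # ['tree', 'grass']
print(render(set_unit(scene, 0, on=False)))   # ['grass']
print(render(set_unit(scene, 1, on=True)))    # ['tree', 'door', 'grass']
```

In a real GAN, the units sit inside a deep network and the output is a full image, but the editing principle is comparable: change selected activations, then regenerate the output.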
Generative AI Tools
Generative AI tools are software applications and platforms that utilise generative AI
techniques to create new content, solutions, or data. Some of the most popular Generative
AI tools at the moment include ChatGPT, DALL-E, and Google Gemini. These tools excel in
generating text, images, and engaging in human-like conversations. They demonstrate the
remarkable capabilities of AI in producing creative and contextually relevant content across
various media. Given here are short descriptions of some of the most popular Generative AI
tools.
Gemini
4. GANPaint: Developed by the MIT-IBM Watson AI Lab in 2019, this tool utilises GAN
(Generative Adversarial Network) technology for image editing. It enables users to
manipulate and create scenes using advanced AI capabilities.
5. Midjourney: Developed by Midjourney AI, this tool generates high-quality images from
textual descriptions. Midjourney is renowned for its ability to produce visually stunning and
artistically inspired content based on descriptive input.
Artbreeder: Developed by Joel Simon, this popular AI art application is used by artists to create
new images by leveraging various GAN models. Users can select and combine different
models to generate new and unique visual compositions.
9. Notion AI: Developed by Notion, this text-based generative tool combines various features
to enhance productivity in the workspace, such as note-taking, database management, and
project organisation.
10. Galileo.ai: Galileo.ai is a text-to-UI generative model designed to create user interface
designs based on textual prompts. This tool aims to boost the creativity and efficiency of
designers by automating the UI design process.
Ice-Breaker Activity 1
Visit the link: https://gemini.google.com/app. Use Gemini to learn how to write a prompt to
generate your desired output. Write the prompt you used and your findings. Design your
dream home. Write down the innovations that you would like to see in your future home.
Benefits of Generative AI
Generative AI has numerous benefits across different industries, transforming how we create
content and personalise user experiences. It automates tasks, boosts creativity, and
discovers innovative solutions, making it indispensable in today's competitive world. These
advantages have diverse applications, revolutionising business operations and customer
interactions, resulting in increased productivity and efficiency. Some specific benefits
include:
Content Generation
Generative AI is an invaluable tool for creating various forms of content, including text,
images, music, videos, and even code. It empowers users to generate creative, unique, and
novel content by suggesting ideas and producing outputs, thereby enhancing creativity and
efficiency. Additionally, it automates content creation processes, enabling users to
concentrate on strategic tasks while maintaining a consistent brand voice and style. For
example, marketers and sales specialists use generative AI for basic text creation and
copywriting.
Creativity Enhancement
Personalisation
Generative AI enables the creation of content and recommendations tailored to the individual
needs and preferences of users and customers. This enhances user experience and
increases customer satisfaction through targeted content delivery. For example, booking
platforms can recommend events, shows, and movies based on users' past activities and
purchases. Additionally, voice assistants like Siri and Alexa personalise interactions by
learning users' likes and dislikes.
Adaptive Learning
Generative AI continuously learns and adapts to new information, trends, and scenarios to
improve the quality of its outputs. It personalises the educational experience by adjusting
content and learning pathways based on real-time feedback and user interactions. This
customisation makes learning more accessible and comprehensible for students, catering to
their individual pace and understanding. For example, one student might benefit from more
illustrative examples, while another may prefer straightforward textbook definitions.
Limitations of Generative AI
Generative AI, despite its numerous advantages, possesses several limitations. Let us
understand each of these limitations in detail:
Data Dependency
Generative AI models heavily rely on the quantity and quality of data they are trained on. If
the training data is inaccurate, biased, incomplete, or of low quality, the outputs of the AI will
also be of low quality. This dependency can lead to inaccuracies, biases, or inappropriate
content generation. For example, a generative AI trained on biased data may produce
outputs that exclude certain groups.
Lack of Understanding
Generative AI models do not possess genuine understanding of the content they generate,
despite their advanced capabilities. They simply analyse patterns in data and produce
outputs based on these patterns, but they lack the ability to understand the context and
meaning of what they generate. This can result in outputs that are inappropriate or
nonsensical. For example, a generative AI might create a grammatically correct sentence
that is contextually irrelevant to the topic.
Ethical Concerns
Generative AI raises several ethical concerns, including intellectual property issues and the
potential for misuse. AI-generated content can be used to infringe on copyright laws, create
deepfakes, and spread rumours and misinformation. These ethical concerns must be
addressed and carefully considered to prevent harm. For example, deepfake technology has
been used to create false information about popular public figures, damaging their
reputations.
Generative AI incurs significant computational costs, requiring substantial resources for both
training and operation. This can have environmental implications, as energy consumption
associated with these processes contributes to carbon emissions. Additionally, these costs
can pose a barrier to entry for smaller organisations. For example, the extensive
computational power required to train models like GPT-4 is costly and often not feasible for
all businesses.
Generative AI is a valuable tool for content creation; however, it requires constant human
input and oversight to refine content to meet desired standards and avoid errors. Continuous
review, editing, and adjustment of the content are necessary. This dependency on human
intervention can diminish the benefits of automation and content generation. For example,
when generating an image of a tree, detailed input is essential, followed by ongoing
refinements to meet expectations. In some cases, the generated content may need to be
discarded if it fails to meet these standards.
With the increasing popularity and advancement of generative AI, it is crucial to address and
resolve ethical concerns promptly.
The primary ethical issues of generative AI include authenticity, ownership, and misuse. One
significant implication is the potential for AI-generated content to deceive or manipulate the
public by blurring the lines between what is real and what is fake.
This poses risks in areas such as misinformation, where AI can be used to create convincing
fake news or forged documents. Additionally, there are concerns about intellectual property
rights and the ownership of AI-generated creations, raising questions about who holds
responsibility for content produced by machines.
Addressing these issues involves ensuring transparency, consent, and accountability as AI is
deployed across various sectors, from art and entertainment to healthcare, to mitigate these
risks and to maximise benefits responsibly. Let us look at some of these ethical
implications and negative impacts of Generative AI in detail.
Bias and Discrimination: Generative AI models trained on biased datasets can amplify
existing societal biases, potentially leading to unfair outcomes or discrimination. For
example, biased language models may generate offensive or discriminatory content. It is
crucial to ensure that AI systems treat all users fairly, regardless of race, gender, or other
characteristics. Fairness considerations involve designing algorithms that mitigate biases
and promote equitable outcomes.
Privacy: Another major concern regarding Generative AI is its potential to use personal data
to create hyper-personalised content, raising questions about data security. Protecting
individuals' identities and personal information becomes increasingly challenging as AI
techniques evolve.
Copyright and Legal Exposure: Generative AI tools trained on large datasets from various
sources, including the internet, may inadvertently generate content that infringes upon
copyright. Users may unknowingly face legal exposure if the generated content resembles
copyrighted material.
Misuse and Manipulation: Generative AI, while offering societal benefits, can also be
misused. The propagation of misinformation and fake content, facilitated by generative AI,
poses serious risks. This includes creating fake news, impersonating individuals through
deepfakes, and other forms of manipulation that can deceive the public and cause harm.
AI Fact File!
The term 'deepfake' is derived from 'deep learning' and 'fake'. It refers to the use of
generative AI techniques to manipulate audiovisual content, typically to depict someone
saying or doing something they have not actually said or done.
It is crucial to understand and learn more about the potential negative impacts of Generative
AI on society. These risks and impacts are already causing major disruption in society
and are likely to grow in the future. Let us examine some of these impacts.
• Facilitation of Plagiarism: Generative AI can generate content that mimics original works,
raising
• Creation and Spread of Fake News and Misinformation: Generative AI can be exploited to
create and disseminate false information, influencing public opinion and potentially
destabilising societies.
• Dependency on AI for Content Generation: Relying on AI models for content creation may
hinder individual creativity and innovation.
There must be proper regulation and accountability to ensure a balance between innovation
and deployment. It is essential to enforce robust regulatory frameworks and mechanisms to
maintain accountability. Achieving this requires collective efforts from policymakers,
technology developers, and society at large. Here are a few ways in which generative AI can
be used responsibly.
• Ethical guidelines and standards must be established to govern the development and
deployment of Generative AI technologies, along with the adoption of transparent practices.
This will help minimise the negative effects and potential harms of generative AI, while
maximising its advantages and fostering innovation in AI technologies, safeguarding
individuals' rights, privacy, and societal well-being.
• It is crucial to use diverse and representative training data that is fair and unbiased.
Outputs generated by AI models must be scrutinised and periodically checked for bias and
misinformation. It is also important to ensure that the model does not generate hallucinated
or nonsensical outputs.
AI Fact File!
In generative AI, a hallucination occurs when the model produces content that is not based
on facts or present in its training data. These hallucinations may happen in any form, such
as images, text, or other creative outputs. For example, an AI model asked to generate an
image of a tiger may provide an image of a tiger with a trunk like an elephant or three ears.
• It is crucial to maintain the privacy and consent of users and ensure that the rights of
creators are not violated when generating content. Proper attribution and clear guidelines on
ownership should be provided.
Multimodal Generative AI
Generative AI has made significant strides in creating diverse types of content such as text,
video, images, music, and code. However, current models typically specialise in generating
one type of content at a time: text-based models generate text, while image-based models create
visual content. Engineers are now advancing towards multimodal generative AI, which aims
to generate multiple types of content simultaneously. For example, the third version of the
text-to-image tool DALL-E can embed high-quality textual descriptions within its generated
images. This capability provides it with a competitive edge over other image-generating
models that lack the ability to incorporate textual context directly into their visual outputs.
Fig. 4.17 DALL-E 3
Enhanced Creativity
One of the limitations of Generative AI is its restricted scope of creativity, often constrained
by the patterns in its training data. However, as Generative AI advances, it holds the promise
of generating more creative and versatile content beyond the scope of its training data.
This evolution could potentially approach the
levels of creativity seen in human minds, allowing AI to produce complex and innovative
content across various industries. For example, in the future, AI may have the capability to
create new genres of music by blending elements from different styles in novel ways that no
human has thought of.
Interactive AI
Mustafa Suleyman, co-founder of DeepMind and CEO of Microsoft AI, has proposed the
concept of interactive AI as the next phase beyond generative AI. In this phase, bots are not
limited to simple conversation or partial task completion; they are capable of
autonomously performing entire tasks, either independently or by coordinating with other
software tools. For example, users can ask bots to create cold emails, send them to the target
audience, respond to any queries, follow up, and even schedule meetings for users. Another
emerging application is exemplified by tools like ChatGPT, which can currently assist in
writing and debugging code snippets. In the future, such AI could potentially develop entire
applications with minimal user guidance, handling tasks from initial creation through testing
and ongoing maintenance.
AI models' understanding of natural language is continually advancing, enabling them to grasp
human language more effectively. As AI continues to improve in this area, it will lead to
enhancement in its ability to comprehend and generate human-like text, facilitating more
natural and intuitive communication between machines and humans.
There is a growing emphasis on developing ethical guidelines and frameworks to ensure the
responsible use of generative AI. This effort aims to address current challenges such as
bias, misinformation, and copyright issues. The goal is to establish standards that promote
fairness, transparency, and accountability in the development and deployment of AI
technologies. Furthermore, the open-source movement enhances transparency by enabling
broader participation in discussing and implementing ethical considerations in AI
development.
DeepMind