Introduction GenAI EoAI
[email protected]
Lecture Plan
Introduction to Machine Learning
AI Terminology: Artificial Intelligence
• Artificial Intelligence (AI) is a subfield of computer science that enables machines to mimic human behaviours.
• It creates intelligent systems that can perform tasks that typically require human intelligence, such as visual perception, speech recognition, decision-making, …
• AI Terminology:
• Machine Learning
• Deep Learning
• Reinforcement Learning
• Natural Language Processing
• Generative AI
• …
AI Terminology: Machine Learning
AI Terminology: Deep Learning
AI Terminology: Deep Learning
Examples of deep learning models:
- Feed Forward Neural Networks
- Convolutional Neural Networks
- Recurrent Neural Networks
- Transformers
- …
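As a concrete illustration of the first family listed above, here is a minimal feed-forward network in PyTorch; the layer sizes (784, 128, 10) are illustrative choices, not taken from the slides:

```python
# Requires: pip install torch
import torch
import torch.nn as nn

# A minimal feed-forward network, e.g. for classifying flattened 28x28 images.
model = nn.Sequential(
    nn.Linear(784, 128),  # fully connected layer
    nn.ReLU(),            # non-linearity
    nn.Linear(128, 10),   # one logit per class
)

x = torch.randn(32, 784)  # a batch of 32 flattened images
logits = model(x)
print(logits.shape)       # torch.Size([32, 10])
```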
Machine Learning: Learning Paradigms
1. Supervised Learning
2. Unsupervised Learning
3. Reinforcement Learning
Supervised Machine Learning
Supervised vs. Unsupervised Learning
Semi-Supervised Machine Learning
Discriminative vs. Generative Machine Learning
1. Discriminative
• Classifies or predicts
• Usually trained using labeled data
• Learns a representation of the data's features conditioned on the labels, using the conditional probability P(c|d)
2. Generative
• Generates new data
• Focuses on the distribution of the dataset and returns a probability for a given example, using the joint probability: P(d ∩ c) = P(d|c) · P(c)
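A toy numeric sketch of the generative view, with all probabilities invented for illustration: score each class c via the joint P(d ∩ c) = P(d|c) · P(c), then normalize the joints to recover the discriminative quantity P(c|d).

```python
# Toy generative classifier (naive independence over words).
priors = {"spam": 0.4, "ham": 0.6}                  # P(c), invented
likelihood = {                                       # P(word|c), invented
    "spam": {"free": 0.30, "meeting": 0.01},
    "ham":  {"free": 0.02, "meeting": 0.20},
}

doc = ["free", "meeting"]

joint = {}
for c in priors:
    p = priors[c]
    for w in doc:                # P(d|c) as a product of word likelihoods
        p *= likelihood[c][w]
    joint[c] = p                 # P(d ∩ c) = P(d|c) * P(c)

total = sum(joint.values())
for c in joint:                  # normalize joints to get P(c|d)
    print(f"P({c}|d) = {joint[c] / total:.3f}")
```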
Reinforcement Learning
Generative AI is a subset of Deep Learning; Deep Learning is a subset of Machine Learning; and Machine Learning is a subset of Artificial Intelligence.
Generative AI (GenAI)
McKinsey defines Generative AI as:
• Generative AI refers to a branch of AI that focuses on creating or generating new content, such as images, text, video, synthetic data, or other forms of media, using machine learning models.
• It does this by learning patterns from existing data, then using this knowledge to generate new and unique outputs.
• When given a prompt, GenAI uses this statistical model to predict what an expected response might be, and thus generates new content.
• Image Generation is the process of using deep learning models such as VAEs, GANs, and, more recently, Stable Diffusion to create new images that are visually similar to real-world images. Image Generation can be used for data augmentation to improve the performance of machine learning models, as well as in creating art, generating product images, and more.
• Application: very successful platforms such as MidJourney and DALL-E have become a popular choice for anyone seeking to generate realistic images through Image Generation techniques.
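A minimal sketch of image generation with the Hugging Face diffusers library; the checkpoint id and the availability of a GPU are assumptions, and any compatible Stable Diffusion checkpoint would work:

```python
# Requires: pip install diffusers transformers accelerate torch
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5",   # assumed checkpoint id
    torch_dtype=torch.float16,
).to("cuda")                            # assumes a GPU is available

image = pipe("a watercolor painting of a lighthouse at dawn").images[0]
image.save("lighthouse.png")
```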
Generative AI
• Video Generation involves deep learning methods such as GANs and Video Diffusion to generate new videos by predicting frames based on previous frames. Video Generation can be used in various fields, such as entertainment, sports analysis, and autonomous driving.
• Application: platforms such as DeepBrain and Synthesia use Video and Speech Generation to create realistic video content that appears as if a human were speaking on camera.
• Data Augmentation is the process of generating new training data by applying various image transformations such as flipping, cropping, rotating, and color jittering. The goal is to increase the diversity of training data and avoid overfitting, which can lead to better performance of machine learning models (see the sketch after this slide's bullets).
• Application: Synthesis AI simplifies the process of building and optimizing machine learning models by providing a platform for creating AI models using automated machine learning techniques.
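The transformations named above map directly onto torchvision; a minimal sketch with illustrative parameter values:

```python
# Requires: pip install torchvision pillow
from PIL import Image
from torchvision import transforms

# Exactly the augmentations named on the slide: flip, crop, rotate, color jitter.
augment = transforms.Compose([
    transforms.RandomHorizontalFlip(p=0.5),
    transforms.RandomCrop(size=200),          # assumes the image is >= 200x200
    transforms.RandomRotation(degrees=15),
    transforms.ColorJitter(brightness=0.2, contrast=0.2, saturation=0.2),
])

img = Image.open("example.jpg")               # any local image
for i in range(4):                            # four augmented variants
    augment(img).save(f"augmented_{i}.jpg")
```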
Why GenAI Now?
Natural Language Processing
• Natural Language Processing (NLP) is a field at the intersection of:
• Computer Science
• Artificial Intelligence (ML, DL, GenAI)
• Linguistics
Why is NLP hard?
• Ambiguity
• Scale
• Variation
• Expressivity
• Unknown representation
Ambiguity
• Ambiguity at multiple levels:
• Referential ambiguity: Alice invited Maya for dinner but she cooked her own food (she = Alice or Maya?)
Scale and Variation
• ~7K languages
• Thousands of language varieties
• Variation across domains (news, biomedical, historical, …)
Expressivity
• Not only can one form have different meanings (ambiguity), but the same meaning can be expressed with different forms:
• She gave the book to Aria vs. She gave Aria the book
• Is that door still open? vs. Please close the door
Unknown Representation
• It is very difficult to capture the representation of text or speech, since we do not even know how to represent the knowledge a human needs.
Large Language Models (LLMs)
Large, general-purpose language models can be pre-trained and then fine-tuned for specific purposes.
Large Language Models – Architecture
• Encoder
• Decoder
• Encoder-decoder
https://arxiv.org/abs/1706.03762
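Each family has well-known open examples; a minimal sketch loading one of each via Hugging Face transformers (the checkpoint names are standard public models, chosen here purely for illustration):

```python
# Requires: pip install transformers torch sentencepiece
from transformers import AutoModel, AutoModelForCausalLM, AutoModelForSeq2SeqLM

encoder_only = AutoModel.from_pretrained("bert-base-uncased")        # encoder
decoder_only = AutoModelForCausalLM.from_pretrained("gpt2")          # decoder
encoder_decoder = AutoModelForSeq2SeqLM.from_pretrained("t5-small")  # both
```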
Timeline of Language Models Evolution: 2018-2023
https://arxiv.org/abs/2304.13712
Large Language Models – Characteristics
https://arxiv.org/abs/2304.13712
How do Transformer-based LLMs Work?
A simplified view of the LLM training process
Input: "Je suis étudiant" → Output: "I am a student"
https://www.youtube.com/watch?v=t45S_MwAcOw
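The example above can be reproduced with a pretrained encoder-decoder translation model; a minimal sketch, assuming a standard Marian French-to-English checkpoint on the Hub:

```python
# Requires: pip install transformers torch sentencepiece
from transformers import pipeline

# Assumed checkpoint: a standard Marian French-to-English model.
translator = pipeline("translation", model="Helsinki-NLP/opus-mt-fr-en")
print(translator("Je suis étudiant")[0]["translation_text"])
# Expected output along the lines of: "I am a student"
```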
Generative Pre-trained Transformer (GPT) – Architecture (Decoder-Only)
Large Language Models – Training
Analogy: training a dog. General training comes first; specialized training (police dog, hunting dog) builds on it.
A similar idea applies to Large Language Models (LLMs).
Pre-training or Self-supervised Learning
• The model at the start:
• Zero knowledge about the world
• Cannot form English words (it has no language skills yet)
• Learning objective: next-token prediction
• Giant corpus of text data:
• Often scraped from the internet ("unlabeled")
• Self-supervised learning
• After training:
• Learns language
• Learns knowledge
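A toy sketch of the next-token objective, using simple bigram counts rather than a neural network: raw text supervises itself, since the training target for each token is just the token that follows it.

```python
from collections import Counter, defaultdict

# Raw, unlabeled text: each token's "label" is the token that follows it.
text = "the cat sat on the mat . the dog sat on the rug ."
tokens = text.split()

counts = defaultdict(Counter)
for current, nxt in zip(tokens, tokens[1:]):
    counts[current][nxt] += 1

def predict_next(token: str):
    """Most likely next token and its estimated probability."""
    following = counts[token]
    word, freq = following.most_common(1)[0]
    return word, freq / sum(following.values())

print(predict_next("sat"))  # ('on', 1.0): "sat" is always followed by "on"
print(predict_next("the"))  # one of cat/mat/dog/rug, each with p = 0.25
```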
Two Types of Large Language Models (LLMs)
How to use LLMs?
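One direct way to use an LLM is to run a small open checkpoint locally; a minimal sketch with the Hugging Face transformers pipeline (gpt2 is chosen purely as a small illustrative model, not a recommendation):

```python
# Requires: pip install transformers torch
from transformers import pipeline

generator = pipeline("text-generation", model="gpt2")
out = generator("Generative AI is", max_new_tokens=30)
print(out[0]["generated_text"])
```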
Why Fine-tuning?
What does fine-tuning do for your model?
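A minimal sketch of one lightweight form of fine-tuning, assuming a toy sentiment task: freeze a pre-trained encoder and train only a small classification head on top. (Full fine-tuning would instead update all weights; the checkpoint and data here are illustrative.)

```python
# Requires: pip install transformers torch
import torch
import torch.nn as nn
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
encoder = AutoModel.from_pretrained("bert-base-uncased")
for p in encoder.parameters():      # freeze the pre-trained weights
    p.requires_grad = False

head = nn.Linear(encoder.config.hidden_size, 2)   # 2 classes: neg / pos
optimizer = torch.optim.Adam(head.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()

texts = ["great movie", "terrible film"]          # toy labeled examples
labels = torch.tensor([1, 0])

for epoch in range(3):
    batch = tokenizer(texts, padding=True, return_tensors="pt")
    features = encoder(**batch).last_hidden_state[:, 0]   # [CLS] vector
    loss = loss_fn(head(features), labels)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    print(f"epoch {epoch}: loss = {loss.item():.3f}")
```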
Prompting is revolutionizing AI application development
• Supervised learning (fine-tuning): get labeled data (1 month) → train AI model (3 months) → deploy the model (3 months)
• Prompt-based AI: specify prompt (minutes/hours) → deploy the model (hours/days)
What is Prompt Engineering?
• Prompt engineering is the practice of designing and refining specific text prompts to guide generative AI models, such as Large Language Models (LLMs), in generating desired outputs.
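A minimal sketch of what designing and refining a prompt looks like in practice; the generate function is a hypothetical placeholder for any LLM call, not a real API:

```python
def generate(prompt: str) -> str:
    """Hypothetical placeholder; plug in a real LLM call (API or local)."""
    raise NotImplementedError

review = "The battery dies after an hour, but the screen is gorgeous."

# Vague prompt: the model must guess the task and the output format.
vague_prompt = f"What about this review? {review}"

# Engineered prompt: role, task, allowed labels, and format are explicit.
engineered_prompt = (
    "You are a product analyst. Classify the sentiment of the review "
    "between the <review> tags as positive, negative, or mixed, then "
    "give one pro and one con.\n"
    "Answer in exactly three lines: Sentiment:, Pro:, Con:.\n"
    f"<review>{review}</review>"
)
```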
Text-to-Text Foundation Models since GPT-3
*only LLMs with >1B parameters & EN as the main training language are shown.
*Comprehensive list: https://crfm.stanford.edu/helm/v1.0/?models=1
Models Access
All model components are publicly available: Only research paper or blog is available and
• Limited access falls somewhere
• Open source code in between open and closed. may include overview of
• Training data
•Training data
• Sources and their distribution • The access can be via API or •Architecture and training details (including
• Data pre-processing and curation steps through a review process of call infrastructure)
•Model weights for research proposals and then •Evaluation results
•Paper or blog summarizing granting approved proposals •Adaptation to the model
• Architecture and training details limited model access. • Safety filters
• Evaluation results
• Training with human feedback
• Adaptation to the model
• Safety filters
• Training with human feedback
https://crfm.stanford.edu/helm/v1.0/?models=1 56
Text-to-Text Foundation Models since GPT-3
* Hugging Face has become the de facto hub for open-source ML.
https://crfm.stanford.edu/helm/v1.0/?models=1
To recap
How Are LLMs Built?
https://arxiv.org/pdf/2402.06196
Web-based vs. Software Application Use of LLMs
Benefits of using Large Language Models
1. A single model can be used for different tasks
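A minimal sketch of that point, assuming an instruction-tuned open checkpoint (google/flan-t5-small, an illustrative choice): one model handles translation, summarization, and question answering, and only the prompt changes between tasks.

```python
# Requires: pip install transformers torch sentencepiece
from transformers import pipeline

llm = pipeline("text2text-generation", model="google/flan-t5-small")

tasks = [
    "Translate English to German: The house is wonderful.",
    "Summarize: The meeting covered budgets, hiring, and the roadmap.",
    "Answer the question: What is the capital of France?",
]
for prompt in tasks:
    print(llm(prompt)[0]["generated_text"])
```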
LLM Development vs. Traditional Development
Large Language Models – Capabilities
https://arxiv.org/pdf/2402.06196
Some Limitations of LLMs
Hallucinations
To conclude
Questions Everyone Asks
Reading
• https://www.cloudskillsboost.google/journeys/118/course_templates/536
• https://cloud.google.com/ai/generative-ai
• https://smlbook.org/book/sml-book-draft-latest.pdf
• https://www.simplilearn.com/tutorials/artificial-intelligence-tutorial/what-is-generative-ai
Thank You!