Generative AI: An Overview

This document provides an overview of generative AI, focusing on recurrent neural networks (RNNs) and the rise of the Transformer architecture, particularly its impact on natural language processing and machine translation. It outlines a development timeline of large language models (LLMs) such as ChatGPT and discusses their training phases, limitations, and methods for improving their responses. It also surveys current LLMs and serving frameworks.
Understanding Recurrent Neural Networks (RNNs)

RNNs are a type of neural network designed to process sequential data. These architectures were widely used for NLP tasks, speech processing, and time-series modeling.

Challenge: an RNN consumes its input one token at a time, so computation cannot be parallelized across a sequence, and long-range dependencies are hard to learn. A step-by-step sketch follows.
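To see why that matters, here is a minimal sketch of a single vanilla RNN step (an illustration with assumed toy dimensions, not code from the original slides). Each hidden state depends on the previous one, which forces token-by-token processing:

```python
import numpy as np

def rnn_step(x_t, h_prev, W_xh, W_hh, b_h):
    """One vanilla RNN step: combine the current input with the previous hidden state."""
    return np.tanh(x_t @ W_xh + h_prev @ W_hh + b_h)

# Toy dimensions (assumed): 4-dim inputs, 8-dim hidden state.
rng = np.random.default_rng(0)
W_xh = rng.normal(scale=0.1, size=(4, 8))
W_hh = rng.normal(scale=0.1, size=(8, 8))
b_h = np.zeros(8)

h = np.zeros(8)
sequence = rng.normal(size=(10, 4))  # 10 time steps of 4-dim features
for x_t in sequence:                 # inherently sequential: step t needs step t-1
    h = rnn_step(x_t, h, W_xh, W_hh, b_h)
```

Because each iteration depends on the previous hidden state, the loop cannot be parallelized across time steps; self-attention removes exactly this bottleneck.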
The Rise of Transformers: Self-Attention

In 2017, researchers at Google published the paper "Attention Is All You Need", which proposed a novel neural network architecture for sequence modeling known as the Transformer. It outperformed recurrent neural networks (RNNs) on machine translation tasks, both in translation quality and in training cost.
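The Transformer's core operation is scaled dot-product self-attention: every position attends to every other position in parallel. A minimal NumPy sketch (weights and dimensions are illustrative assumptions):

```python
import numpy as np

def self_attention(X, W_q, W_k, W_v):
    """Scaled dot-product self-attention over X with shape (seq_len, d_model)."""
    Q, K, V = X @ W_q, X @ W_k, X @ W_v
    scores = Q @ K.T / np.sqrt(K.shape[-1])          # pairwise similarities
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)   # softmax over the keys
    return weights @ V                               # each row: weighted mix of values

rng = np.random.default_rng(0)
X = rng.normal(size=(5, 16))                         # 5 tokens, 16-dim embeddings
W_q, W_k, W_v = (rng.normal(scale=0.1, size=(16, 16)) for _ in range(3))
out = self_attention(X, W_q, W_k, W_v)               # (5, 16), computed in parallel
```

Unlike the RNN loop above, all positions are processed with a few matrix multiplications, which is what cut training cost on translation tasks.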
A Timeline of Large Language Models

2019: GPT-2 (Generative Pre-trained Transformer 2)

2022: ChatGPT

2024: Meta's Llama 3, Anthropic's Claude 3, Qwen2, and Mistral's Mixtral 8x7B: larger and more powerful models

2025: DeepSeek-R1

Ongoing trend: multimodality across text, image, and video
Diving into ChatGPT

GPT stands for Generative Pre-trained Transformer:

Generative: the model produces text by next-word prediction.
Pre-trained: the LLM is pre-trained on a massive amount of text.
Transformer: the underlying architecture. The original Transformer was an encoder-decoder; GPT models use the decoder-only variant.
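To make next-word prediction concrete, here is a small sketch using the Hugging Face transformers library with the public gpt2 checkpoint (an illustration of the idea, not code from the slides):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

prompt = "The Eiffel Tower is located in"
inputs = tok(prompt, return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits       # (1, seq_len, vocab_size)

# The last position's logits score every vocabulary item as the next word.
top = torch.topk(logits[0, -1], k=5)
print([tok.decode(idx) for idx in top.indices.tolist()])
```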

Why couldn't ChatGPT replace Google Search? Its knowledge is frozen at its training cutoff, and it can hallucinate plausible-sounding but false answers, so it cannot reliably serve fresh factual queries.

How was ChatGPT trained?


Large Language Models

What do LLMs essentially do? Next-word generation.

This section frames LLMs as a machine learning task and as a deep learning task, and looks at the training data they require. A generation sketch follows.
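Generation is just next-word prediction applied repeatedly. A sketch using the same gpt2 checkpoint as above (greedy decoding; my example, not from the slides):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

ids = tok("Large language models are", return_tensors="pt").input_ids
# Greedy decoding: append the highest-probability next token 20 times.
out = model.generate(ids, max_new_tokens=20, do_sample=False)
print(tok.decode(out[0], skip_special_tokens=True))
```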
Phases of LLM Training

1. Pre-training: the model sees massive amounts of text data from the internet (books, research papers, websites) and learns to predict the next word (a sketch of this objective follows the list).
2. Instruction fine-tuning: a curated question-and-answer dataset trains the model to answer questions and follow instructions; the model learns to become a helpful assistant.
3. Reinforcement Learning from Human Feedback (RLHF): aligns the output closer to human-like responses; responses are updated according to human feedback and preferences.
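A compact sketch of the pre-training objective (assumed toy setup; a linear layer stands in for the full Transformer stack): position t is trained to predict token t+1 with cross-entropy.

```python
import torch
import torch.nn.functional as F

vocab_size, seq_len, d_model = 1000, 12, 64
embed = torch.nn.Embedding(vocab_size, d_model)
lm_head = torch.nn.Linear(d_model, vocab_size)       # stand-in for a full Transformer

tokens = torch.randint(0, vocab_size, (1, seq_len))  # a batch of one toy sequence
hidden = embed(tokens)                # a real model would apply attention layers here
logits = lm_head(hidden)              # (1, seq_len, vocab_size)

# Shift by one so position t is scored against token t+1.
loss = F.cross_entropy(
    logits[:, :-1].reshape(-1, vocab_size),
    tokens[:, 1:].reshape(-1),
)
loss.backward()                       # an optimizer step would follow in training
```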
Limitations of LLMs

1. Hallucination: generating fluent but factually wrong output
2. Weak mathematical problem solving
3. Finite context window
4. Cost of training and inference
How to make LLMs respond better?

Zero-shot: give the model instructions for solving a task, with no examples.
Few-shot: give a few examples of how to solve the task.
Chain-of-Thought (CoT): for complex tasks, prompt the LLM to "think step by step".

Example prompts for each style follow.
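Concretely, the three styles differ only in the prompt text. Illustrative templates (my examples, not taken from the slides):

```python
zero_shot = (
    "Classify the sentiment of this review as positive or negative: "
    "'The battery dies in an hour.'"
)

few_shot = """Classify the sentiment of each review.
Review: 'Great screen, fast shipping.' -> positive
Review: 'Arrived broken and support ignored me.' -> negative
Review: 'The battery dies in an hour.' ->"""

chain_of_thought = (
    "A store sells pens at 3 for $4. How much do 9 pens cost? "
    "Think step by step before giving the final answer."
)
```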
Latest LLMs & Frameworks

LLMs: Mistral, Mixtral, Llama, Gemini, DeepSeek

Frameworks:
Together AI: https://www.together.ai/
Groq: https://groq.com/
Replicate: https://replicate.com/
LiteLLM: https://www.litellm.ai/
Hugging Face: https://huggingface.co/
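Most of these services expose an OpenAI-style chat API. Here is a minimal sketch using LiteLLM; the provider, model id, and key handling are illustrative assumptions:

```python
import os
from litellm import completion

os.environ["GROQ_API_KEY"] = "..."  # substitute a real key for your chosen provider

response = completion(
    model="groq/llama3-8b-8192",    # provider-prefixed model id (illustrative)
    messages=[{"role": "user", "content": "Summarize self-attention in one sentence."}],
)
print(response.choices[0].message.content)
```

LiteLLM's appeal is that switching providers only requires changing the model string.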


Generative AI Project Lifecycle
